Comment by riddlemethat 4 days ago
I do this with Chrome recording and Playwright. What I need is an AI agent to meander through my product as if it were the target user and test/break things so I can pass that to my LLM to fix. Does anyone have that?
Different from our core use case, but our agents can do open-ended exploration as well. You could prompt something like "navigate to this app as a new user and try common flows" with structured outputs for findings. Session recording will show what happened. Not sure if it fully solves your problem - but happy to explore this together if you want to try it.
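To make "structured outputs for findings" concrete, here is a rough sketch of what the prompt and findings schema could look like. Everything in it is an assumption for illustration: the field names, the severity levels, and the `build_prompt` / `parse_findings` helpers are hypothetical and are not any particular agent product's API. It only shows how findings could be validated before being handed to an LLM for fixes.

```python
import json
from dataclasses import dataclass

# Hypothetical prompt template; the wording follows the suggestion above.
EXPLORE_PROMPT = (
    "Navigate to {url} as a new user and try common flows. "
    "Report each problem as a JSON object with the keys "
    "'flow', 'steps', 'expected', 'actual', and 'severity'. "
    "Return a JSON array of these objects and nothing else."
)

# Assumed severity scale; adjust to whatever your triage process uses.
SEVERITIES = {"low", "medium", "high"}


@dataclass
class Finding:
    flow: str          # e.g. "signup"
    steps: list        # ordered steps the agent took
    expected: str      # what a user would expect to happen
    actual: str        # what actually happened
    severity: str      # one of SEVERITIES


def build_prompt(url: str) -> str:
    """Fill the exploration prompt with the app URL."""
    return EXPLORE_PROMPT.format(url=url)


def parse_findings(raw: str) -> list:
    """Parse and validate the agent's JSON output into Finding objects."""
    items = json.loads(raw)
    if not isinstance(items, list):
        raise ValueError("expected a JSON array of findings")
    findings = []
    for item in items:
        # Raises TypeError if a key is missing or unexpected.
        finding = Finding(**item)
        if finding.severity not in SEVERITIES:
            raise ValueError(f"unknown severity: {finding.severity!r}")
        findings.append(finding)
    return findings
```

Validated findings like these can then be serialized back to JSON and passed to the LLM that proposes fixes, with the session recording as supporting evidence for each one.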