A new agentic way to build automations

Reddit r/AI_Agents Tools

Summary

A new tool lets users screen record themselves performing a task; an agent learns the task and builds a deterministic script to automate it, with fallback handling when scripts break.

For a lot of personal automations, it is easier to show than to prompt, since we already do these tasks in our own browser or on our own computer. For example, it is easier to make a screen recording and say:

* download data by clicking this button on the dashboard,
* if the data is fine, update this Google Sheet like this,
* if the data is missing, send a message to this group on WhatsApp.

That is what I built. You screen record yourself doing a task. The agent watches, confirms the inputs, outputs, and approach of the task, learns to do it, and builds a deterministic script for it. This simplifies giving the right context to the coding agent. It also ensures that we can give all the details of the automation upfront, without having to prompt iteratively. The agent iterates to learn how to do the task.

What happens when the script breaks? It falls back to the agent. The originally learned context and the script's error logs are passed to the agent so it can finish the task and heal the script if needed.

Would love to know your thoughts. I have attached the GitHub link in the comments, along with links to some of the examples I have tried.
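
To make the fallback idea concrete, here is a minimal sketch of the "run the deterministic script first, fall back to the agent on failure" pattern described above. This is not the tool's actual code: `TaskContext`, `AgentOutcome`, `RunReport`, `run_with_fallback`, and the stub script and agent are all hypothetical names invented for illustration, and the agent here is a plain function standing in for whatever model-driven agent the real tool uses.

```python
import traceback
from dataclasses import dataclass
from typing import Any, Callable, Dict, Optional

# Hypothetical type: a script takes the task inputs and returns a result.
Script = Callable[[Dict[str, Any]], Any]


@dataclass
class TaskContext:
    """What was learned from the recording (hypothetical structure)."""
    description: str
    inputs: Dict[str, Any]


@dataclass
class AgentOutcome:
    """What the agent returns: the finished result and, optionally, a healed script."""
    result: Any
    repaired_script: Optional[Script] = None


@dataclass
class RunReport:
    result: Any
    handled_by: str            # "script" or "agent"
    error_log: Optional[str]   # traceback text if the script failed
    script: Script             # script to use on the next run


def run_with_fallback(
    script: Script,
    agent: Callable[[TaskContext, str], AgentOutcome],
    context: TaskContext,
) -> RunReport:
    """Try the deterministic script; on any error, hand over to the agent."""
    try:
        return RunReport(script(context.inputs), "script", None, script)
    except Exception:
        # Capture the error log so the agent sees why the script broke.
        error_log = traceback.format_exc()
        outcome = agent(context, error_log)
        # Keep the old script unless the agent supplied a repaired one.
        next_script = outcome.repaired_script or script
        return RunReport(outcome.result, "agent", error_log, next_script)


# --- Stubs for demonstration only (assumptions, not real integrations) ---

def broken_script(inputs: Dict[str, Any]) -> str:
    # Fails with KeyError when the expected input is missing.
    return "downloaded " + inputs["dashboard_url"]


def stub_agent(context: TaskContext, error_log: str) -> AgentOutcome:
    # A real agent would reason over the context and error log;
    # this stub just completes the task and returns a tolerant script.
    def healed(inputs: Dict[str, Any]) -> str:
        return "downloaded " + inputs.get("dashboard_url", "default dashboard")

    return AgentOutcome(result=healed(context.inputs), repaired_script=healed)


if __name__ == "__main__":
    ctx = TaskContext("Download dashboard data", inputs={"sheet_name": "daily"})
    first = run_with_fallback(broken_script, stub_agent, ctx)
    second = run_with_fallback(first.script, stub_agent, ctx)
    print(first.handled_by, second.handled_by)
```

In this sketch the first run fails in the script and is finished by the agent, and the next run uses the repaired script, so the agent is only involved when something breaks.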
