A new agentic way to build automations

Reddit r/AI_Agents 06/08/26, 05:44 AM Tools

Summary

A new tool lets users screen record themselves performing a task; an agent learns the task and builds a deterministic script to automate it, with fallback handling when scripts break.

For a lot of personal automations, it is easier to show than prompt since we already do them on our own browser/computer. For example, it is easier to do a screen recording and say, * download data by clicking on this button on the dashboard, * if the data is fine, update this google sheet like this, * if the data is missing send a message to this group on whatsapp. That is what I built. You can screen record yourself doing a task. The agent watches, confirms inputs and outputs and approach of the task, learns to do the task and builds a deterministic script to do it. This simplifies giving the right context to the coding agent. This also also ensures that we are able to give all the details of the automation upfront without having to iteratively prompt. The agent iterates to learn how to do the task. What happens when the script breaks? It fallbacks to the agent. It passes the originally learnt context, script error logs to finish the task and heal the script if needed. Would love to know your thoughts about. Attached the github link the comments and link to some of the examples, I have tried.

Original Article

Similar Articles

Been recording my repetitive tasks and turning them into agent skills. Sharing the tool.

Reddit r/AI_Agents

A tool that records repetitive tasks via screen recording and compiles them into reusable agent skills following the agentskills.io standard, shipping as an MCP server for cross-platform use.

@dabit3: Now that agents can act, we ask: when should they run, what can they touch, how is their work checked, and what context…

X AI KOLs Following

The author proposes Automation Engineering as a discipline for designing triggers, guardrails, and success checks to make AI agents safe and reliable without constant human oversight.

The future of automation is going to be the following : screenshot->think->respond->screenshot

Reddit r/AI_Agents

The author shares a personal experience of using harness engineering and free NVIDIA NIM endpoints to build AI agents capable of automating complex multi-step workflows, such as creating AI video series and hotel booking systems, suggesting that web crawling services can be replaced by free frontier models.

I built an agent that records everything your agents actually do

Reddit r/AI_Agents

Built Trovis, a tool that records and explains agent actions in production, showing deviations from expected behavior.

Built an agent that actually uses your computer. Here's what works and what doesn't.

Reddit r/AI_Agents

The author describes Clark Agent, an AI agent that performs actions on the user's computer (browser, email, calendar, files), sharing what works well and common pain points like CAPTCHAs and brittle long workflows.

Similar Articles

Been recording my repetitive tasks and turning them into agent skills. Sharing the tool.

@dabit3: Now that agents can act, we ask: when should they run, what can they touch, how is their work checked, and what context…

The future of automation is going to be the following : screenshot->think->respond->screenshot

I built an agent that records everything your agents actually do

Built an agent that actually uses your computer. Here's what works and what doesn't.

Submit Feedback