desktop-automation

Tag

Cards List
#desktop-automation

@shedntcare_: China just released a desktop automation agent that runs 100% locally. It can run any desktop app, open files, browse w…

X AI KOLs Timeline · yesterday Cached

China released an open-source desktop automation agent that runs 100% locally, capable of controlling desktop apps, files, and browsing without internet.

0 favorites 0 likes
#desktop-automation

@billtheinvestor: ByteDance open-sources UI-TARS Desktop (3.6k stars). Core logic: 100% local execution, pixel-only, no API calls. Compared to OpenAI/Anthropic cloud-based approaches, it solves two pain points: 1. Data privacy (data stays on machine); 2. Zero-cost zero-latency (no API fees). Build private…

X AI KOLs Following · 2026-06-16 Cached

ByteDance open-sources UI-TARS Desktop, a 100% local desktop automation tool that operates purely on pixels with no API calls, resolving the two major pain points of data privacy and API costs, providing an efficient open-source solution for building private automation workflows.

0 favorites 0 likes
#desktop-automation

@GitTrend0x: A pure local desktop automation powerhouse, and most importantly, saves money! https://github.com/microsoft/fara This is Fara-7B, an efficient Computer Use Agent small model from Microsoft! In a word, it surpasses traditional large model CUA: only 7B parameters...

X AI KOLs Timeline · 2026-06-15 Cached

Microsoft launches Fara-7B, an efficient Computer Use Agent with only 7B parameters, surpassing larger models on web tasks, supporting pure local deployment, and achieving low-cost desktop automation.

0 favorites 0 likes
#desktop-automation

ProCUA-SFT Technical Report

Hugging Face Daily Papers · 2026-06-15 Cached

ProCUA-SFT is a large-scale synthetic dataset of 3.1M step-level SFT samples for training computer-use agents, produced via an automated pipeline using a single VLM (Kimi-K2.5). Fine-tuning UI-TARS 7B on it achieves 45.0% on OSWorld, an 18.7 point improvement over the base model.

0 favorites 0 likes
#desktop-automation

@nini_incrypto_: Microsoft's recent practical release lets a 7B model take over your mouse and keyboard! FARA abandons pointless chat and focuses purely on local desktop automation. Its core advantages boil down to two words: obedient and cost-effective. 1. Pure desktop execution: opens web pages, fills forms, and automatically runs all repetitive mechanical workflows. 2. ...

X AI KOLs Timeline · 2026-06-14 Cached

Microsoft has released Fara-7B, a small 7B-parameter language model focused on pure local desktop automation. It can directly take over your mouse and keyboard to execute repetitive workflows, with low cost and no need for internet connectivity.

0 favorites 0 likes
#desktop-automation

Launch HN: Minicor (YC P26) – Windows desktop automations at scale

Hacker News Top · 2026-05-26 Cached

Minicor is a Y Combinator-backed platform that deploys self-healing AI agents for scalable desktop automations, enabling integration with legacy systems lacking APIs.

0 favorites 0 likes
#desktop-automation

@quanruzhuoxiu: When using Midscene's Computer Agent, desktop automation runs headless in Linux CI. Everyone assumes desktop UI automation must use a real machine or VM, so Mac/Windows desktop E2E can only run locally and cannot enter CI. Result...

X AI KOLs Timeline · 2026-05-24 Cached

Midscene's Computer Agent enables desktop UI automation to run headless in Linux CI, automated via xvfb-run, without needing a real machine or VM, and supports Electron, Qt, and GTK applications.

0 favorites 0 likes
#desktop-automation

@QingQ77: An operational Agent runtime built for llama.cpp local inference models, allowing local models to execute real-world tasks like browser, file, and Shell operations like a desktop operator https://github.com/AtomicBot-ai/atomic-agent… Atom…

X AI KOLs Timeline · 2026-05-22 Cached

Atomic-Agent is a desktop operation Agent designed for llama.cpp local inference models, optimizing the runtime architecture to enable small local models to reliably execute multi-step desktop tasks.

0 favorites 0 likes
#desktop-automation

IrisGo, a startup backed by Andrew Ng, looks to become the AI desktop buddy you never knew you needed

TechCrunch AI · 2026-05-20 Cached

IrisGo, backed by Andrew Ng, launches an AI desktop companion that learns user workflows and automates repetitive tasks on-device for privacy, targeting knowledge workers.

0 favorites 0 likes
#desktop-automation

what non-coding tasks have you gotten a local model to do autonomously?

Reddit r/LocalLLaMA · 2026-05-19

The author discusses building a small VLM for desktop GUI automation to move data between apps without APIs, expressing interest in non-coding autonomous use cases for local models.

0 favorites 0 likes
#desktop-automation

OpenComputer: Verifiable Software Worlds for Computer-Use Agents

Hugging Face Daily Papers · 2026-05-19 Cached

OpenComputer presents a framework for creating verifiable software environments for computer-use agents, integrating state verifiers, self-improving verification layers, task synthesis, and evaluation systems across 33 desktop applications. Experiments show its verifiers align better with human judgment than LLM-as-judge, and frontier agents struggle with end-to-end completion.

0 favorites 0 likes
#desktop-automation

Codex will soon be able to control other desktop devices via Computer Use (2 minute read)

TLDR AI · 2026-05-18 Cached

OpenAI is developing a feature for Codex to control macOS applications via Computer Use even when the laptop is locked or asleep, and to remotely operate other desktop devices running the Codex app, extending its remote control capabilities.

0 favorites 0 likes
#desktop-automation

My CLI now controls my entire desktop, whats a good test to see if it works really good.

Reddit r/AI_Agents · 2026-05-15

A user describes a CLI tool that controls the entire desktop via hybrid mouse, keyboard, and screenshot methods, successfully performing tasks like sending email screenshots and remote desktop control. They seek challenging tests to validate its robustness.

0 favorites 0 likes
#desktop-automation

🤔 How do we secure local desktop automation in AI workflows? (Review & Beta Testing)

Reddit r/AI_Agents · 2026-05-13

MountainDesk is a local-first tool that bridges AI model inference with desktop automation, offering features like system state anchors, multi-agent orchestration, and background monitoring. The creator seeks feedback on security and workflow integration.

0 favorites 0 likes
#desktop-automation

@Teknium: Give our early preview of Computer Use (with ANY model) a try today! Built into the latest Hermes Agent and powered by …

X AI KOLs Following · 2026-05-11

Teknium introduces an early preview of Computer Use built into the Hermes Agent and powered by TryCua, enabling any AI model to interact with and control a desktop environment in the background without overriding direct user input.

0 favorites 0 likes
#desktop-automation

@intheworldofai: Hermes Agent + AionUi basically turns your computer into an Agentic AI Operating System. Multiple autonomous AI agents …

X AI KOLs Timeline · 2026-05-11 Cached

将 Hermes Agent 与 AionUI 结合,可将个人电脑升级为支持多智能体并行、具备长期记忆与自我进化能力的 Agentic AI 操作系统,实现从数据分析、文件管理到代码编写的全自动化本地工作流。

0 favorites 0 likes
#desktop-automation

Accessibility API and Set-of-Marks: making computer-use agents more reliable

Reddit r/ArtificialInteligence · 2026-05-11

The article introduces Opendesk, an open-source tool that enhances the reliability of computer-use agents by leveraging native accessibility APIs to identify interactive elements, replacing error-prone pixel-coordinate guessing.

0 favorites 0 likes
#desktop-automation

@VincentLogic: Found an incredible open-source desktop AI tool from ByteDance! UI-TARS Desktop, with 31k stars, truly lives up to the hype. It can actually understand your screen and automate computer operations for you. Just tell it "Enable auto-save in VS Code and set the delay to 500ms", and it will automatically: -…

X AI KOLs Timeline · 2026-05-11

ByteDance's open-source desktop AI automation tool, UI-TARS Desktop, supports local execution and screen visual understanding. It can autonomously control your computer to handle daily tasks through natural language commands.

0 favorites 0 likes
#desktop-automation

@GitTrend0x: A Killer Open-Source Gem for 100% Local Desktop AI Agents https://github.com/bytedance/UI-TARS-desktop… This is UI-TARS-desktop, a multi-modal desktop automation agent open-sourced by ByteDance with 31k stars! …

X AI KOLs Timeline · 2026-05-09 Cached

UI-TARS-desktop is a highly popular open-source tool by ByteDance that enables 100% local multimodal desktop automation, allowing users to control apps and browsers via natural language without cloud data leaks.

0 favorites 0 likes
#desktop-automation

@MiguelMaestroIA: China does it again! It has open-sourced a desktop agent that sees your screen and runs 100% locally Screen/mouse/keybo…

X AI KOLs Timeline · 2026-05-08 Cached

China has open-sourced a desktop AI agent that can see the screen and control mouse/keyboard via natural language, running entirely locally without cloud dependency.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback