Tag
Meituan's GN06 team officially launched AI browser Tabbit 1.0, which integrates multiple top large language models, supports automatic execution of complex tasks across software and web pages, and adds a memory function.
Tencent launches WorkBuddy, an AI agent that autonomously completes complex tasks instead of just answering questions, with a demo showing parallel agent execution.
Describes an AI tool that automates multi-tool workflows and executes tasks, waiting for user approval, instead of just providing instructions.
The author integrated an AI employee into Slack that autonomously executed and completed a weekly task, showcasing its high capability.
Introduces Teach VLM, a model that extracts step-by-step operational knowledge from mobile screen demonstrations, and the Teach-and-Repeat paradigm that uses this knowledge to guide GUI agents, achieving state-of-the-art performance on a new benchmark.
Most companies mistakenly automate tasks rather than decisions, missing out on significant ROI. By automating decisions that require human judgment, such as lead scoring and support triage, companies can save hours of senior time daily.
Kimi launched a new AI office product, Kimi Work, which inherits the capabilities of Kimi Code and Kimi Agent, enabling up to 300 agents to collaborate simultaneously on tasks, aiming to provide workers with a command-line-free automated office experience.
Arena Agent Mode enables autonomous AI agents to complete real-world tasks.
PhoneWorld is a pipeline that transforms real GUI trajectories into controllable mobile environments, enabling scalable creation of phone-use benchmarks. It covers 34 apps across 16 domains and shows that using its supervision improves performance on multiple evaluation benchmarks.
Manus announces Scheduled Tasks 2.0, a major upgrade allowing recurring work to run with context within the same task, power background actions in web apps, and provide clearer visibility. Available now.
Agent-Sin is an AI agent that automates repeated tasks using reusable skills, aimed at boosting productivity.
A discussion on whether AI agents are finally transitioning from chat-based interactions to autonomously performing real-world tasks like customer support and subscription cancellations, questioning if practical implementation has arrived or remains in early stages.
This paper introduces MCP-Cosmos, a framework that integrates generative world models into the Model Context Protocol ecosystem to enhance agent planning and execution through predictive simulation in latent space.
Codex introduces the /goal command, which lets the AI autonomously work toward a defined end state, streamlining long-running tasks like refactors, migrations, and retry loops.
Pazi offers specialized AI agents that connect to users' existing tools to automate various tasks.