web-browsing

#web-browsing

Real Chrome. Live DOM. MCP tools. Multi tab control. Visible progress.

Reddit r/openclaw ↗ · 17h ago

A new tool enabling AI agents to browse the web using a real Chrome instance with live DOM access, MCP tools, and multi-tab control.

0 favorites 0 likes

#web-browsing

@gdb: Codex can now drive Chrome tabs in the background:

X AI KOLs Following ↗ · 3d ago

Codex has been updated to allow driving Chrome tabs in the background, enabling automated web tasks without active user supervision.

0 favorites 0 likes

#web-browsing

Deep research System Card

OpenAI Blog ↗ · 2025-02-25 Cached

OpenAI launches Deep Research, an agentic capability powered by an early version of o3 that conducts multi-step internet research for complex tasks, with comprehensive safety testing and privacy protections implemented before rollout to Pro users.

0 favorites 0 likes

#web-browsing

Computer-Using Agent

OpenAI Blog ↗ · 2025-01-23 Cached

OpenAI introduced the Computer-Using Agent (CUA), a model combining GPT-4o's vision with reinforcement learning to interact with GUIs like a human, powering the new Operator agent. CUA sets new state-of-the-art benchmarks including 38.1% on OSWorld and 58.1% on WebArena, and is available as a research preview for ChatGPT Pro users in the US.

0 favorites 0 likes

#web-browsing

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Papers with Code Trending ↗ · 2024-07-23 Cached

OpenDevin is an open-source platform for developing AI agents that can write code, use command lines, and browse the web to interact with the environment. It supports multiple agents, sandboxed code execution, and evaluation benchmarks like SWE-Bench.

0 favorites 0 likes

#web-browsing

WebGPT: Improving the factual accuracy of language models through web browsing

OpenAI Blog ↗ · 2021-12-16 Cached

OpenAI fine-tuned GPT-3 to answer open-ended questions more accurately by enabling it to use a text-based web browser to search, retrieve, and cite sources. The model outperforms human demonstrators 56% of the time on questions from ELI5 dataset but shows limitations on out-of-distribution tasks like TruthfulQA.

0 favorites 0 likes

web-browsing

Real Chrome. Live DOM. MCP tools. Multi tab control. Visible progress.

@gdb: Codex can now drive Chrome tabs in the background:

Deep research System Card

Computer-Using Agent

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

WebGPT: Improving the factual accuracy of language models through web browsing

Submit Feedback