stale html and headless browsers kept getting me blocked, so i started replaying the actual requests instead
Summary
The author shares their experience of switching from headless browsers to replaying direct requests to scrape websites, reducing block rates and resource usage significantly.
Similar Articles
@Fluyeporlaweb: Web scraping just leveled up Scrapling bypasses Cloudflare blocks, is 774 times faster than BeautifulSoup, and doesn't …
Scrapling is a web scraping framework that bypasses Cloudflare blocks, is 774 times faster than BeautifulSoup, and adapts to website changes automatically. It has 52.2k GitHub stars and supports AI agents as an MCP server.
@simplifyinAI: This python library scrapes any website while bypassing every bot protection on the internet. It rotates fingerprints, …
A Python library that scrapes websites while bypassing bot protections like Cloudflare and Akamai by rotating fingerprints, mimicking browser headers, and automatically handling CAPTCHAs. It uses Headless Chromium, Playwright, proxy rotation, and is fully open-source.
@hasantoxr: I'm done paying $500 a month for anti-detect browsers after finding this. It's called CloakBrowser. A stealth Chromium …
The article introduces CloakBrowser, an open-source stealth Chromium-based browser designed to bypass bot detection systems like reCAPTCHA and Cloudflare Turnstile. It claims to offer superior stealth capabilities by patching the C++ source code rather than injecting JavaScript, positioning itself as a free alternative to expensive commercial anti-detect browsers.
@thisguyknowsai: This is why GitHub is undefeated... A developer built a headless browser that makes Chrome look obese. It's called Obsc…
A developer built Obscura, an open-source headless browser engine in Rust designed specifically for AI agents, web scraping, and automation, claiming it's more lightweight than Chrome.
@NFTCPS: Time to retire Headless Chrome! Someone built a Rust-based headless browser engine for AI agents and crawlers—Obscura—whose performance leaves Chrome in the dust: ① only 30 MB of RAM (Chrome eats several GB) ②…
Obscura is a new Rust-based headless browser engine targeting AI agents and crawlers, offering 30 MB memory usage, 85 ms startup, and CDP compatibility with Puppeteer/Playwright.