Tag
This article discusses passive browser identification techniques using HTTP header order, IP options, user-agent strings, and random number generator patterns.
A personal blog post explaining an anti-crawler browser blocking mechanism, including special notes for Inoreader, Feedly, Vivaldi, and archive.* users.
The author explains that their blog blocks requests from old or suspicious browser user agents to mitigate a surge in high-volume crawlers, which are likely gathering LLM training data. Users of Vivaldi and Inoreader get specific instructions for adjusting settings or reporting issues.
A systems administrator explains why they block old browser user agents to deter LLM-training crawlers, and gives tips for affected users.