So now scraping data without permission is bad for AI training all of a sudden?

Reddit r/artificial 06/27/26, 07:37 AM News

Summary

A commentary on the shifting attitudes towards web scraping for AI training, questioning the sudden condemnation of data collection without permission.

Similar Articles

AI Makes Large-Scale Web Scraping Accessible. Is That a Problem?

Reddit r/ArtificialInteligence

The article discusses how AI coding assistants make large-scale web scraping accessible to ordinary people, raising ethical concerns about ignoring robots.txt and rate limits, and questions the responsibility of AI providers.

How does AI follow ethical guidelines in Data Collection?

Reddit r/artificial

A commentary on the ethical challenges of AI agents ignoring website rules like robots.txt when generating scrapers, and the responsibility of AI providers to implement guardrails without hindering product usability.

OpenAI violated Canadian privacy laws, federal and provincial watchdogs say

Reddit r/ArtificialInteligence

Canadian federal and provincial privacy watchdogs have determined that OpenAI violated privacy laws by scraping vast amounts of personal data to train ChatGPT without proper consent.

Unlawful by design: Exposing the human rights costs of generative AI

Lobsters Hottest

Amnesty International's briefing argues that generative AI systems built on unlawful web scraping violate international human rights law, and calls for their prohibition.

Atlassian enables default data collection to train AI

Hacker News Top

Atlassian has enabled data collection by default to use customer data for training AI models, raising privacy concerns among enterprise users.