Cloudflare, the internet infrastructure company powering roughly 20 percent of websites, has taken a firm stance against unauthorized AI scraping.
On Tuesday, the company announced that AI bots will now be blocked from accessing websites by default unless site owners choose to opt in.
This default change means AI companies must now explicitly seek permission to scrape website content. Website owners will also be able to determine how AI crawlers use their content, whether for training, search, or inference, creating a new standard of transparency and control in the AI era.
Cloudflare’s move challenges the long-standing practice of AI systems harvesting vast amounts of online content without rewarding or even acknowledging the original creators. While search engines have traditionally driven traffic and revenue back to websites, AI-generated answers often sidestep that cycle entirely, depriving creators of both visibility and income.
“If the internet is going to survive the age of AI, we need to give publishers the control they deserve and build a new economic model that works for everyone,” said Cloudflare CEO Matthew Prince. “AI crawlers have been scraping content without limits. Our goal is to put the power back in the hands of creators.”
To address this imbalance, Cloudflare also unveiled a new “Pay Per Crawl” system. Under this model, websites can grant access to AI crawlers in exchange for micropayments, essentially charging per visit. This could enable creators to benefit financially from their contributions to AI training without having to surrender control of their content.
Analysts see both promise and challenges in this model. Some, like Akylade’s Jason Dion, support it as a fair, API-style solution. Others question whether AI companies will embrace the cost or if the payment model will mainly enrich intermediaries.
Critics also warn of a potential two-tiered internet, where high-traffic or trusted sites are paid to allow AI access, while smaller ones are left behind, or worse, bypassed through data laundering by third-party aggregators.
Still, with Cloudflare backing the plan and over 50 major publishers, including Time, Reddit, Pinterest, The Associated Press, and Universal Music Group, supporting a permission-based approach, the model has momentum.
Security experts say this new structure could also help manage bot authentication and minimize malicious traffic. However, as researchers point out, overly strict control could reduce the quality and breadth of data used by AI systems, harming innovation and public benefit in the long term.
While the initiative may be in its early stages, Cloudflare’s move represents a significant power shift in the digital content economy. By giving content creators more control and potentially more compensation, it’s sparking a new chapter in the ongoing debate between open access and content ownership in the AI age.
What You Should Know
- Cloudflare, servicing 20% of websites, now blocks AI web-scraping bots by default as of July 1, 2025, requiring explicit permission from site owners.
- The Pay Per Crawl model allows website owners to charge AI companies per crawl, aiming to compensate creators for AI training data.
- Over 50 companies, including Reddit and The Atlantic, support the permission-based model to protect content creators’ revenue and control.
- The policy could limit AI model training data, particularly for startups, and may shift power to publishers, though scalability and enforcement challenges remain.
- Analysts warn that restricting web access could impact AI accuracy, while unethical crawlers might bypass rules, highlighting the need for a balanced approach to innovation and content control.
























