Yazılar

Cloudflare Introduces Pay-Per-Crawl Tool to Help Websites Monetize AI Bot Access

Cloudflare has unveiled a new tool designed to give website owners greater control over AI bot crawlers accessing their content, allowing them to block unauthorized scraping or set fees for access. The move aims to help publishers and content creators monetize the use of their material by artificial intelligence companies, which increasingly crawl websites to train AI models without sending traffic back or providing compensation.

The tool enables site owners to choose which AI crawlers can access their content and implement a “pay per crawl” pricing model, helping creators control how their work is used and ensure fair payment. This innovation comes amid declining referral traffic from search engines, which historically drove ad revenue to websites.

Major publishers like Condé Nast, the Associated Press, and social platforms including Reddit and Pinterest back the initiative. Cloudflare’s Chief Strategy Officer, Stephanie Cohen, explained that the tool is designed to establish a sustainable ecosystem for content creators and AI companies alike. She highlighted that rapid changes in traffic patterns demand new approaches, calling this tool “the beginning of a new model for the internet.”

Data from Cloudflare shows that Google’s ratio of crawls to visitor referrals has dropped from 6:1 to 18:1 in six months, suggesting users increasingly get answers directly from Google search results or AI features rather than visiting original sites. However, Google’s crawl-to-visit ratio remains far lower than AI firms like OpenAI, which have ratios around 1,500:1, reflecting heavy content scraping without referral traffic.

For decades, traditional search engines indexed web content and drove users to publishers, rewarding them for their work. But AI crawlers disrupt this model by harvesting data without sending visitors back, aggregating content in chatbots like ChatGPT, and reducing creators’ revenue and recognition.

Many AI companies bypass common publisher tools used to block scraping and argue their data collection is legal and fair use. This has led some publishers, including the New York Times, to sue AI firms for copyright infringement. Others have negotiated licensing agreements to protect their content and monetize usage.

Reddit, notably, has sued AI startup Anthropic for scraping user comments but also signed a licensing deal with Google, illustrating the complex responses from content owners seeking to protect their assets in the AI era.

Databricks to Acquire Neon for $1 Billion to Bolster AI Agent Capabilities

Databricks is continuing its aggressive expansion into artificial intelligence by acquiring database startup Neon in a $1 billion deal, the company announced Wednesday. The acquisition is part of Databricks’ broader strategy to empower developers and enterprises to build AI agents, software programs that automate tasks with minimal human input.

Neon, founded in 2021, has built a cloud-native, serverless PostgreSQL platform optimized for real-time data access and AI agent integration. Its technology is already embedded across leading developer ecosystems, including Vercel, Replit, Cloudflare, GitHub, and Microsoft platforms.

The disruption will be with AI. We would love to own a chunk of that,” said Databricks CEO Ali Ghodsi.

Why Neon?

AI agents require real-time access to structured data to function effectively in environments such as automated coding, task execution, and workflow optimization. Neon’s serverless PostgreSQL technology offers:

  • Scalable and low-latency data infrastructure

  • Simplified deployment for AI-powered apps

  • Integration into popular developer tools

The acquisition enables Databricks to own and embed a modern database engine directly into its Data Intelligence Platform, making it more attractive for developers building AI agent applications.

Strategic Fit for Databricks

  • Databricks already invested in Neon, and now brings the full team onboard post-acquisition.

  • This is Databricks’ third billion-dollar+ deal, following its acquisitions of:

    • MosaicML ($1.3B in 2023) – focused on generative AI

    • Tabular (2023) – focused on data management tools

  • The company now boasts a valuation of $62 billion, having raised $10 billion last year.

Industry Implications

The database market is undergoing a generational shift, according to Ghodsi, as AI upends traditional data architecture needs. Neon’s database is built specifically to serve AI-first workloads, a growing priority as enterprises race to deploy intelligent agents for tasks like:

  • Automated report generation

  • Code writing

  • Smart customer support

  • Email drafting and scheduling

Databricks by the Numbers

  • 10,000+ enterprise clients, including Comcast, Shell, Block, and Rivian

  • Competing closely with Snowflake in the data analytics and AI infrastructure space

  • Seen as a prime IPO candidate in the near future

The acquisition of Neon marks another bold step by Databricks to dominate the intersection of AI, data, and developer tooling, as it positions itself to lead the next wave of enterprise AI adoption.

Cloudflare introduces a tool aimed at countering AI-driven bots

Cloudflare, the publicly traded cloud service provider, has launched a new tool aimed at preventing bots from scraping data on websites hosted on its platform for AI model training. While some AI vendors respect directives in a website’s robots.txt file to block their bots, others do not, prompting Cloudflare’s intervention. Devamını Oku