Yazılar

Anthropic Reportedly Preparing to Launch Claude 4.5 Opus With Enhanced Jailbreak Resistance

Anthropic is reportedly nearing the release of Claude 4.5 Opus, the most advanced model in its Claude 4.5 series. According to recent leaks, the San Francisco-based AI company has begun testing a new large language model (LLM) with red-teamers — experts tasked with probing AI systems for vulnerabilities. The model, internally codenamed Neptune V6, is said to place a strong emphasis on resisting jailbreaks and prompt-injection exploits, marking a notable step forward in AI safety research. The company has already introduced Claude 4.5 Sonnet and Claude 4.5 Haiku, leaving Opus as the last expected variant in the lineup.

Anthropic Begins Red-Teaming Phase for New Model

As per a post by Tibor Blaho, Lead Engineer at AIPRM, Anthropic has distributed the Neptune V6 model to external red-teamers for testing. Blaho also revealed that the firm has launched a 10-day “universal jailbreak” challenge, where participants who successfully identify confirmed vulnerabilities or bypasses in the model’s safeguards will receive bonus rewards. This initiative signals a rigorous pre-release security audit process — one that encourages ethical testing before the model is made available to the public.

Focus on Reinforcing Safety and Jailbreak Resistance

The leak suggests that Anthropic is doubling down on safety and robustness, areas where the company has historically distinguished itself from competitors. Claude models are already known for their strong alignment capabilities and resistance to manipulation, but the upcoming Opus version reportedly takes these defenses even further. The focus on resisting jailbreaks highlights Anthropic’s commitment to ensuring its AI systems remain compliant, even in adversarial conditions where users attempt to override restrictions or induce unsafe responses.

Anticipation Builds for Claude 4.5 Opus Launch

If these reports prove accurate, the Claude 4.5 Opus could represent one of Anthropic’s biggest advancements since the Claude 4 series debuted earlier this year. The company has not officially commented on the leaks or provided a launch timeline, but given the red-teaming phase is already underway, a public release could be imminent. As the AI landscape heats up with models like GPT-5 and Gemini Ultra, Anthropic’s move to bolster its flagship model’s security could set a new standard for responsible AI deployment.

Reddit Sues Perplexity for Allegedly Scraping Data to Train AI Search Engine

Reddit has filed a lawsuit in a New York federal court against artificial intelligence startup Perplexity, accusing it of unlawfully scraping Reddit data to train its AI-based “answer engine.” The complaint also names three other companies — Lithuania-based Oxylabs, Russia-based AWMProxy, and Texas-based SerpApi — alleging that they bypassed Reddit’s data protection systems to extract massive amounts of content.

According to Reddit, Perplexity “desperately needs” the stolen data to strengthen its search capabilities. The platform, home to thousands of user-driven “subreddit” communities, said its content is one of the most frequently cited sources for AI-generated responses. Reddit has legally licensed its data to OpenAI, Google, and other companies, but claims Perplexity acted without authorization.

The lawsuit follows similar cases across the tech industry involving unauthorized use of copyrighted materials to train AI models. Reddit had previously sued Anthropic in June for similar conduct. Perplexity rejected the accusations, calling its methods “principled and responsible.” Meanwhile, Reddit’s chief legal officer Ben Lee accused AI firms of engaging in “industrial-scale data laundering.”

Reddit is seeking financial damages and a court order preventing Perplexity from continuing to use its content.

Anthropic, Google Discuss Cloud Deal Worth Tens of Billions, Bloomberg Reports

Anthropic, the AI startup behind the Claude chatbot, is in advanced discussions with Google (GOOGL.O) over a massive cloud computing deal valued in the high tens of billions of dollars, according to Bloomberg News, citing people familiar with the talks. The agreement, which is not yet finalized, would see Google provide cloud infrastructure and computing power to support Anthropic’s fast-growing AI operations.

The potential deal underscores the increasing cost and scale of AI development, as companies like Anthropic, OpenAI, and Microsoft-backed Mistral race to secure computing resources needed to train and deploy large language models.

Alphabet shares rose 2.3% after hours following the report, reflecting investor optimism over Google Cloud’s role as a major infrastructure provider for next-generation AI systems. Google declined to comment, while Anthropic did not immediately respond to requests for clarification.

Anthropic, which counts Google and Amazon (AMZN.O) among its biggest investors, has seen rapid adoption of its enterprise AI products and is reportedly on track to reach a $9 billion annual revenue run rate by the end of 2025, nearly tripling its current pace, according to a Reuters report last week.

The collaboration could deepen Google’s long-standing relationship with Anthropic, which already relies heavily on Google Cloud for model training. If completed, the deal would be among the largest cloud infrastructure partnerships ever in the AI sector, solidifying both firms’ positions in the escalating competition against OpenAI and Microsoft.