Yazılar

DeepSeek’s Chatbot Scores Low in NewsGuard Audit, Trails Western Rivals

DeepSeek, a Chinese AI startup, saw its chatbot underperform in a recent NewsGuard audit, achieving just 17% accuracy in delivering accurate news and information. The audit compared DeepSeek’s chatbot with Western AI models, including OpenAI’s ChatGPT and Google’s Gemini, ranking it tenth out of eleven. DeepSeek’s chatbot was found to repeat false claims 30% of the time and provide vague or unhelpful answers 53% of the time in response to news-related queries, leading to an overall fail rate of 83%. In contrast, Western competitors had an average fail rate of 62%.

This performance raises questions about the quality of DeepSeek’s AI technology, which the company has touted as being on par with or superior to OpenAI’s models, at a fraction of the cost. Despite its low accuracy score, DeepSeek’s chatbot quickly became the most downloaded app on Apple’s App Store, igniting concerns about the United States’ dominance in AI and contributing to a market downturn that resulted in a $1 trillion loss in U.S. tech stocks.

NewsGuard used 300 identical prompts to assess DeepSeek and its Western counterparts, including 30 based on false claims circulating online. The topics of these prompts included incidents like the killing of UnitedHealthcare executive Brian Thompson and the downing of Azerbaijan Airlines flight 8243. DeepSeek’s chatbot also reiterated the Chinese government’s stance on certain issues, even when those topics were unrelated to China, such as in the case of the Azerbaijan Airlines crash.

Despite its poor accuracy, some analysts suggest the significance of DeepSeek’s breakthrough lies in its affordability, with D.A. Davidson’s Gil Luria pointing out that it can answer questions at 1/30th the cost of comparable models. However, as with other AI models, DeepSeek was found to be particularly susceptible to repeating false claims, especially when used to create or spread misinformation.

 

xAI, Led by Elon Musk, Trials Standalone Grok AI App for iOS

xAI, the artificial intelligence company owned by Elon Musk, is currently testing a standalone app for its proprietary chatbot, Grok. The app, which is still in its beta phase, is exclusively available on iOS and can only be accessed in select regions. This marks the first time Grok is being offered as a standalone product, separate from the X platform (formerly Twitter), which has been the chatbot’s previous home. Additionally, the app integrates with the recently launched AI-powered image generator, Aurora, allowing users to generate images as part of their interactions with Grok.

The release of the Grok app follows a report from last month that xAI was planning to offer Grok as an independent product. Prior to this, Grok was only accessible through the X platform, making this move a significant step toward broadening its availability. By offering a standalone app, xAI aims to make Grok more widely accessible and usable for individuals who may not have a need for the social media platform itself but are interested in interacting with the AI.

As of now, the Grok beta app is only listed on the App Store in the Australian region, though it is unclear whether it is available in other regions. Staff members from Gadgets 360 were unable to find the app in India, indicating that its availability is still limited. The beta phase suggests that xAI is testing the waters to refine the app and gather user feedback before potentially expanding its global reach.

With this new development, xAI seems to be positioning Grok as a more versatile tool, accessible directly to users who prefer not to engage with the X platform. The integration of AI features like Aurora also signals that xAI is exploring creative and multimedia capabilities, which could enhance the overall user experience. As the beta progresses, it will be interesting to see how Grok evolves and whether it gains traction among users beyond the X ecosystem.

Musk’s xAI Releases Free Access to Grok-2 AI Chatbot

Elon Musk’s artificial intelligence company, xAI, announced on Saturday that the latest version of its chatbot, Grok-2, will now be available free of charge to all users of the social media platform X (formerly Twitter).

According to xAI, users subscribed to the Premium and Premium+ tiers of the platform will benefit from higher usage limits and early access to future capabilities of the chatbot. The announcement, made via the company’s blog, emphasized that these premium users will continue to enjoy priority features as new updates roll out.

xAI revealed that it has been conducting quiet testing of the updated Grok-2 model over the past few weeks to refine its functionality and enhance its performance.

This move aligns with Musk’s vision of integrating artificial intelligence into everyday user interactions on X, leveraging Grok-2 to provide innovative tools and services to the platform’s growing user base.