Yazılar

Chinese Chip Makers and Cloud Providers Rush to Support DeepSeek’s AI Models

Chinese chip makers and cloud service providers are quickly integrating DeepSeek’s artificial intelligence models into their systems, marking a significant turning point for the nation’s AI industry. Companies like Moore Threads and Hygon Information Technology, both AI chip manufacturers, announced that their computing clusters and accelerators are now compatible with DeepSeek’s R1 and V3 models. Moore Threads even shared a celebratory post on WeChat, praising DeepSeek’s ability to drive China’s AI sector forward by utilizing domestically produced graphic processing units (GPUs).

Huawei Technologies, which also develops AI chips, revealed it is collaborating with AI infrastructure startup SiliconFlow to integrate DeepSeek’s models into its Ascend cloud service. This integration has been described as a “watershed moment” by Bernstein analysts, highlighting the growing independence of China’s AI sector from advanced U.S. hardware.

Cloud giants like Alibaba, Baidu, and Tencent have also jumped on board, offering DeepSeek’s models through their respective services. DeepSeek’s AI assistant, launched last month, quickly gained popularity by offering a more data-efficient alternative at a fraction of the cost of global competitors, surpassing ChatGPT in app downloads from Apple’s App Store within days.

The company has drawn attention globally with its groundbreaking approach. DeepSeek’s research, published in December, claimed that its V3 model’s training cost less than $6 million in Nvidia’s H800 chips—significantly lower than the billions spent by companies like Meta and Microsoft. This has been a major factor in DeepSeek’s rising prominence, with its founder, Liang Wenfeng, becoming a cultural figure in China.

While Microsoft and Amazon have started offering DeepSeek’s models, some countries, including Italy and the Netherlands, have raised concerns over privacy, leading them to either block or investigate the AI app.

 

High-Flyer, the AI Quant Fund Behind China’s DeepSeek, Shifts Focus to Artificial General Intelligence

Key Highlights:

  • High-Flyer, a quantitative hedge fund, shifted its focus from managing a $13.79 billion portfolio to developing Artificial General Intelligence (AGI).
  • The fund, officially known as Hangzhou Huanfang Technology Ltd Co., is reorienting its resources to pursue AGI, which is a more advanced form of AI capable of surpassing human abilities in most economically valuable tasks.
  • The DeepSeek AI model, which has gained significant attention in the tech world, is part of this new direction under the leadership of Liang Wenfeng, High-Flyer’s founder and the leader of DeepSeek.
  • DeepSeek’s AI models have garnered praise from Silicon Valley and sparked concerns about the computational efficiency of U.S. firms’ AI models, given DeepSeek’s claims of utilizing far less computing power.

Background on High-Flyer’s AI Investment Strategy:

  • High-Flyer has heavily invested in supercomputing clusters, including those made up of Nvidia A100 chips, despite U.S. export controls. These clusters have been crucial for the development of its AI models.
  • The company has also spent millions on high-end Nvidia chips, setting up two AI supercomputing clusters with a combined investment of over $1.2 billion.
  • DeepSeek, which utilizes much less powerful chips (Nvidia’s H800 and H20), has nonetheless sparked debates on its computational capabilities, with some tech executives speculating it may have access to 50,000 Nvidia H100 chips, potentially bypassing U.S. export restrictions.

Future Prospects:

  • Liang Wenfeng has indicated that High-Flyer is not currently seeking external funding for DeepSeek and is more focused on overcoming challenges related to chip restrictions rather than financial concerns.
  • The company’s strategic shift towards AGI signals a long-term commitment to advancing AI technology for human benefit, even as it navigates international tensions over AI and technology exports.