December 27th, 2024

DeepSeek's new AI model appears to be one of the best 'open' challengers yet

DeepSeek, a Chinese AI firm, launched DeepSeek V3, an open-source model with 671 billion parameters, excelling in text tasks and outperforming competitors, though limited by regulatory constraints.

DeepSeek, a Chinese AI firm, has launched DeepSeek V3, a powerful open-source AI model that developers can download and modify for a range of applications, including commercial use. Released under a permissive license, DeepSeek V3 handles text-based tasks such as coding, translation, and writing. According to DeepSeek's internal benchmarks, it outperforms both open and closed AI models, including Meta's Llama 3.1 and OpenAI's GPT-4o, particularly in coding competitions. With 671 billion parameters, the model is significantly larger than its competitors; it was trained on a dataset of 14.8 trillion tokens. That size comes at a cost: DeepSeek V3 requires high-end hardware to run efficiently. The model was developed on a relatively modest budget of $5.5 million using a data center of Nvidia H800 GPUs, which is noteworthy given the restrictions on GPU procurement for Chinese companies. However, the model's responses are shaped by Chinese regulatory requirements, which limit its ability to discuss sensitive topics. DeepSeek is backed by High-Flyer Capital Management, which aims to develop superintelligent AI. The launch of DeepSeek V3 underscores how competitive AI development has become, particularly in the open-source domain.
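To give a rough sense of why a 671-billion-parameter model calls for high-end hardware, the sketch below estimates the memory needed just to hold the weights. The parameter count is the one reported above; the bytes-per-parameter figures are illustrative assumptions about storage precision, not details from the article, and the estimate ignores activations, caches, and other runtime overhead.

```python
# Back-of-the-envelope estimate of weight storage for a 671B-parameter model.
# PARAMS comes from the article; the precisions below are assumptions
# chosen for illustration, not the format DeepSeek V3 is known to ship in.

PARAMS = 671e9  # reported parameter count

# Hypothetical storage widths, in bytes per parameter.
BYTES_PER_PARAM = {"32-bit": 4, "16-bit": 2, "8-bit": 1}


def weight_memory_gb(params: float, bytes_per_param: int) -> float:
    """Return the memory needed for the weights alone, in GB (10^9 bytes)."""
    return params * bytes_per_param / 1e9


if __name__ == "__main__":
    for name, width in BYTES_PER_PARAM.items():
        print(f"{name} weights: ~{weight_memory_gb(PARAMS, width):,.0f} GB")
```

Even at one byte per parameter, the weights alone would occupy several hundred gigabytes under these assumptions, which is more than a single consumer GPU can hold.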

- DeepSeek V3 is one of the most powerful open-source AI models available.

- According to DeepSeek's internal benchmarks, it outperforms major open and closed competitors, including on coding tasks.

- The model has 671 billion parameters and was trained on 14.8 trillion tokens.

- Development costs were significantly lower than those of other leading AI models.

- Regulatory constraints affect the model's ability to discuss sensitive political topics.
