December 26th, 2024

DeepSeek-V3

DeepSeek has launched DeepSeek-V3, which processes 60 tokens per second, features 671 billion parameters, and maintains open-source compatibility. Pricing changes will occur after February 8, 2024, with future updates planned.

Read original articleLink Icon
DeepSeek-V3

DeepSeek has announced the release of DeepSeek-V3, marking a significant advancement in their API capabilities. The new version boasts a processing speed of 60 tokens per second, which is three times faster than its predecessor, DeepSeek-V2. It features 671 billion mixture of experts (MoE) parameters, with 37 billion activated parameters, and has been trained on 14.8 trillion high-quality tokens. The API remains compatible with previous versions, and the models and research papers are fully open-source. Pricing for the API will remain the same as V2 until February 8, after which new rates will apply: $0.27 per million tokens for cache misses, $0.07 for cache hits, and $1.10 for output. DeepSeek emphasizes its commitment to open-source principles and aims to bridge the gap between open and closed models in the AI landscape. Future updates are expected to include multimodal support and other innovative features, reinforcing DeepSeek's mission to advance inclusive artificial general intelligence (AGI).

- DeepSeek-V3 is three times faster than V2, processing 60 tokens per second.

- The new model includes 671 billion MoE parameters and is trained on 14.8 trillion tokens.

- API pricing will remain unchanged until February 8, 2024, after which new rates will be implemented.

- DeepSeek is committed to open-source development and aims to enhance the AI community.

- Future updates will introduce multimodal support and additional features.

Link Icon 1 comments