January 20th, 2025

DeepSeek R1

DeepSeek-R1 is a new series of reasoning models utilizing large-scale reinforcement learning, featuring distilled models that outperform benchmarks. They are open-sourced, available for local use, and licensed under MIT.

Read original article

DeepSeek-R1 is a new series of reasoning models developed by DeepSeek, including the first-generation model DeepSeek-R1-Zero, which was trained using large-scale reinforcement learning (RL) without prior supervised fine-tuning (SFT). This innovative approach has led to impressive reasoning capabilities, although it also faced challenges like repetition and readability. To enhance performance, DeepSeek-R1 was introduced, incorporating cold-start data before RL training. The models have shown competitive performance against OpenAI's models in various tasks, including math and coding. The research community benefits from the open-sourcing of DeepSeek-R1-Zero, DeepSeek-R1, and several distilled models derived from them, which demonstrate that smaller models can achieve high performance by leveraging the reasoning patterns of larger models. The evaluation results indicate that the distilled models outperform many existing benchmarks, establishing new state-of-the-art results. The models are available for download and can be run locally or accessed via an API. The project is licensed under the MIT License, allowing for commercial use and modifications.

- DeepSeek-R1 models utilize large-scale reinforcement learning for enhanced reasoning capabilities.

- The series includes distilled models that outperform existing benchmarks, demonstrating the effectiveness of smaller models.

- DeepSeek-R1 and its variants are open-sourced to support the research community.

- The models are available for local deployment and through an API platform.

- The project is licensed under MIT, permitting commercial use and modifications.

DeepSeek v2.5 – open-source LLM comparable to GPT-4o, but 95% less expensive

DeepSeek launched DeepSeek-V2.5, an advanced open-source model with a 128K context length, excelling in math and coding tasks, and offering competitive API pricing for developers.

DeepSeek-V3, ultra-large open-source AI, outperforms Llama and Qwen on launch

Chinese AI startup DeepSeek launched DeepSeek-V3, a 671 billion parameter model outperforming major competitors. It features cost-effective training, innovative architecture, and is available for testing and commercial use.

DeepSeek's new AI model appears to be one of the best 'open' challengers yet

DeepSeek, a Chinese AI firm, launched DeepSeek V3, an open-source model with 671 billion parameters, excelling in text tasks and outperforming competitors, though limited by regulatory constraints.

Interesting Interview with DeepSeek's CEO

Deepseek, a Chinese AI startup, has surpassed OpenAI's models in reasoning benchmarks, focusing on foundational AI technology, open-source models, and low-cost APIs, while aiming for artificial general intelligence.

Notes on the New Deepseek v3

Deepseek v3, a leading open-source model with 607 billion parameters, excels in reasoning and math tasks, outperforming competitors while being cost-effective, trained on 14.8 trillion data points for $6 million.

2 comments

By @chvid - 4 months

Not much info here but this is big news - an actually published open source model that matches O1 from OpenAI - the model has been available behind an api for a few months.

Here is an article with someone playing with it:

https://www.datacamp.com/blog/deepseek-r1-lite-preview

By @deyiao - 4 months

It’s been reported that DeepSeek R1’s coding capabilities exceed GPT-o1-low and nearly match GPT-o1-meduim, quite astonishing.

DeepSeek v2.5 – open-source LLM comparable to GPT-4o, but 95% less expensive

DeepSeek launched DeepSeek-V2.5, an advanced open-source model with a 128K context length, excelling in math and coding tasks, and offering competitive API pricing for developers.

DeepSeek-V3, ultra-large open-source AI, outperforms Llama and Qwen on launch

DeepSeek's new AI model appears to be one of the best 'open' challengers yet

DeepSeek, a Chinese AI firm, launched DeepSeek V3, an open-source model with 671 billion parameters, excelling in text tasks and outperforming competitors, though limited by regulatory constraints.

DeepSeek R1

Related

DeepSeek v2.5 – open-source LLM comparable to GPT-4o, but 95% less expensive

DeepSeek-V3, ultra-large open-source AI, outperforms Llama and Qwen on launch

DeepSeek's new AI model appears to be one of the best 'open' challengers yet

Interesting Interview with DeepSeek's CEO

Notes on the New Deepseek v3

Related

DeepSeek v2.5 – open-source LLM comparable to GPT-4o, but 95% less expensive

DeepSeek-V3, ultra-large open-source AI, outperforms Llama and Qwen on launch

DeepSeek's new AI model appears to be one of the best 'open' challengers yet

Interesting Interview with DeepSeek's CEO

Notes on the New Deepseek v3