deepseek-ai/DeepSeek-R1-0528

# pull & run locally
pip install mlforge-sdk && mlforge pull deepseek-ai/DeepSeek-R1-0528

Model details

Task: Text Generation
Provider: deepseek-ai
Framework: transformers
Parameters: 684.5B
License: MIT
Downloads: 5.5M
Likes: 2.5K
Paper: arXiv:2501.12948
Updated: 2025-05-29

About deepseek-ai/DeepSeek-R1-0528

DeepSeek-R1-0528 is a minor version upgrade of the DeepSeek R1 model. In this update, DeepSeek R1 significantly improves its depth of reasoning and its inference capabilities by using more compute and introducing algorithmic optimizations during post-training. The model performs strongly across benchmark evaluations covering mathematics, programming, and general logic, and its overall performance now approaches that of leading models such as o3 and Gemini 2.5 Pro.

Related Text Generation

Qwen/Qwen3-0.6B · Text Generation · 751.6M params · 27.8M downloads · 1.4K likes
Qwen/Qwen3-4B · Text Generation · 4.0B params · 16.4M downloads · 641 likes
openai-community/gpt2 · Text Generation · 137.0M params · 13.3M downloads · 3.3K likes
Qwen/Qwen3-8B · Text Generation · 8.2B params · 13.0M downloads · 1.2K likes
Qwen/Qwen2.5-7B-Instruct · Text Generation · 7.6B params · 12.8M downloads · 1.4K likes
facebook/opt-125m · Text Generation · 12.3M downloads · 267 likes