DeepSeek AI

DeepSeek R1

Open source · 3 variants

DeepSeek-R1

reasoning · 671B

HuggingFace

DeepSeek's flagship 671B reasoning model with chain-of-thought, reported to match OpenAI o1 performance on math and coding benchmarks.

164K tokens · Free / Open weights · MoE · MIT

DeepSeek-R1

76.4%

SU AI Zero (Anthropic+Google+OpenAI, Suzhou AI Lab)

DeepSeek-R1-0528

reasoning · 671B

HuggingFace

DeepSeek's May 2025 R1 update, with stronger reasoning across math, coding, and science than the original R1 release.

164K tokens · Free / Open weights · MoE · MIT

LiveCodeBench

DeepSeek-R1-0528

81.7%

DeepSeek-R1-0528: easy = 99.0, medium = 89.3, hard = 60.4

DeepSeek-R1-Zero

reasoning · 671B

HuggingFace

DeepSeek's pure-RL reasoning model, trained without supervised fine-tuning (SFT); it demonstrates emergent chain-of-thought through reinforcement learning alone.

164K tokens · Free / Open weights · MoE · MIT
