DeepSeek-R1-Distill-Llama-8B

reasoning8BOpen source

DeepSeek's R1 reasoning distilled into Llama 3 8B — strong chain-of-thought on a Llama base for broad ecosystem compatibility.

164K tokensGratuit / Poids ouvertsTransformerMIT

Benchmarks

GAIA

DeepSeek-R1-Distill-Llama-8B

27.6%

DeepSeek-R1-Distill-Qwen-32B

57.5%

DeepSeek-R1-Distill-Qwen-7B

25.6%

TapeAgent v0.1 (GPT-4o) referencing distill

Autres modèles de cette famille

DeepSeek-R1-Distill-Qwen-1.5Breasoning1.5B

0 benchmarks

DeepSeek-R1-Distill-Qwen-32Breasoning32B

1 benchmarks

DeepSeek-R1-Distill-Llama-70Breasoning70B

0 benchmarks

DeepSeek-R1-Distill-Qwen-7Breasoning7B

1 benchmarks

DeepSeek-R1-0528-Qwen3-8Breasoning8B

0 benchmarks

Retour à Deepseek R1 Distill

DeepSeek AI · Deepseek R1 Distill

DeepSeek-R1-Distill-Llama-8B

reasoning8BOpen source

HuggingFace

DeepSeek's R1 reasoning distilled into Llama 3 8B — strong chain-of-thought on a Llama base for broad ecosystem compatibility.

164K tokensGratuit / Poids ouvertsTransformerMIT

Benchmarks

GAIA

DeepSeek-R1-Distill-Llama-8B

27.6%

DeepSeek-R1-Distill-Qwen-32B

57.5%

DeepSeek-R1-Distill-Qwen-7B

25.6%

TapeAgent v0.1 (GPT-4o) referencing distill

Autres modèles de cette famille

DeepSeek-R1-Distill-Qwen-1.5Breasoning1.5B

0 benchmarks

DeepSeek-R1-Distill-Qwen-32Breasoning32B

1 benchmarks

DeepSeek-R1-Distill-Llama-70Breasoning70B

0 benchmarks

DeepSeek-R1-Distill-Qwen-7Breasoning7B

1 benchmarks

DeepSeek-R1-0528-Qwen3-8Breasoning8B

0 benchmarks