Mistral AI · Mixtral
Mistral AI's 8x22B MoE base model, with 141B total and 39B active params. Significantly stronger than 8x7B, matching GPT-3.5 on most benchmarks.
Mixtral-8x22B (GAIA baseline era)
Mixtral-8x22B proxy (BFCL era)