Compare Models

Side-by-side comparison of benchmark performance across Chinese LLM providers.

Benchmark Comparison

Full Comparison Table

Showing top-performing models. Green highlights indicate the best score in each category.

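| Model | Provider | Parameters | Context | Open Source | MMLU | HumanEval | GSM8K | MATH | MMLU-Pro | GPQA |
|---|---|---|---|---|---|---|---|---|---|---|
| DeepSeek-R1-0528 | DeepSeek | 671B (37B active) | 131K | Yes | 90.8% | 89.5% | 97% | 94.5% | - | - |
| Qwen3-235B-A22B | Qwen | 235B (22B active) | 131K | Yes | 86.5% | 92.1% | 95.2% | 90.3% | - | - |
| DeepSeek-V3.2 | DeepSeek | 671B (37B active) | 131K | No | 88.5% | 90% | 95.5% | - | - | - |
| Kimi K2 | Kimi | 1T (32B active) | 131K | Yes | 87.5% | 90.5% | 95% | - | - | - |
| DeepSeek-V3-0324 | DeepSeek | 671B (37B active) | 131K | Yes | 87.5% | 88% | - | - | 81.2% | 68.4% |
| DeepSeek-V3 | DeepSeek | 671B (37B active) | 131K | Yes | 87.1% | 86.5% | 93% | - | - | - |
| QwQ-32B | Qwen | 32B | 131K | Yes | 85% | 88.5% | 94% | 87.5% | - | - |
| Qwen2.5-72B-Instruct | Qwen | 72B | 131K | Yes | 86.1% | 86.4% | 91.2% | 83.1% | - | - |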
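
The "best score in category" highlighting can be reproduced from the table values. The following is a minimal sketch, not the page's actual implementation: the names `BENCHMARKS`, `SCORES`, and `best_per_benchmark` are hypothetical, the scores are copied from the table, and missing entries ("-") are assumed to be excluded from the comparison rather than treated as zero.

```python
# Benchmark columns in table order (lowercase keys are an illustrative choice).
BENCHMARKS = ["mmlu", "humaneval", "gsm8k", "math", "mmlu-pro", "gpqa"]

# Scores in percent, copied from the table; None marks a missing ("-") entry.
SCORES = {
    "DeepSeek-R1-0528": [90.8, 89.5, 97.0, 94.5, None, None],
    "Qwen3-235B-A22B": [86.5, 92.1, 95.2, 90.3, None, None],
    "DeepSeek-V3.2": [88.5, 90.0, 95.5, None, None, None],
    "Kimi K2": [87.5, 90.5, 95.0, None, None, None],
    "DeepSeek-V3-0324": [87.5, 88.0, None, None, 81.2, 68.4],
    "DeepSeek-V3": [87.1, 86.5, 93.0, None, None, None],
    "QwQ-32B": [85.0, 88.5, 94.0, 87.5, None, None],
    "Qwen2.5-72B-Instruct": [86.1, 86.4, 91.2, 83.1, None, None],
}


def best_per_benchmark(scores, benchmarks):
    """Return {benchmark: (model, score)} for the highest reported score."""
    best = {}
    for i, name in enumerate(benchmarks):
        # Only models that report this benchmark take part in the comparison.
        reported = [
            (row[i], model) for model, row in scores.items() if row[i] is not None
        ]
        if reported:
            score, model = max(reported)
            best[name] = (model, score)
    return best


if __name__ == "__main__":
    for benchmark, (model, score) in best_per_benchmark(SCORES, BENCHMARKS).items():
        print(f"{benchmark}: {model} ({score}%)")
```

Because MMLU-Pro and GPQA are reported for only one model in this table, the "best" score in those columns reflects coverage rather than a real comparison.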

Providers