Kimi

Moonshot AI

Moonshot AI's Kimi series is known for pioneering long-context understanding and efficient MoE architectures. K2 became a major open-source competitor in 2025.

Models

Open Source

Models

Kimi K2 Thinking

1T (32B active) parameters

Open Source API

🧠 Reasoning 🤖 Agents 💻 Coding

Reasoning and tool-using thinking agent. Can execute 200-300 sequential tool calls without human interference. State-of-the-art agentic capabilities.

200-300 sequential tool calls
LiveCodeBench-v6: 83.1%
Advanced agentic reasoning

Benchmarks

humaneval

91%

gsm8k

96.5%

Released Nov 1, 2025 131K context

Kimi Linear

48B (3B active) parameters

Open Source API

💬 Chat 📄 Long Context

Uses Kimi Delta Attention (KDA) for efficient long-context processing. Reduces memory usage and improves generation speed at longer context windows.

1M token context window
Novel Delta Attention mechanism
Efficient memory usage

Benchmarks

mmlu

78%

Released Oct 1, 2025 1M context

Kimi K2

1T (32B active) parameters

Open Source API

💬 Chat 🧠 Reasoning 💻 Coding

1 trillion parameter MoE with 32B active. State-of-the-art open-source performance on coding benchmarks. Trained for $4.6M, rivaling ChatGPT and Claude.

SOTA open-source coding performance
Trained for only $4.6M
Beats GPT-4o on multiple benchmarks

Benchmarks

mmlu

87.5%

humaneval

90.5%

gsm8k

95%

Released Jul 14, 2025 131K context

HuggingFace

Kimi-Dev

72B parameters

Open Source API

💻 Coding

Coding-focused model based on Qwen2.5-72B. State-of-the-art among open source on SWE-bench Verified.

SOTA open-source on SWE-bench Verified
Built on Qwen2.5-72B foundation
Specialized for software development

Benchmarks

humaneval

89%

Released Jun 1, 2025 131K context

Kimi-VL

16B (3B active) parameters

Open Source API

👁️ Vision

Open-source vision-language MoE model. Efficient multimodal understanding with only 3B active parameters.

16B MoE with 3B active
Efficient multimodal processing
Apache 2.0 license

Released Apr 1, 2025 131K context

Kimi K1.5

Unknown parameters

API

💬 Chat 🧠 Reasoning 👁️ Vision

First major Kimi reasoning model. Matches OpenAI o1 in math, coding, and multimodal reasoning. Uses reinforcement learning for dynamic learning.

Matches OpenAI o1 performance
Free with no usage limits
Long-CoT and short-CoT modes

Benchmarks

mmlu

85%

gsm8k

93%

math

85%

Released Jan 20, 2025 131K context