CanItRun Logocanitrun.

Kimi K2.6

Kimi K2.6 needs roughly 500.0GB VRAM at Q4 quantization (2000.0GB at FP16). 2 GPUs we track can run it fully in VRAM at 8k context.

Moonshot AI1000B params32B active (MoE)250k contextKimiNon-commercial only

VRAM at each quantization

Assumes 8k context. KV cache grows linearly with context length.

QuantWeightsKV cacheTotal
FP162000.0 GB2.68 GB2243.0 GB
Q81000.0 GB2.68 GB1123.0 GB
Q6_K750.0 GB2.68 GB843.0 GB
Q5_K_M625.0 GB2.68 GB703.0 GB
Q4_K_M500.0 GB2.68 GB563.0 GB
Q3_K_M400.0 GB2.68 GB451.0 GB
Q2_K300.0 GB2.68 GB339.0 GB

Benchmarks

GPUs that run Kimi K2.6 natively (2)

Notes

1 Trillion parameter MoE. Advanced agentic capabilities for complex workflows.

Hugging Face ↗Ollama ↗Released 2026-04-10