Hardware to run Kimi K2.7 Code 1T (MoE)
June 2026 release: a code-specialized variant built on K2.6 that uses ~30% fewer thinking tokens for the same task. Same 1T total / 32B active MoE (384 experts, top-8 routing, plus 1 shared expert), 256K context, Modified MIT license. Only internal Moonshot benchmarks are available so far (Kimi Code Bench v2 62.0, MCP Atlas 76.0, MCP Mark Verified 81.1); standard SWE-Bench / TB2 cells stay null until third-party leaderboards land.
4× Strix Halo cluster (512 GB unified)
DGX B200 — 8× B200 server (1.44 TB HBM3e)
Single AMD Instinct MI355X 288 GB workstation
8× RTX Pro 6000 Blackwell server (768 GB)
DGX H200 — 8× H200 server (1.13 TB HBM3e)
12× RTX Pro 6000 Blackwell rack (1152 GB)
Mac Studio M3 Ultra 512 GB
Every other build that runs Kimi K2.7 Code 1T (MoE)
6 additional builds fit Kimi K2.7 Code 1T (MoE) at Q2_K (280 GB usable minimum), sorted by sticker price.
| Build | Vendor · form factor | Price | Memory (total / usable) | Bandwidth | tg/s (Q2) | Active W | 5-yr power |
|---|---|---|---|---|---|---|---|
| 4× DGX Spark cluster (512 GB unified, CUDA) | NVIDIA · rack of 4 desktops | $20k | 512 / 488 GB | 273 GB/s | 11 t/s | 920 W | $3.4k |
| 8× Strix Halo cluster (1024 GB unified) | AMD · rack of 8 mini-PCs, 10/25 GbE fabric | $23k | 1024 / 768 GB | 256 GB/s | 6.0 t/s | 960 W | $3.4k |
| 2× Mac Studio M3 Ultra 512 GB cluster (TB5 / MLX) | Apple · two desktops, Thunderbolt 5 RDMA | $28k | 1024 / 960 GB | 819 GB/s | 8.4 t/s | 440 W | $1.5k |
| Quad RTX Pro 6000 Blackwell build (384 GB) | NVIDIA · workstation / 4U pedestal | $38k | 384 / 372 GB | 1792 GB/s | — | 2200 W | $8k |
| 8× DGX Spark cluster (1024 GB unified, CUDA) | NVIDIA · rack of 8 desktops, 200 GbE fabric | $44k | 1024 / 976 GB | 273 GB/s | 16 t/s | 1840 W | $7k |
| 8× H100 80 GB server | NVIDIA · server rack | $280k | 640 / 620 GB | 3350 GB/s | 72 t/s | 5600 W | $20k |
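The page does not state the electricity price or duty cycle behind the "5-yr power" column. As a rough sketch, assuming each build runs continuously at its listed active wattage (an assumption, not something stated here), the implied rate can be backed out from the table. The function names and the `duty_cycle` parameter are illustrative only.

```python
HOURS_PER_YEAR = 24 * 365  # 8760 h, leap days ignored

# (active watts, listed 5-year power cost in USD), copied from the table above.
TABLE_ROWS = [
    (920, 3_400),
    (960, 3_400),
    (440, 1_500),
    (2_200, 8_000),
    (1_840, 7_000),
    (5_600, 20_000),
]


def five_year_kwh(active_watts, duty_cycle=1.0, years=5):
    """Energy in kWh. duty_cycle=1.0 is the ASSUMED 24/7 operation."""
    return active_watts / 1000 * HOURS_PER_YEAR * years * duty_cycle


def five_year_cost(active_watts, usd_per_kwh, duty_cycle=1.0, years=5):
    """Electricity cost in USD for a given (hypothetical) price per kWh."""
    return five_year_kwh(active_watts, duty_cycle, years) * usd_per_kwh


def implied_usd_per_kwh(active_watts, listed_cost_usd, duty_cycle=1.0, years=5):
    """Price per kWh that would reproduce a listed 5-year cost."""
    return listed_cost_usd / five_year_kwh(active_watts, duty_cycle, years)


if __name__ == "__main__":
    for watts, cost in TABLE_ROWS:
        rate = implied_usd_per_kwh(watts, cost)
        print(f"{watts:>5} W  ${cost:>6,}  ->  {rate:.3f} USD/kWh")
```

Under the 24/7 assumption every row lands at roughly $0.08 per kWh, with the spread explained by rounding in the listed costs. If the page assumed a lower duty cycle, the implied rate would be proportionally higher.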
8× Strix Halo cluster (1024 GB unified)
DGX B200 — 8× B200 server (1.44 TB HBM3e)
2× Mac Studio M3 Ultra 512 GB cluster (TB5 / MLX)
8× RTX Pro 6000 Blackwell server (768 GB)
DGX H200 — 8× H200 server (1.13 TB HBM3e)
12× RTX Pro 6000 Blackwell rack (1152 GB)
8× DGX Spark cluster (1024 GB unified, CUDA)
No plug-and-play build fits at Q4_K_M
Only used / DIY / homelab-cluster rigs fit Kimi K2.7 Code 1T (MoE) at this quant.
Every other build that runs Kimi K2.7 Code 1T (MoE)
1 additional build fits Kimi K2.7 Code 1T (MoE) at Q4_K_M (600 GB usable minimum), sorted by sticker price.
| Build | Vendor · form factor | Price | Memory (total / usable) | Bandwidth | tg/s (Q4) | Active W | 5-yr power |
|---|---|---|---|---|---|---|---|
| 8× H100 80 GB server | NVIDIA · server rack | $280k | 640 / 620 GB | 3350 GB/s | 60 t/s | 5600 W | $20k |
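The "usable minimum" figures quoted for each quant (280 GB at Q2_K, 600 GB at Q4_K_M) can be turned into a simple fit check and an upper bound on bits per weight. The sketch below assumes the second figure in each Memory cell is usable memory and that GB means decimal gigabytes. Neither is stated explicitly on the page, and the helper names are hypothetical.

```python
TOTAL_PARAMS = 1e12  # "1 T total" from the model description

# Usable-memory minimums quoted on this page, in GB.
MIN_USABLE_GB = {"Q2_K": 280, "Q4_K_M": 600}

# ASSUMED usable GB (second figure in the Memory column) for three listed builds.
USABLE_GB = {
    "Quad RTX Pro 6000 Blackwell build": 372,
    "4x DGX Spark cluster": 488,
    "8x H100 80 GB server": 620,
}


def implied_bits_per_weight(min_gb, params=TOTAL_PARAMS):
    """Upper bound: treats the whole minimum as model weights (decimal GB)."""
    return min_gb * 1e9 * 8 / params


def fits(usable_gb, quant):
    """True when a build's usable memory meets the quoted minimum for a quant."""
    return usable_gb >= MIN_USABLE_GB[quant]


if __name__ == "__main__":
    for quant, min_gb in MIN_USABLE_GB.items():
        print(f"{quant}: <= {implied_bits_per_weight(min_gb):.2f} bits/weight")
    for name, usable in USABLE_GB.items():
        ok = [q for q in MIN_USABLE_GB if fits(usable, q)]
        print(f"{name}: fits {ok}")
```

If the whole minimum were weights, this works out to about 2.24 bits per weight at Q2_K and 4.8 at Q4_K_M. The true figure is presumably lower, since the minimum likely also covers KV cache and runtime overhead. The fit check agrees with the tables: the quad RTX Pro 6000 build clears Q2_K but not Q4_K_M, while the 8× H100 server clears both.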
8× Strix Halo cluster (1024 GB unified)
DGX B200 — 8× B200 server (1.44 TB HBM3e)
2× Mac Studio M3 Ultra 512 GB cluster (TB5 / MLX)
8× RTX Pro 6000 Blackwell server (768 GB)
DGX H200 — 8× H200 server (1.13 TB HBM3e)
12× RTX Pro 6000 Blackwell rack (1152 GB)
8× DGX Spark cluster (1024 GB unified, CUDA)
No plug-and-play build fits at Q5_K_M
Only used / DIY / homelab-cluster rigs fit Kimi K2.7 Code 1T (MoE) at this quant.
DGX B200 — 8× B200 server (1.44 TB HBM3e)
No plug-and-play build fits at Q8_0
Only used / DIY / homelab-cluster rigs fit Kimi K2.7 Code 1T (MoE) at this quant.
Last updated 2026-06-13