Kimi K2.7-Code cuts tokens 30%, but skips independent benchmarks

https://images.ctfassets.net/jdtwqhzvc2n1/1qgfxRq3zYmGo7J9jMkqCA/0d83440ff424b453d03efc45bb23cce1/kimi-gateway-smk1.jpg?w=800&q=75

Moonshot AI released Kimi K2.7-Code this week, an open-source update to its K2 coding model family, claiming leaner reasoning and double-digit performance gains.

K2.7-Code is built on the same trillion-parameter mixture-of-experts architecture as its predecessor K2.6, and drops in via an OpenAI-compatible API — which matters for teams already running K2.6 in production gateways.

When K2.6 launched in April, it topped OpenRouter's weekly LLM leaderboard — a ranking based on actual API routing decisions by developers, not self-reported benchmark scores.

Moonshot AI says K2.7-Code addresses what it calls "overthinking," reducing thinking-token usage by 30% compared to K2.6 — a number that would directly affect inference costs for teams running agentic workflows. Whether that efficiency gain holds on independent benchmarks is a question practitioners have already started raising publicly.

What Kimi K2.7-Code is

K2.7-Code is released under a Modified MIT license, with weights available on HuggingFace. The model is...

Copyright of this story solely belongs to venturebeat.com. To see the full text click HERE

Read more

https://images.axios.com/ipnTE2-5LrONsgbhTPiex2--oQ8=/0x0:1280x720/1366x768/2026/07/01/1782891632836.jpeg

Meta names Chief Marketing Officer Alex Schultz as its first-ever chief data officer, to manage AI analytics across the company; Denise Moreno is named CMO

Sponsor Posts Fast, affordable law for startups — Soxton automates startup legal so founders can move faster and sleep better. We handle incorporation, advisor, employment and commercial contracts. Join the waitlist for early access! Stop vibe coding analytics — Equals AI turns questions about your business into auditable spreadsheet models and dashboards.