Kimi K2 Just Hit 71.6% on SWE-Bench — The Open-Source Coding Crown Has a New King
Article summary
For months, the open-source coding model race has been a two-player game between Qwen3-Coder-480B and DeepSeek-V3.2, with the two trading blows around the 69–70% mark on SWE-bench Verified. But a dark horse just blew past both of them. Kimi K2, the latest open-weight coding model from Moonshot AI, has landed at 71.6% on SWE-bench Verified, the highest score ever recorded from a fully open-source model. To put that in perspective, it outperforms DeepSeek-V3.2 (~70%) and Qwen3-Coder-480B (69.6%) while…
Why it matters
Coding AI shifts how fast software ships and how much human review each change needs. DEV — ML reports that for months, the open-source coding model race has been a two-player game between Qwen3-Coder-480B and DeepSeek-V3.2 — trading blows around the 69–70% mark on SWE-bench Verified.
Full story on DEV — ML
Headlines aggregated via RSS for discovery on AIWedia. Original content © DEV — ML. We link to the source and do not republish full articles.