it’s new entrant week! today? Kimi-K2
an open weights model that’s competitive with Claude 4 Opus
- 1T, 32B active MoE
- a true agentic model, hitting all the marks on coding & tool use
- no training instability, due to MuonClip optimizer
new frontier lab to watch!
moonshotai.github.io/Kimi-K2/
it’s new entrant week! today? Kimi-K2
View original thread
34
6
8 hours later
note: they have NOT done RL yet. this is just an instruction-tuned model. And it’s hanging with the greats
5
1 hour later
i might be wrong on this. they do advertise it as an “agentic model”
2
1 hour later
okay, i think the real story is, K2 isn’t s long-thinking model, so it’s not quite in the same league as o3 or gemini
but it probably has been RL’d. that’s how you get it to be agentic
i’m not sure what the balance is, but it does seem like it’s a solid model to build off of
but it probably has been RL’d. that’s how you get it to be agentic
i’m not sure what the balance is, but it does seem like it’s a solid model to build off of
4