Bluesky Thread

it’s new entrant week! today? Kimi-K2

July 11, 2025 View original thread

it’s new entrant week! today? Kimi-K2

an open weights model that’s competitive with Claude 4 Opus

- 1T, 32B active MoE
- a true agentic model, hitting all the marks on coding & tool use
- no training instability, due to MuonClip optimizer

new frontier lab to watch!

moonshotai.github.io/Kimi-K2/

34 6

8 hours later

note: they have NOT done RL yet. this is just an instruction-tuned model. And it’s hanging with the greats

1 hour later

i might be wrong on this. they do advertise it as an “agentic model”

1 hour later

okay, i think the real story is, K2 isn’t s long-thinking model, so it’s not quite in the same league as o3 or gemini

but it probably has been RL’d. that’s how you get it to be agentic

i’m not sure what the balance is, but it does seem like it’s a solid model to build off of

More like this