Bluesky Thread

o3 mightily beats Gemini, Opus 4 and others in the game of Diplomacy

View original thread
o3 mightily beats Gemini, Opus 4 and others in the game of Diplomacy

only Gemini was able to win even one game, due to o3’s “ruthless” strategies

o3: ruthless
Gemini Pro: alliance builder
Opus 4: overly honest

www.reddit.com/r/singularit...
www.reddit.com
From the singularity community on Reddit: o3 is the top AI Diplomacy player, followed by Gemini 2.5 Pro
Explore this post and more from the singularity community
24 3
is Claude nice? yes ofc, but maybe also just tired bsky.app/profile/timk...
Tim Kellogg @timkellogg.me
there are reports that anthropic does this — serves a reduced quant under high load
tweet from @_xjdr:

you should legally be required to disclose what quantization level you are serving your current model at like it was a nutrition label. you should also be banned from dynamically adjusting quantization based on demand without notification. (you know who you are ...)
6 1
24 likes 3 reposts

More like this

×