R1 & K2 are high-taste models. For sure the only open models with high taste.
the fact that they've done basically zero RLHF and very little human alignment raises some questions about alignment work
then there's this whole other thing where the US gov't is trying to force alignment, when there's decent evidence that we really don't know how alignment actually works, even in non-controversial areas
bsky.app/profile/timk...
the irony here is that Grok 3 and 4 are the only models to violate this:
“Developers shall not intentionally encode partisan or ideological judgments into an LLM's outputs”