Bluesky Thread

i read through ~5 pages of the Olmo 3 tech report.. whoah

this is the best and most detailed summary of the current state of SOTA LLM training

nanochat is good for understanding LLM training, this tech report catches you up to SOTA methods
Tim Kellogg @timkellogg.me
tech report: www.datocms-assets.com/64837/176364...
7 hours later
read a bit on the plane — SO MUCH time spent on data curation and benchmark selection

they have a whole section on dynamically selecting benchmarks based on signal-to-noise

afaict they don’t even talk about baguettotron-style rephrasing
42 likes 4 reposts
