Bluesky Thread

Self-Improving Transformers

February 13, 2025

Self-Improving Transformers

They found that you can train LLMs on their own outputs by

1. generating *slightly harder* problems each time
2. filtering out low-quality outputs via majority voting
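
To make the two steps concrete, here is a minimal sketch of what such a loop could look like. It is not the paper's code: `sample_answer`, `make_problems`, and `train` are hypothetical callables standing in for a real model, a problem generator, and a training step, and the `min_agreement` threshold is an assumption. The votes here come from repeated samples of one stand-in model; the paper's exact voting setup may differ.

```python
from collections import Counter


def majority_vote(answers, min_agreement=0.6):
    """Return the most common answer if enough samples agree, else None.

    min_agreement is an assumed threshold, not a value from the paper.
    """
    if not answers:
        return None
    answer, count = Counter(answers).most_common(1)[0]
    return answer if count / len(answers) >= min_agreement else None


def self_improve(sample_answer, make_problems, train, rounds=3, n_samples=5):
    """Toy self-improvement loop.

    Each round:
      1. asks for problems one step harder than the previous round,
      2. samples several answers per problem from the current model,
      3. keeps only problems whose answers pass the majority vote,
      4. trains on the surviving (problem, label) pairs.

    All three callables are hypothetical placeholders.
    Returns a list of (difficulty, number_of_examples_kept) per round.
    """
    history = []
    for difficulty in range(1, rounds + 1):
        kept = []
        for problem in make_problems(difficulty):
            answers = [sample_answer(problem) for _ in range(n_samples)]
            label = majority_vote(answers)
            if label is not None:  # drop problems without enough agreement
                kept.append((problem, label))
        train(kept)
        history.append((difficulty, len(kept)))
    return history
```

The point of the filter is that a label only enters the next training set when the model's own samples mostly agree on it, so "slightly harder" has to stay close enough to what the model can already do for the vote to be meaningful.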

I have thoughts. I’m going to write a blog post. tl;dr: it’s interesting, but not the singularity

arxiv.org/abs/2502.01612

Self-Improving Transformers Overcome Easy-to-Hard and Length Generalization Challenges

Large language models often struggle with length generalization and solving complex problem instances beyond their training distribution. We present a self-improvement approach where models iterativel...

1 hour later

here bsky.app/profile/timk...

Tim Kellogg @timkellogg.me

Is this the singularity? Maybe, but I think it might just be benchmark saturation.

timkellogg.me/blog/2025/02...
