if you want to try a diffusion LLM, inception labs just released one. it goes 1000 tokens/sec on regular H100s, absolutely nuts how fast it is
speed alone makes these things interesting
www.inceptionlabs.ai/news
LLMs That Don't Gaslight You
A new language model uses diffusion instead of next-token prediction. That means it can back out of a hallucination before committing to the text. This is a big win for areas like law & contracts, where global consistency is valued
timkellogg.me/blog/2025/02...
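To make the "back out before committing" point concrete, here's a toy sketch of masked-diffusion-style decoding (an illustration only, not Mercury's actual algorithm): every position is re-proposed each denoising step, and a position filled in an earlier step can still be overwritten later if a higher-confidence proposal shows up. An autoregressive decoder has no equivalent move once a token is emitted.

```python
def diffusion_decode(propose, length, steps):
    """Toy masked-diffusion decoding loop (a sketch, not a real dLLM).
    `propose(tokens)` returns a (token, confidence) pair per position.
    Each step may fill empty slots AND revise earlier, lower-confidence
    picks -- that's the 'back out of a mistake' property."""
    tokens = [None] * length
    conf = [0.0] * length
    for _ in range(steps):
        for i, (tok, c) in enumerate(propose(tokens)):
            if c > conf[i]:  # revise whenever the new proposal is more confident
                tokens[i], conf[i] = tok, c
    return tokens

# Hypothetical proposer: the first pass makes a shaky guess at position 0,
# the second pass corrects it -- a revision diffusion allows but
# next-token prediction doesn't.
passes = iter([
    [("teh", 0.4), ("cat", 0.9), ("sat", 0.8)],  # early low-confidence guess
    [("the", 0.9), ("cat", 0.9), ("sat", 0.8)],  # later step revises position 0
])
print(diffusion_decode(lambda toks: next(passes), length=3, steps=2))
# -> ['the', 'cat', 'sat']
```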
10 hours later
seems like a promising application of dLLMs is real-time systems, where you have a strict upper bound on how long things can take, e.g. robotics, cars, etc.
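The latency argument can be sketched with a toy cost model (made-up numbers, just for the shape of the curves): an autoregressive decoder's wall-clock time grows with output length, while a diffusion decoder runs a fixed number of denoising steps over all positions in parallel, so its worst case is known up front.

```python
# Toy latency model (hypothetical per-token/per-step costs, not benchmarks).
def autoregressive_latency_ms(n_tokens, per_token_ms=10.0):
    # one forward pass per emitted token: cost scales with output length
    return n_tokens * per_token_ms

def diffusion_latency_ms(n_tokens, steps=16, per_step_ms=12.0):
    # fixed number of parallel denoising steps: cost is independent of
    # output length, giving the hard upper bound real-time systems want
    return steps * per_step_ms

print(autoregressive_latency_ms(1000))  # -> 10000.0
print(diffusion_latency_ms(1000))       # -> 192.0
```

In practice per-step cost isn't perfectly flat (longer sequences cost more per pass), but the bounded step count is what makes the deadline analysis tractable.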