A new paper dropped from DeepMind: Deliberation in Latent Space via Differentiable Cache Augmentation
The trouble is, it's not very readable. I tried making a thread here, but it got far too long, so it's a blog now:
timkellogg.me/blog/2024/12...
More like this
×