In this issue:
Looking ahead increases decoding speed
Another PaSS for parallel decoding
Attentions (plural) is all we need?
Keep reading with a 7-day free trial
Subscribe to LLM Watch to keep reading this post and get 7 days of free access to the full post archives.