this post was submitted on 10 Aug 2025
AI - Artificial intelligence
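I feel like diffusion LLMs would handle this better, since they refine the whole answer over several passes instead of committing to one token at a time.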
After “position 5,” an autoregressive LLM has one chance, one pass, to emit the right next token instead of starting another bullet point. And if it randomly samples another bullet point because the temperature is at 1 or whatever, it can't go back and fix it, so the whole answer is hosed.
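To put rough numbers on the temperature point, here's a toy sketch in Python. The two logits are made up for illustration (they don't come from any real model), and the two-token vocabulary is an assumption to keep it small: one token that correctly ends the list, one that starts another bullet.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Turn raw logits into sampling probabilities at a given temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for the token right after "position 5":
# index 0 = the token that ends the list, index 1 = another bullet point.
logits = [2.0, 0.0]

for temperature in (1.0, 0.5, 0.1):
    p_end, p_bullet = softmax_with_temperature(logits, temperature)
    print(f"T={temperature}: P(another bullet) = {p_bullet:.4f}")
```

Even when the model clearly prefers the right token, sampling at temperature 1 leaves a real chance of the wrong one, and an autoregressive decoder has no later pass to undo it. Lowering the temperature shrinks that chance but doesn't change the one-shot nature of the decision.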
Not that OpenAI would ever do that. They just want to deep fry autoregressive transformers more and more instead of, you know, trying something actually interesting.