Astral Codex Ten • 26498 implied HN points • 26 Feb 26
- Being trained to predict the next token is an optimization goal, not a literal account of inner thought; models learn higher-level representations and don’t literally reason by counting tokens.
- Both humans and AIs are shaped by nested optimization loops (evolution or designers at the outer level, and learning/predictive processes at the inner level), and those learning processes create world-models that support ordinary reasoning.
- Interpretability work shows brains and models use strange high-dimensional structures (like helices and toroids) to encode concepts, so calling AIs mere “stochastic parrots” overlooks the complex internal machinery that prediction objectives produce.