All the Harry Potter movie frames compressed into a single image — what happens when time collapses into state?

Finding Structure in Time: Decoding the 1990 Paper

In 1990, Jeffrey Elman ran a small experiment with a tiny recurrent network. No billion parameters. No attention. No massive datasets. Just a simple question: If a model learns to predict the next word, can it discover structure on its own? The answer changed how we think about intelligence. The Experiment: Learning Grammar Without Rules Elman trained a simple recurrent network (now called an Elman network) on sentences generated from a small artificial grammar. ...

February 22, 2026 · 6 min · 1131 words · Thiyagaraj T