
Finding Structure in Time: Decoding the 1990 Paper
In 1990, Jeffrey Elman ran a small experiment with a tiny recurrent network. No billion parameters. No attention. No massive datasets. Just a simple question: If a model learns to predict the next word, can it discover structure on its own? The answer changed how we think about intelligence. The Experiment: Learning Grammar Without Rules Elman trained a simple recurrent network (now called an Elman network) on sentences generated from a small artificial grammar. ...