Análisis · History & Fundamentals · Edition #0005

The AI timeline — milestones year by year

From Turing in 1950 to agents in 2026. The moments that changed everything, explained by why they matter today.

G
Germán Falcioni April 12, 2026
✦ Reading: 12 min
Visual timeline: AI milestones from 1936 to 2026, by decade
TL;DR

Ninety years of AI in roughly ten milestones. Cycles of optimism, winters, revivals. And why the last five years went exponential.

✦ Summarized with Claude at publish time
AI rewrite
Read it as…

The AI timeline — milestones year by year

Before we start: a mental image

Picture the history of AI as a long roller coaster. It climbs slowly for sixty years. It has two deep drops (the winters). And since 2012 it climbs at a slope almost nobody saw coming.

If everything feels fast right now, that's because you're on the steep part.

Before all that: the theoretical seed (1936-1950)

Long before useful computers existed, Alan Turing published "On Computable Numbers" (1936). It defines the Turing machine: a theoretical device that can solve any computable problem if you give it the right algorithm.

Pure abstraction. But it sets the floor.

In 1943, Warren McCulloch and Walter Pitts publish "A Logical Calculus of Ideas" and model an artificial neuron as mathematical logic. The first abstraction of how a brain might be simulated. Conceptual foundation for neural networks.

1950: Turing — "Computing Machinery and Intelligence"

Turing comes back with the almost philosophical question: can machines think? He proposes the Turing Test: if you can't tell a conversation with a machine from one with a human, then it thinks.

Curiosity: ChatGPT and Claude have been beating the Turing Test for years now. Nobody calls AI "done" because the test turned out to be superficial. But it was the starting point.

1956: Dartmouth — the official birth

John McCarthy, Marvin Minsky, Nathaniel Rochester and Claude Shannon convene a summer meeting at Dartmouth College. McCarthy coins the term Artificial Intelligence. The original proposal asked for 7,500 dollars at the time to bring ten people together for two months (Dartmouth archive, 1955).

They all believe they'll solve the problem that summer. They don't. But the discipline is officially born.

1966: ELIZA

Joseph Weizenbaum, at MIT, writes ELIZA. A program that simulates a psychotherapist. It reads "I'm sad" and replies "Why are you sad?" People bonded with it deeply.

ELIZA understood nothing. It did keyword search and answered with templates. But it demonstrated something important: humans project intelligence wherever we see coherent responses.

1974-1980: First Winter

The Lighthill report (1973) in the UK and DARPA cuts in the US shut off the funding spigot. Machines were slow. Algorithms were naive. There was no mass data. Academics kept working, just out of the spotlight.

1980-1987: the expert systems era

A new strategy: instead of machines that learn, build systems that encode human expert rules.

Digital Equipment Corporation's XCON system configured VAX computer orders and saved the company about 40 million dollars a year (DEC internal documentation, cited in Crevier 1993). The expert systems market reached an estimated 3 billion dollars by the late 1980s.

The problem: brittle. Change one rule and the machine fails. Coding millions of rules by hand is impossible.

1987-1997: Second Winter

Expert systems didn't scale. The hype collapsed again. The irony: in the quiet, in unglamorous labs, people like Rumelhart, Hinton and Williams (1986) had already published the backpropagation algorithm that would later make deep learning possible.

But in 1990 nobody had the data or the compute to make it useful.

1997: Deep Blue

IBM builds a machine that defeats Garry Kasparov, world chess champion. A psychological milestone. Deep Blue doesn't understand chess: it calculates millions of possible moves. The public doesn't care about the how. They care about the result. AI is back on the cover.

2009-2012: the break

Three things converge at once.

One: Nvidia's GPUs (CUDA, 2006 onward) enable massively parallel compute. Neural networks that used to take years now train in days.

Two: the internet provides mass data. Fei-Fei Li and her team at Stanford publish ImageNet (Deng et al., 2009): 14 million hand-labeled images. The first giant dataset with academic-quality curation.

Three: in 2012, Geoffrey Hinton's team — with Alex Krizhevsky and Ilya Sutskever — wins ImageNet with AlexNet, a deep convolutional network. They cut classification error from 26% to 15% in a single year (official ImageNet 2012 results).

The message: deep learning works. At scale. More data plus more compute plus more layers equals better performance.

From here, everything accelerates.

2014: Google buys DeepMind

DeepMind, founded by Demis Hassabis, had shown that neural networks could play Atari videogames better than humans. Google buys it for roughly 500 million dollars (figure reported by The Times and the BBC). A clear signal: AI is the future.

2016: AlphaGo beats Lee Sedol

DeepMind builds AlphaGo, which defeats the world Go champion. Go is exponentially more complex than chess. Decades were supposed to be left. AlphaGo combined deep neural networks with reinforcement learning and did it in one.

2017: "Attention Is All You Need"

Vaswani and others publish the paper introducing transformers: a brand new architecture based on the attention mechanism. Without this paper, there's no ChatGPT, no Claude, no Gemini. It now has more than 100,000 citations.

2018-2020: GPT-1, GPT-2, GPT-3

OpenAI scales transformers for text.

GPT-1 (2018): 117 million parameters. GPT-2 (2019): 1.5 billion. GPT-3 (2020): 175 billion parameters, trained on roughly 300 billion tokens (Brown et al., 2020).

Each leap brings capabilities nobody specifically asked for. We call them "emergent abilities" and we still don't understand exactly why they happen.

2021: Anthropic is founded

Dario and Daniela Amodei leave OpenAI with a group of researchers and found Anthropic. The focus: safe and aligned AI. Pioneers of the Constitutional AI approach (Bai et al., 2022).

2022: ChatGPT

OpenAI launches ChatGPT on November 30. It's a tuned version of GPT-3.5 with a simple conversational interface. It hits 100 million users in two months (Stanford AI Index 2024). The fastest-adopted consumer product in history up to that point.

2023-2024: the model war

March 2023: GPT-4 (initially limited multimodal, full multimodal in September with GPT-4 Vision). July 2023: Claude 2. December 2023: Gemini 1.0. March 2024: Claude 3 (Opus, Sonnet, Haiku). June 2024: Claude 3.5 Sonnet. October 2024: Claude 3.5 Sonnet (updated version). November 2024: Anthropic publishes the Model Context Protocol (MCP).

Prices fall. Access rises. AI stops being a monopoly.

2025-2026: agents and MCP

AI stops being a chat tool. It becomes an agent that acts.

It reads your calendar. Answers emails by category. Chains tasks. Plugs into Slack, GitHub, Google Workspace, your CRM. Executes in real time.

MCP is the layer that lets AIs access data and tools without every company having to build a custom integration for every model. It's the missing piece.

A question to close on

If you look at the AI cycles, there's a clear pattern: overpromise, disappointment, quiet research, breakthrough, mass hype. We've lived it at least twice in the twentieth century.

Are we in another cycle, or is this time different? I wrote my view in another piece in this series, "Is another AI winter coming?". Short version: I don't think a hard winter like the previous ones is coming. But I do think the current hype will correct. And the deciding factor comes down to one question: do we deliver what we promise?

In the meantime, mastering AI today is like mastering Excel in the 1990s. A separating skill. The next five years will split those who use AI like pros from those who don't.

Keep exploring

Want to go deeper?

01 Why did AI take so long if the idea is from 1950?

Because good prediction needs data and compute. In 1950 there was neither. The internet (data) and GPUs (compute) only showed up in the 2000s and 2010s.

02 What are the AI winters?

Periods when promises went unfulfilled, funding dried up, and research slowed. It happened in the 1970s and again in the late 1980s and 1990s.

03 When did AI actually become useful?

2012, with AlexNet. From there, a leap every two or three years. But the public explosion was ChatGPT in 2022.

Next article
Types of AI — the classification that matters