Análisis · The AI Landscape · Edition #0015

Meta and Llama — the quietest AI rollout in the world

More than five hundred million people use Meta AI each month inside WhatsApp, Instagram, and Facebook — most of them unaware there's a model called Llama running behind it. At the same time, Llama is the reason an actual open-source AI community exists today.

G
Germán Falcioni April 20, 2026
✦ Reading: 9 min
Meta AI lives inside WhatsApp, Instagram, and Facebook. Most of its users have never heard the word Llama.
TL;DR

Meta built the most distinctive strategy in the sector. On one side, it opened the weights of its Llama models to the world — Llama 2 in July 2023, Llama 3 in April 2024, Llama 4 in April 2025 — and effectively founded the modern open-weights community. On the other, it bolted Meta AI inside WhatsApp, Instagram, and Facebook: over 500 million monthly active users interact with it without thinking about it. The real strength is open-weights: you can host Llama 3.3 70B on reasonable hardware, pay nothing per token, and keep your data inside your own infrastructure. For companies with privacy requirements, that's gold. The real weakness: for general users, Meta AI is good enough but not better than Claude or ChatGPT on complex tasks. Distribution through WhatsApp is its biggest edge, not model quality.

✦ Summarized with Claude at publish time
AI rewrite
Read it as…

In late February 2023, a researcher at Meta AI Research — FAIR — published a paper titled "LLaMA: Open and Efficient Foundation Language Models." Access to the model was restricted: academic researchers had to fill out a form and sign a use agreement. Five days after publication, someone leaked the weights on 4chan. A BitTorrent link made its way through Twitter in under twenty-four hours.

Inside Meta, the initial reaction was what you'd expect — lawyers on alert, security teams running their playbook. But in the weeks that followed something unusual happened. The technical community started publishing fine-tunes, optimizations, versions running on ordinary laptops. Alpaca appeared, Vicuna appeared, a long list of derivatives followed. An ecosystem bloomed without Meta spending a dollar on evangelism.

In July 2023, Meta made the call that redefined its strategy. It shipped Llama 2 under a permissive license with commercial rights included. It wasn't an apology for the leak. It was an acknowledgment: the accident had revealed where the real moat actually sat.

That acknowledgment is the key to understanding why Meta competes so differently from Anthropic and OpenAI.

Yann LeCun's thesis

Meta's Chief AI Scientist is Yann LeCun, 2018 Turing Award recipient for his foundational work on convolutional networks. LeCun has been the public face of Meta's open-weights stance. His argument has two legs.

The first is technical. LeCun holds — and many researchers agree with him — that current autoregressive language models (GPT-4, Claude, Llama) aren't the path to genuinely general capability. In his read, the qualitative jumps will come from different architectures entirely. If that thesis holds, training one more model and charging a premium for it isn't the long-game winning move.

The second leg is strategic. If the base model ends up a commodity within a few years, whoever gives it away today builds an ecosystem; whoever charges for it today ends up tied to a moat that's eroding. The technical vocabulary here is "commoditize your complement" — a phrase Joel Spolsky popularized in 2002 but that Meta has applied to AI more consistently than anyone else. If what your rival sells at a premium becomes free, the rival loses the moat.

Between these two readings, Meta decided that base models are the complement, and that what needs protecting is distribution — WhatsApp, Instagram, Facebook — plus the data feeding its advertising business.

The Llama sequence and the infrastructure behind it

Worth walking through the release cadence to see the company's pace.

Llama 1 (February 2023). Four sizes: 7B, 13B, 33B, 65B parameters. Restricted access. Leaked on 4chan five days later. The modern open-source community starts here.

Llama 2 (July 2023). 7B, 13B, 70B. Permissive license with commercial rights. A foundational milestone for the open-weights industry.

Llama 3 (April 2024). 8B and 70B initially. The 70B became the workhorse for anyone wanting to host their own AI without depending on third parties. Months later came the 405B — competitive with GPT-4 on the main benchmarks.

Llama 3.1, 3.2, 3.3 (2024). Successive iterations. Llama 3.3 70B matched the original 405B on many benchmarks with a third of the parameters — a demonstration that efficiency still has room to run.

Llama 4 (April 2025). Mixture-of-Experts architecture. The largest model in the family, Behemoth, was unveiled at 2 trillion total parameters (not all active per token). It marked Meta's move to MoE, the same path OpenAI and Google had taken with their frontier models.

The infrastructure enabling that cadence is offensive. Meta stated in its Q4 2024 earnings call roughly $65 billion in capex for 2024, with guidance above $100 billion for 2025 — the most aggressive investment in the sector by an order of magnitude. As of late 2024, Meta reported more than 350,000 NVIDIA H100 GPUs installed. Those figures come from the company itself and should be read as "interested-party data," as we warn when covering similar metrics in our coverage of the broader landscape.

Meta AI in WhatsApp: the other play

While Llama grew as the open-weights standard, Meta ran a parallel move: jam Meta AI inside the apps most of the world already has installed.

Today Meta AI lives in three places. Inside WhatsApp chats (type "@Meta AI" and it appears). In Instagram's search bar. In Facebook Messenger. The number Meta communicated in 2025 is over 500 million monthly active users — a figure that, if confirmed by independent measurement, would make Meta AI the generative AI with the most users in the world by absolute volume.

That figure carries an important asterisk. Most of those users didn't choose Meta AI as a tool. Meta AI chose the users: it showed up inside the app they already had open, with no explicit decision in between. That's the textbook definition of platform advantage.

The real strength: hostable open-weights

Where Meta genuinely punches hardest is in a specific professional segment: companies with privacy requirements that can't send data to a third-party cloud.

A law firm handling matters under confidentiality, a hospital with medical records subject to strict regulation, a bank with transactional data — none of these can, for compliance reasons, ship that information to Anthropic or OpenAI. With Llama the story flips: you download the model, you run it on your own infrastructure, and the data never leaves your network.

Llama 3.3 70B runs on reasonable hardware — servers with four to eight GPUs — and delivers quality close to Claude or GPT-4 on extraction, classification, and summarization tasks. No per-token cost. No dependency on a vendor's uptime. No vendor lock-in. For that customer profile, Llama is the only viable option in the market, and it's a very good one.

The honest limitations

An honest pro-Claude read has to name what Meta doesn't do better.

For the individual professional who wants to delegate quality work — writing a careful report, analyzing a contract, reasoning through a complex problem across multiple turns — Meta AI and Llama aren't the first pick. Claude is still more consistent on literal instructions, more honest when it doesn't know something, and better at holding coherence across long conversations. That's a measurable work difference, not a theoretical detail.

The difference comes down to priority. Anthropic invests heavily in alignment and reliability with Constitutional AI. Meta invests heavily in scale, distribution, and efficiency. Two different bets. For serious professional use, Anthropic's bet pays off better. For ambient mass use on your phone and for private hosting at companies, Meta's bet is hard to beat.

To close, and to keep going

Meta is the hardest company to fit into a single narrative about the sector. It doesn't compete on the same lane as Anthropic (professional reliability), nor the same lane as OpenAI (mass consumer application with subscriptions), nor the same lane as Google (deep office suite integration). Meta plays its own game: open-weights at the model layer, glued-to-user distribution at the application layer, and offensive capex at the infrastructure layer.

That strategy has clear winners. For companies with sensitive data, Llama is the only viable path today. For five hundred million people already living inside WhatsApp, Meta AI is the de facto AI, even if they have no idea Llama exists. For the individual professional producing delegable work, Claude or ChatGPT still deliver more.

If you want to see how this bet compares to the other majors', The AI race sets out the full map. If you want to dig into the question of how models get measured honestly — because benchmarks published by the companies themselves need to be read with care — How AIs are measured is the next link.

Does your company have data that can't leave your network, or is your daily AI use exploratory and living on top of your phone? That question tells you which face of Meta actually serves you.

Next article
China's AI labs — DeepSeek, Qwen, and the other side of the race