Análisis · The AI Landscape · Edition #0018

Open vs closed models — the battle that shapes the future

What 'open' actually means, what 'closed' actually means, and why for a mature professional the right question isn't which one wins — it's which one to pair with which.

G
Germán Falcioni April 20, 2026
✦ Reading: 10 min
Two roads to the same destination. The difference shows up the day something breaks.
TL;DR

"Open" and "closed" are labels hiding a richer spectrum: open weights, open code, open data, permissive license. Meta's Llama has open weights but its license restricts commercial use above a user threshold — it isn't OSI-approved open source. DeepSeek-R1 is MIT, genuinely open. On the other side, Claude, ChatGPT, and Gemini publish papers and architectures but not weights, and they have a public thesis on why. The pro-open argument is control, auditability, low marginal cost at scale, and no vendor lock-in. The pro-closed argument is investment in alignment research, legal accountability when things go wrong, frontier technical quality. Both arguments are defensible. This blog's stance: a mature professional uses both — a frontier closed model for high-stakes work, a self-hostable open model as a backup and for data that can't leave the network.

✦ Summarized with Claude at publish time
AI rewrite
Read it as…

In February 2024, two things happened in the same week, and the contrast between them tells the state of this debate better than any comparison chart.

Monday: Meta released Llama 2 under a license that permitted nearly unrestricted commercial use — the most capable downloadable model available at that moment. Tens of thousands of developers pulled it that week.

Wednesday: Anthropic published a long essay signed by Dario Amodei arguing, with technical detail, why his company doesn't publish the weights of the most capable models it trains — and why no one should. Same industry, same week, two opposite positions. Both reasoned. Both defensible.

That February week crystallized the question this evergreen tries to answer without propaganda: when does an open model make sense, when does a closed one, and why does a serious professional probably end up using both?

What "open" actually means

The word "open" in AI is messier than it is in traditional software. Worth unpacking.

A model has at least three components that can each be open or closed independently.

Weights — the file with the trained parameters. Llama publishes weights. Claude doesn't.

Code — the software that runs the model and the software that trained it. Llama publishes part of the inference code. The full training code for large models almost no one publishes.

Data — the corpus the model was trained on. Practically no one publishes the full training data, not even the most "open" projects. Copyright, privacy, and competitive reasons all apply.

On top of all that sits the license: what uses are legally permitted. And here's the trick a lot of people miss.

Llama 2 and Llama 3's license allows commercial use in most cases, but it has a clause saying that if your product has more than 700 million monthly active users you need a separate license from Meta. That's why the Open Source Initiative — the organization that formally defines what "open source" means — doesn't consider Llama open source in the strict sense. Llama is "open-weights with a restrictive license."

DeepSeek released R1 in January 2025 under a plain MIT license. Mistral publishes Mistral 7B and Mixtral under Apache 2.0. Those are open source in the traditional meaning of the term.

The distinction matters because if you're going to build a product on top of the model, the license is what defines your future legal position.

The players on the map

Worth mapping the landscape as of April 2026.

Meta (Llama). The biggest player in the open-with-restrictions camp. Llama 3 and Llama 4 are capable models, with versions up to 405 billion parameters. Meta has a strategic reason to play open: if the AI standards get set by Meta instead of OpenAI, Meta wins position. For Meta, Llama isn't a product — it's a platform play.

DeepSeek and Qwen (China). The genuinely-open players of 2025-2026. DeepSeek-R1 under MIT license showed that a Chinese model trained on a relatively modest budget could compete on reasoning with frontier closed models. Qwen, from Alibaba, is Apache 2.0. They're the models that most pushed the "if China publishes open and the US closes, who wins the race?" conversation.

Mistral (France). The interesting middle play. Publishes some open models (Mistral 7B, Mixtral) and some closed ones (Mistral Large). Their read is pragmatic: open to win mindshare and talent, closed to monetize frontier capability.

xAI (United States). An odd case: Elon Musk promised to open models and in March 2024 released the Grok-1 weights under Apache 2.0. Subsequent versions (Grok-2, Grok-3) weren't released openly. It sits as an intermediate gesture.

OpenAI, Anthropic, Google. The closed side. They publish technical papers, general architectures, sometimes performance figures. They don't publish weights. Each has its own public reasoning about why.

The pro-open argument

The case for open models rests on four concrete pillars.

No vendor lock-in. If your business depends on a closed model and the provider changes pricing, changes terms, or sunsets the model, your business is captured. With open, you have the model. If the provider vanishes, you keep operating.

Auditability. You can inspect how the model responds to specific prompts in a controlled environment. You can study its biases. You can verify what it says when you present your actual use case. With closed, you trust what the provider tells you it does.

Self-hosting for sensitive data. If your data legally can't leave your network (health, finance, defense, legal), an open model running on-premise is often the only technically viable option.

Low marginal cost at high volume. Once you've amortized the infrastructure, each additional inference costs close to zero. Closed APIs charge per token. For very high-volume use cases, the math favors open.

Distributed innovation. The community can fine-tune, quantize, optimize. Llama has thousands of specialized variants on Hugging Face. A closed model depends only on what the provider chooses to release.

The pro-closed argument

The case for closed models also has solid pillars. It isn't just marketing.

Concentrated economics for alignment research. Training a frontier model costs hundreds of millions of dollars. Serious alignment research on top costs tens of millions more. Open models rarely have that budget dedicated to alignment because the money comes back through usage, not through licensing. Claude invests in Constitutional AI partly because it can charge for the result.

Operational accountability. When something goes wrong with Claude, there's a company to hold accountable. There are contracts, SLAs, indemnifications. With an open model running on your server, you're the accountable party. For regulated applications (health, legal, financial), that difference is large.

Revocability. If the provider detects a misuse mode, it can update the model, add mitigations, even revoke access to specific accounts. Once weights are out on the internet, there's no way back.

Frontier technical quality. Closed models tend to be ahead on general capability benchmarks because they have more compute and more proprietary data. The gap narrows every year, but as of April 2026 it still exists in several domains.

Anthropic's position in particular is honest about this: its argument isn't "closing is better for everyone." Its argument is "for us, as a company whose mission is safety, closing the weights is part of the operational commitment." That stance has integrity whether or not you agree with it.

The capture risk

There's a counter-argument to pro-closed worth naming without melodrama.

If the market concentrates into three or four closed companies setting prices, usage terms, and access, AI becomes infrastructure controlled by oligopoly. It isn't a wild hypothesis — it's what happened with cloud compute.

In that scenario, open models stop being just a technical option and become systemic health. They're the guarantee that if closed providers turn abusive, there's an alternative. That guarantee has value even if you don't use it.

That's why Meta, DeepSeek, and Mistral publishing open isn't just generosity — it's competitive pressure that benefits the rest of the market. Closed providers also lower prices and improve terms when there are viable open alternatives.

When do I use one, when the other?

Getting practical, an honest guide by user profile.

Solo professional or small business. Closed. Claude, ChatGPT, or Gemini on subscription. The cost of engineers to maintain an open model on-premise exceeds the savings.

Mid-size company with a technical team and moderate compliance. Mostly closed for quality, with some open experimentation if there's a specific use case (domain fine-tuning, massive batch processing).

Company with strict compliance (health, banking, defense). Open on-premise is typically required for sensitive data, closed as a complementary tool for non-sensitive tasks.

Data consulting with reputational risk. Frontier closed by default (reliability). Open as Plan B for cases where the client demands data residency.

Product built on top of the model. Depends on volume. At low volume, closed for simplicity. At high volume with tight margins, open and fine-tuned.

To close, and to keep going

The "open vs closed" question is old in software and old in hardware. Every time a new technology shows up, the debate gets run again. The historical answer is almost always the same: both coexist, each one settles into the niche where its trade-offs pay off best.

The same is happening in AI. The closed models dominate in frontier quality and in professional use with accountability. The open ones dominate in control, customization, and cost at scale. They'll keep coexisting.

The only bad answer is zealotry. Whoever tells you "the closed models are evil because they hoard knowledge" or "the open ones are irresponsible because they release dangerous capability" is flattening a debate that has real nuance on both sides.

Which constraint weighs most in your work today: immediate quality, data control, or cost at volume?

If you want to understand the player that pushed the open frontier the most over the past year, read Meta and Llama — the open bet. If you want the full competitive map of the industry, The AI race.

Next article
How AIs actually get measured