Análisis · The AI Landscape · Edition #0013

xAI and Grok — the AI that lives inside Twitter

Elon Musk founded xAI in July 2023 after falling out with the company he'd co-founded himself. Two and a half years on, Grok is the only frontier AI that sees the public conversation in real time — and the most polarizing bet on the market.

G
Germán Falcioni April 20, 2026
✦ Reading: 9 min
xAI bet on integration with the live conversation. Grok is the only frontier AI that sees X in real time while it answers.
TL;DR

xAI was founded by Elon Musk in July 2023, five years after he left the OpenAI board. Its product is Grok, integrated natively in X (formerly Twitter) with live access to the feed. The trajectory: Grok-1 (November 2023) → Grok-1.5 (March 2024) → Grok-2 (August 2024) → Grok-3 (February 2025) with "Big Brain mode" and extended reasoning. Grok-1 was released open-source on GitHub in March 2024 — 314 billion parameters, Mixture-of-Experts architecture — a move no other frontier lab has matched. Aurora (December 2024) added image generation with minimal filters. Behind all of it runs Colossus, the Memphis supercomputer with 100,000 H100 GPUs — the largest single cluster in the world by late 2024. Grok has a real strength (real-time search over X) and a real weakness (less consistent than Claude, ChatGPT, or Gemini at following instructions). For delegable professional work, it's not the first pick. For tracking trends and news as they happen, no alternative is this tightly integrated.

✦ Summarized with Claude at publish time
AI rewrite
Read it as…

On July 12, 2023, Elon Musk registered a company called X.AI Corp in the state of Nevada. Two days later he announced it publicly in a post on X — the social network he'd bought himself nine months earlier for $44 billion — saying the new company had a one-line mission: "understand the true nature of the universe."

The founding came preceded by a document nobody read in public. Musk had been hiring researchers from DeepMind, Google Brain, OpenAI, and Tesla for months. He'd assembled a team of eleven before the announcement. Over the following months he brought in Igor Babuschkin (ex-DeepMind), Tony Wu (ex-Google), Christian Szegedy (ex-Google Research), and other high-profile researchers. Eight months later they had their first model running.

That speed — idea in July 2023, model in November 2023 — is hard to explain without two pieces that came out later. One: Musk had been quietly buying GPUs before the founding. Two: in 2024 he'd build Colossus, the supercomputer that made the fast training cycles for Grok-2 and Grok-3 possible.

The fight that started it

The xAI story doesn't make sense without the Musk-OpenAI story.

Musk co-founded OpenAI in 2015 and put in the first serious funding — reportedly around $100 million across the first years. He sat on the board until February 2018, when he left citing Tesla conflicts (Tesla was also developing AI for self-driving). The split at that point was cordial.

Between 2018 and 2022 Musk watched from the outside as OpenAI moved to a hybrid structure — non-profit controlling a capped-return subsidiary — and took Microsoft's first investment. He didn't say much in public.

The break came after ChatGPT. Between late 2022 and mid-2023, Musk started posting sharp criticism: OpenAI had "betrayed its original mission," ChatGPT was "woke" and over-filtered, the current structure benefited Microsoft at humanity's expense. In 2024 he filed a lawsuit against OpenAI and against Altman personally — later withdrawn, later refiled — alleging they'd abandoned the non-profit mission.

That's the emotional temperature xAI was born into. It isn't a company standing alone in a market. It's a company built as an explicit answer to another one.

The model line

Grok-1 shipped in November 2023 inside X, initially only for X Premium+ subscribers ($22 a month at the time). It was a competent model but not competitive with GPT-4 — closer to GPT-3.5 on public benchmarks. What set it apart from day one was live access to the X feed.

Grok-1.5 arrived in March 2024, with context expanded to 128K tokens and strong gains on code and mathematical reasoning. That same month, xAI did something unexpected: it pushed the Grok-1 weights to GitHub under an Apache 2.0 license. 314 billion parameters, Mixture-of-Experts architecture with 25 percent of parameters active per token. No other frontier lab had released a model that large. Meta had released Llama, but with commercial restrictions and at smaller scale.

Grok-2 came in August 2024 and was the first xAI model that competed closely with the top tier. On LMSYS Chatbot Arena — the human preference benchmark — Grok-2 sat in the top five for several months. It also introduced image generation via a partnership with Black Forest Labs (FLUX).

Grok-3 launched in February 2025 and is xAI's current frontier model. The documented improvements are three: extended reasoning ("Big Brain mode" spends more inference compute on complex problems, similar to what OpenAI does with o1/o3), integrated X and web search, and lower latency. The comparative benchmarks xAI provided at launch should be read with the same caution as any interested-party evidence — worth waiting for independent external measurement.

Aurora and the image ecosystem

In December 2024 xAI launched Aurora, its in-house image generator (replacing the FLUX integration). The most visible difference with DALL-E, Midjourney, or Google's Imagen is the filter level: Aurora has substantially lower filters for content involving public figures, violence, and politically sensitive material.

This is deliberate positioning, not oversight. Musk has said publicly he considers competitors' moderation excessive. Aurora is that thesis turned into product.

The practical consequence is mixed. For independent creatives who keep bumping into DALL-E's filters, Aurora opens space. For companies that need guarantees their tool won't generate problematic images, Aurora is a legal and reputational risk they won't take on. Each path has its market.

Colossus, the invisible infrastructure

There's one piece of xAI that gets less attention than it deserves: the Colossus supercomputer in Memphis, Tennessee.

The project started in May 2024 and was running by September — four months to light up the first cluster. By late 2024, Colossus had 100,000 NVIDIA H100 GPUs running as a single training cluster. For context: the largest single cluster Meta had at that date was around 24,000 GPUs; Google's was comparable. Microsoft's cluster for OpenAI was larger in total but split across several sites.

Building it in four months required unusual engineering coordination: electrical capacity (the facility draws more power than some large hospitals), custom liquid cooling, massive InfiniBand networking. Honest caveat: Musk has a track record of being extremely fast on infrastructure when the company is under his direct control — Tesla Nevada, SpaceX Boca Chica, and now Colossus are the three examples.

Why does it matter if you use Grok? Because Colossus is the reason xAI can train frontier models without Google or Microsoft's headcount. Each training cycle that competitors take months to run, xAI runs in weeks. That iteration-speed advantage, if it holds, will be decisive.

The honest question for a professional

If you use AI for real work — analysis, drafting, code, documents — where does Grok fit?

The direct answer is: in a niche, added to others. Grok isn't the tool you delegate something important to without reviewing afterward. On instruction-following benchmarks, on production code heading to a repo, on contract analysis — Claude, ChatGPT, and Gemini are more predictable. That's the reality reported by professional users and reflected in independent evaluations.

Where Grok becomes unbeatable is when the work requires knowing what's happening on X right now. Journalists covering a live crisis. Traders reading the sentiment of an influential account. Brand teams catching a problem before it scales. Trend analysts who need to know what's being discussed today, not last month.

For those uses, Grok has no competition. And it's reasonable to keep it in the kit as a complementary tool — not as a replacement for the professional default.

To close, and to keep going

xAI is the youngest company in the frontier group. Two and a half years. It has already shipped a competitive model, built the largest single cluster in the world, open-sourced a 314-billion-parameter model, and runs the only live X access on the market.

It also carries the consequences of being born as an answer to another company. The xAI public narrative is tied to Musk — to his posts, his lawsuits, his shifting political alliances. For some users that adds; for others it subtracts. It's a real factor weighing on the decision to adopt it in a corporate setting.

Which AI capability changed the way you work most in the last six months: reasoning depth, speed, real-time data access, or predictability?

If you want the broader competitive picture, The AI race. If you want to dig into how model capability gets measured and compared, How AIs are measured.

Keep exploring

Want to go deeper?

01 Is Grok better than Claude or ChatGPT?

Depends on the job. Grok has one advantage no other frontiernAI matches: it sees the X feed in real time while it answers,nwith latency of seconds. If your work involves tracking what'snhappening live — traders, journalists, crisis teams, trendnanalysts — Grok has no direct competitor. For the rest ofnprofessional work — drafting, analyzing long documents,nproducing code that ships, reasoning through complex problemsn— Claude and ChatGPT remain more consistent, especially onnfollowing literal instructions. Grok is more unpredictable.nFor work you'll hand to a client, predictability matters morenthan live access.n

02 What's Colossus and why does it matter?

Colossus is the supercomputer xAI built in Memphis, Tennessee,nbetween May and September 2024. By late 2024 it had 100,000nNVIDIA H100 GPUs running as a single training cluster — thenlargest in the world at that point. It matters for twonreasons. First: standing it up in four months is a realnengineering feat, at a time when comparable projects at otherncompanies took years. Second: it shows xAI, despite being ansmall startup, has access to a compute layer comparable tonthe giants. That buys time and capacity to train frontiernmodels. It's also why Grok-3 could be trained that fast.n

03 Why did xAI release Grok-1 open-source?

In March 2024, xAI pushed Grok-1 weights to GitHub — 314nbillion parameters, Mixture-of-Experts architecture — undernan Apache 2.0 license. No other frontier lab (OpenAI,nAnthropic, Google) did anything comparable with a model thatnsize. Musk's public motivations were two: differentiate xAInfrom OpenAI (with whom he was feuding) and pressure thenindustry to open up models. The practical motivation is moreninteresting: Grok-1 was already being superseded by Grok-1.5ninternally, and releasing it carried low cost and highnsymbolic value. It was a real gesture but a strategicallynconvenient one — both can be true at once.n

Next article
Microsoft and Copilot — distribution as AI strategy