MatterGen, Meta's UMA, and the Ghost of LK-99: AI Materials Discovery Grows Up
For most of materials science history, discovering a new solid-state compound meant combining chemicals, firing the result, and analyzing what emerged—a process that could consume years per candidate. The first wave of AI for materials turned the search into a screening problem: train a model on known crystals, score millions of hypothetical compositions, and let a chemist decide what to synthesize. DeepMind's GNoME, published in Nature in November 2023, exemplified this approach at scale, cataloguing approximately 2.2 million thermodynamically stable inorganic crystals not previously in any public database.
A second wave has now arrived—one that does not screen chemical space but generates it on demand. Microsoft Research's MatterGen and Meta FAIR's Universal Model for Atoms (UMA) represent architecturally distinct bets on the same insight: condition a generative model on target properties and a researcher can describe what they need, receiving a prioritized batch of synthesis candidates in return. The unanswered question—and the shadow of LK-99 still haunts it—is how reliably these predictions survive contact with the bench.
MatterGen: From Gaussian Noise to Novel Crystal
Microsoft Research's MatterGen, published in Nature in February 2025 by Claudio Zeni and colleagues at Microsoft Research Cambridge, is the most ambitious generative model for inorganic materials released to date. Where earlier tools predicted properties of user-specified structures, MatterGen starts from random Gaussian noise and denoises it—jointly over atomic species, fractional coordinates, and lattice parameters—into a proposed crystal unit cell. The denoising backbone is an SE(3)-equivariant graph neural network, ensuring every output is invariant to rigid rotations and reflections of the crystal lattice, a mathematical necessity for physically valid crystal structures.
Training data totalled approximately 608,000 structures pooled from the Materials Project, the Alexandria database, and NOMAD. At inference, conditioning inputs include bulk modulus, electronic band gap, magnetic density of states, and hard elemental constraints. A materials engineer can specify a target—stable Li–Mn–O oxide, bulk modulus above 100 GPa, no rare-earth elements—and receive a ranked batch of novel candidate unit cells, structures that may not exist in any literature database.
What the Validation Numbers Actually Say
The key benchmark in the paper is a battery-cathode challenge. Constraining generation to Li–Mn–O ternary compositions with cathode-relevant property targets, roughly 35% of top-ranked generated structures passed DFT stability screening, defined as formation energy within 0.1 eV per atom of the convex hull using CHGNet as a surrogate. Six structures were taken to physical synthesis and confirmed as novel compounds absent from the training database—an empirical yield of approximately 8% of the raw generated pool.
The more consequential result is what DFT found in the passing set: approximately 65% of MatterGen's stability-confirmed structures had no close structural analog in any public database. That is the generator's core advantage over screeners—the model is not interpolating known compositions but reaching unmapped regions of chemical space that no human had reason to look in.
Meta's Universal Model for Atoms (UMA)
Where MatterGen addresses generation, Meta FAIR's UMA addresses the evaluator bottleneck: DFT itself. A single DFT relaxation for a moderately complex crystal can occupy 10–100 CPU-hours on a supercomputer. At the scales generative pipelines produce—tens of thousands of candidates per campaign—raw DFT is a hard cost wall. Machine learning interatomic potentials (MLIPs) approximate DFT energies and forces at milliseconds per structure, but the leading specialized models of 2022–2023 (NequIP, CHGNet, early MACE) degraded sharply outside their training distribution. Each was tuned to a narrow chemical family.
UMA was released alongside Meta's Open Materials 2024 (OMat24) dataset in late 2024, produced in collaboration with Lawrence Berkeley National Laboratory. OMat24 contains approximately 110 million DFT single-point calculations spanning broad inorganic chemistry—the largest publicly available ab initio dataset at the time of its release. The underlying architecture is eSEN (equivariant scalable equivariant network), an equivariant transformer that applies attention-based message-passing across atomic neighborhoods and scales to diverse multi-element systems without per-chemistry fine-tuning.
UMA is multi-task by design: it predicts total energy, per-atom forces, and virial stress in a single forward pass, making it directly deployable as a force field inside molecular dynamics or geometry optimization codes. On the Matbench Discovery benchmark—maintained by Janosh Riebesell at Cambridge and the Materials Project team—UMA-based stability predictions rank among the top published entries on the κ leaderboard (κ measures the fraction of truly stable crystals recovered within a fixed prediction budget), competing with MACE-MP-0 (Cambridge, openly released 2023) and CHGNet across diverse holdout chemistry.
The Combined Pipeline
The workflow that emerges when both tools are paired: generate with MatterGen, pre-filter with UMA or MACE-MP-0 as a fast DFT surrogate, then commit only surviving candidates to full quantum-chemistry calculations. The published MatterGen paper used CHGNet for this role; replacing it with UMA extends reliable scoring to exotic multi-element compositions where CHGNet's distribution runs thin. No public end-to-end benchmark yet captures the combined pipeline's empirical synthesis yield, but multiple independent groups described analogous stack architectures in preprints during early 2025—suggesting the approach is converging on a de facto standard.
The LK-99 Lesson: Validation Is the Real Bottleneck
In July 2023, Sukbae Lee and Ji-Hoon Kim of the Quantum Energy Research Centre in Seoul posted a preprint claiming that Pb₉Cu(PO₄)₆O—a copper-doped lead apatite they named LK-99—was a room-temperature, ambient-pressure superconductor. Within 72 hours the claim had been broadcast to millions of social-media followers and dozens of global replication attempts were underway. By August 2023, the scientific consensus was unambiguous: the partial levitation visible in the circulated videos was ferromagnetic behavior from a copper sulfide (Cu₂S) impurity phase, not a Meissner effect. LK-99 is an insulator whose resistivity increases with temperature—the opposite of superconducting behavior.
The lessons for AI-driven discovery are specific and actionable:
- DFT stability ≠ synthesizability. DFT predicts equilibrium thermodynamics. It says nothing about whether a structure is kinetically accessible from available precursors at achievable temperatures and pressures.
- Impurity phases dominate experimental signals. LK-99's anomalous electrical behavior came entirely from a minority Cu₂S phase unrelated to the intended compound. AI models trained on idealized single-crystal DFT structures have no representation of what forms during actual synthesis.
- Property prediction requires property-specific validation. A model trained to predict formation energy cannot be trusted to predict superconducting Tc, ionic conductivity, or catalytic turnover frequency. The failure mode is silent extrapolation into an unrepresented target space.
- Preprint velocity outpaces replication infrastructure. Extraordinary claims now travel at social-media speed. Multiple serious experimental groups dropped weeks of planned work on replication attempts that could never have succeeded—a real cost to the field's credibility and resource allocation.
The Room-Temperature Superconductor Question in 2026
No synthesis-validated room-temperature superconductor exists as of May 2026. Microsoft Quantum's Majorana 1 chip (announced February 2025) pursues topological qubits in semiconductor–superconductor heterostructures operating at millikelvin temperatures—a fundamentally distinct objective that relies on known low-temperature superconductors, not the discovery of ambient-condition ones. Microsoft's earlier high-profile Nature work on topological quantum states faced data-integrity challenges and retraction; the Majorana 1 program represents a rebased experimental approach.
AI search tools have been applied to the most promising known superconductor families: hydrogen-rich clathrates such as LaH₁₀-type structures (which require megabar pressures to stabilize) and nickelate perovskites. Several 2024–2025 preprints report AI-generated candidates with predicted Tc above 273 K under high pressure, but none has survived reproducible experimental confirmation across independent laboratories at the claimed conditions. The gap between predicted superconducting by a neural network and confirmed in three independent synthesis labs is the central unsolved problem, and it will not be closed by a better generator or a bigger dataset alone.
Real Partnerships, Real Money
The most documented industry-government AI materials collaboration remains Microsoft + Pacific Northwest National Laboratory (PNNL), active since 2023. Using Azure OpenAI (GPT-4) for chemical reasoning and candidate filtering combined with quantum chemistry calculations, the joint team narrowed approximately 32 million hypothetical solid electrolyte compositions to 23 high-priority synthesis targets in roughly 80 compute-hours. One candidate—a novel sodium-based ionic conductor—was synthesized at PNNL and demonstrated measurable ionic conductivity, reported publicly in early 2024. Performance was not yet commercially competitive, but the closed AI-to-synthesis loop was confirmed end-to-end, which was the demonstration's point.
Meta's Open Catalyst Project (OCP)—a collaboration between Meta FAIR and Carnegie Mellon University running since 2020—counts Shell and several electrolysis hardware companies as active ecosystem contributors. The OC22 and ODAC23 dataset extensions specifically target catalyst design for direct air capture, commercially motivated by growing carbon-credit market economics. OMat24 extends OCP's coverage to battery-relevant inorganic chemistries, deepening the overlap with those same industrial partners.
Citrine Informatics (San Francisco) operates a commercial AI platform for industrial materials R&D centered on active-learning workflows, where experimental results continuously update the model in closed loop rather than relying on pre-trained generalization. Clients span aerospace alloys, polymer formulations, and battery separators; partnership details are typically confidential. Chemistry-AI startup Chemify—spun out from Leroy Cronin's lab at the University of Glasgow—focuses on synthesis automation: the system not only proposes target compounds but plans and robotically executes the synthesis route, directly targeting the kinetic-accessibility gap that prediction-only models cannot close.
The Stack as of 2026
A well-resourced materials lab running AI tooling today operates roughly this pipeline:
- Specify target property profile and elemental constraints.
- Generate 5,000–50,000 candidates with MatterGen or a fine-tuned derivative.
- Pre-filter using UMA or MACE-MP-0 as a DFT surrogate; retain the top 1–2% by predicted stability and target property score.
- Run full DFT (VASP or Quantum ESPRESSO) on the filtered set—typically 100–500 structures.
- Submit the top 10–30 DFT-confirmed candidates to synthesis with manual or AI-assisted route planning.
- Feed experimental outcomes back to retrain the surrogate model.
The weakest link is step 5. Current AI tools are strong at predicting thermodynamic stability; they are poor at predicting whether a structure is kinetically accessible from cheap precursors at industrial temperatures. Solid-state synthesis route planning for inorganic crystals remains an open research problem—which is why the LK-99 lesson applies even when the generator and surrogate-DFT components are excellent. The next meaningful benchmark in this field will not be a leaderboard κ score. It will be the first AI-generated material independently confirmed inside a shipped commercial product.
Frequently asked
What makes MatterGen different from GNoME or earlier AI materials tools?
What is a machine learning interatomic potential (MLIP) and why does Meta's UMA matter?
Why exactly did LK-99 fail, and what does that mean for AI predictions?
Has any AI-first-discovered inorganic material made it into a commercial product?
Is room-temperature ambient-pressure superconductivity theoretically possible?
Sources & further reading
- Zeni et al. — MatterGen: a generative model for inorganic materials design (Nature, Feb 2025)
- Meta AI — Open Materials 2024 (OMat24) and the Universal Model for Atoms (UMA)
- Google DeepMind — Millions of new materials discovered with deep learning (GNoME, Nov 2023)
- Matbench Discovery — crystal stability prediction leaderboard (Materials Project / Cambridge)
- Nature News — LK-99 isn't a superconductor — how did we get here? (Aug 2023)
- Microsoft Research — AI accelerates materials discovery toward sustainable battery electrolytes (2024)
Last reviewed May 01, 2026. AI Pulled News is editorial; corrections welcome at /news/about.html.