← All stories
AI & Tech

Architecture search reveals transformers need less structure at trillion-parameter scale

Cognitive Revolution · Intelligence on the Edge: Liquid AI's Ramin Hasani on the Search for Device-Native Foundation Models · July 4, 2026
Architecture search reveals transformers need less structure at trillion-parameter scale
Cognitive Revolution
Cognitive Revolution
Intelligence on the Edge: Liquid AI's Ramin Hasani on the Search for Device-Native Foundation Models
"Scaling laws define architecture. When we are talking about Transformers being this revolutionary thing, we are talking about maximum scale. The reason why transformer architecture and attention mechanism is such a brilliant architecture is the fact that it is unstructured. There is no structure. The larger neural networks that you make into infinite size, the larger neural networks you make, the more you want them to become less and less structured. As soon as you start adding a little bit of bias in that architecture at scale, things become completely messed up."
Liquid AI's automated architecture search across models from 10 million to 72 billion parameters revealed a fundamental scaling principle: larger models require less architectural bias and structure, while smaller specialized models benefit from complex gating and nonlinearity. This insight challenges assumptions about optimal architectures and suggests transformers' dominance at frontier scale is mathematically justified.

About this episode

Nathan Labenz interviews Ramin Hassani, CEO and co-founder of Liquid AI, in a technically deep exploration of biologically-inspired neural architectures and the future of efficient AI systems. Hassani traces Liquid AI's origin to a decade of MIT research into liquid neural networks—differential equation-based systems inspired by the 300-neuron brain of C. elegans worms that can perform complex control tasks like autonomous parking with just 12 neurons. The breakthrough came in 2022 when the team solved century-old neuronal dynamics equations in closed form, enabling these nonlinear systems to scale from hundreds to potentially billions of neurons. Today, Liquid AI ranks fifth in the US for foundation model downloads on Hugging Face with over 1 million weekly downloads, competing against Google, Meta, Microsoft, and NVIDIA while using just 1,000 GPUs. The company developed an Automated Foundation Model Design system that searches architecture space with hardware in the loop, testing on actual downstream tasks rather than proxy metrics. This revealed a fundamental scaling principle: smaller models benefit from complex gating and architectural bias, while trillion-parameter systems require maximal unstructured computation like pure attention. Liquid's LFM models use primarily gated convolutions rather than attention, achieving competitive quality at dramatically lower compute and memory footprints. The company has secured partnerships with Shopify for production deployment and Mercedes-Benz for in-car intelligence using 600-megabyte models. Hassani argues the trillion dollars of smartphones and laptops shipped annually represents untapped substrate for local AI that current foundation models cannot efficiently utilize, and warns semiconductor companies they must build their own intelligence layers like NVIDIA's Nematron or risk losing competitiveness. He closes with a techno-optimist vision of curiosity-driven research enabled by AI agents, while noting current architectures likely cannot match human brain efficiency without discovering new emergent learning mechanisms beyond next-token prediction.

Key takeaways

More stories More from Cognitive Revolution