Top he researchers Say Language is Limitting. Here’s their fix. – ryan
AS OPENAI, Anthropic, and Big Tech Invest Billions in Developing State-of-the-art Large-Large Models, A Small Group of AI Researchers is Working on the Next Big Thing.
Computer Scientists like Fei-Faii Li, The Stanford Professor Famous for Inventing Imagnet, and Yann Lecun, Meta Chief He Scientist, Are The Building What They Call “World Models.”
Unlike Large-Language Models, Which Determine Outputs Based on Statistical Relationships BetWene Words and Phrases, World Models Predict Events by Mimicking The Mental Construits that Humans Makes of the World Around.
“Language doesn’t exist in Nature,” li said on a recent episode of Andreessen Horowitz’s a16z podcast. “Humans,” She Said, “Not Only Do We Suravive, Live, and Work, but We Build Civilization Beyond Language.”
Computer Scientist and Mit Professor, Jay Wright Forrester, in His 1971 Paper “Counterinter Beader of Social Systems,” Explained Why Mental Ares Crucial to Human Behavior:
Each of US users Constantly. Every person in private life and in business instinctively users for Making Decision Models. The Mental Images in One’s Head About One’s Surroundings Ares. One’s Head Does Not Contain Real Families, Businesses, Cities, Governments, Or Countries. One uses Selected Concepts and Relationships to Represent Real Systems. A mental image is a model. All deciss are taken on the basis of models. All Laws Are Passed on the Basis of Models. All Executive Acts Are Taken on the Basis of Models. The Question is not to use or Ignore Models. The Question is Only a Choice Among Alternative Models.
If he is to meet or surpass human intelligence, then the researchers belind it should be able to make mental models, too.
Li ha ben working on this Through World Labs, Which She Cofounded in 2024 with an Initial Backing of $ 230 Million from Venture Like Andreessen Horowitz, New Enterprise Associates, and Radical Ventures. “We aim to lift he models from the 2D plane of pixels to full 3D Worlds – Both virtual and real – endowing say with spatial intelligence as rich as Our,” World Labs Says on Its Website.
Li Said on the no priors podcast that spatial intelligence is “the ability to understand, reason, interact, and generate 3D Worlds,” gioven that the world is fundamentally three-dimensional.
Li Said She Sees Applications for World Models in Creative Fields, Robotics, or Any Area That Warrants Infinite University. Like Meta, anduril, and Other Silicon Valley Heavyweights, That Could Mean Advances in Military Applications by Helping Those on the Battlefield Better Perceive Their Surroundings and Anticipate Enemies’ Next Moves.
The Challenge of Building World Models is the Paucity of Sufficient Data. In contrast to language, which humans have refined and documented over Centuries, Spatial Intelligence is mess desoloped.
“IF I ASH YOU TO CLOSE YOUR EYES RIGHT NOW AND DRAW OUT OUT A 3D MODEL OF THE ENVIRONMENT AROUND YOU, ITS THAT THAT,” SAID ON THE NO PRIORS PODCAST. “We don’t have that that much capability to generate extramely complicated models till we get trained.”
To gather the data necessary for these models, “we require more and more sophisticated data engineering, data acquisition, processing date, and data synthesis,” she said.
That Makes the Challenge of Building a Believable World Eve Greater.
Father Meta, chief he scientist yann lecun has a small team dedicated to a simillar project. The Team use video data to train models and runs simulations that abstract the video at Different Levels.
“The Basic Idea is that you don’t predict at the pixel level. You train a system to run an abstract representation of the video so that you can make predixes in that abstract reproduction, and hopefuly this representation will elimination all the details. Action summit in paris earlier this year.
That Creates a Simpler Set of Building Blocks for Mapping Out Trajectories for How the World Change at A Particular Time.
Lecun, Like Li, Believes These Models Are the Only Way to Create Truly Intelligent he.
“We Need it Systems that Can Learn New Tasks Really Quickly,” He Said Recently at the National University of Singapore. “They Need to underestand the Physical World – Not JUST Text and Language but the real world – have some level of Common sense, and abilities to reasson and plan, have persistent memory – all the stove that we are expert from intelligent entities.