
World Models are Next. Now World Data is Key
What are world models?
“World models learn by watching video or digesting simulation data and other spatial inputs, building internal representations of objects, scenes, and physical dynamics.”
“Instead of predicting the next word, as a language model does, they predict what will happen next in the world, modeling how things move, collide, fall, interact, and persist over time.”
“The goal is to create models that understand concepts like gravity, occlusion, object permanence, and cause-and-effect without having been explicitly programmed on those topics.”
This makes me think that humanoids and other robots, as well as autonomous vehicles, are going to be big beneficiaries of world models.
So how does this relate to data?
Data is one of the key challenges. Those building large language models have been able to get most of what they need by scraping the breadth of the internet.
“World models also need a massive amount of information, but from data that’s not consolidated or as readily available.”
“‘One of the biggest hurdles to developing world models has been the fact that they require high-quality multimodal data at a massive scale in order to capture how agents perceive and interact with physical environments,’ Encord president and co-founder Ulrik Stig Hansen said in an email interview.”
“Encord offers one of the largest open source datasets for world models, with 1 billion data pairs across images, videos, text, audio, and 3D point clouds as well as a million human annotations assembled over months.”
“But even that is just a baseline, Hansen said. ‘Production systems will likely need significantly more.’”
As I was thinking through world models and which data matters, a founder I was chatting with showed me this company:
AI is moving fast. Not a day goes by without some sort of news coming out. The underlying fact is that data is one of the key pillars of all of this AI innovation. It now seems that the data needs of world models will be the next area of attention.

Sequentum is thrilled to announce that Sequentum Cloud’s AI Magic Wand feature is now in beta! Read the press release here.
The new AI-augmented feature builds agents in seconds and lets you customize them down to the atomic level.
Join our CEO, Sarah McKenna, on The Ravit Show on Tue, Nov 25th at 4pm EST for our launch celebration and see our new AI tool in action. Sign up here.
View the demo here.
Access documentation here.
Start a free trial of Sequentum Cloud: www.sequentum.com/cloud
