Oliver Cameron, co-founder and CEO of Odyssey ML, introduced Starchild-1 as the first real-time multimodal world model that generates interactive simulations incorporating synchronized audio and accepts continuous text, speech, and action inputs
Starchild-1 uses causal autoregressive prediction across audio and video to support stable long-horizon interactive simulations, advancing toward general-purpose world simulators.
😍😍😍😍
Introducing Starchild-1 from @odysseyml, the first ever real-time multimodal world model. This a model that can generate interactive simulations of the world that you can—for the first time ever—hear. Starchild-1 represents a big step towards a general-purpose world simulator.
@olivercameron @odysseyml a serious technical step, as audio forces the model to learn hidden physical and social structure that silent video can often fake.
So many possibilities I can think of right away.
Introducing Starchild-1 from @odysseyml, the first ever real-time multimodal world model. This a model that can generate interactive simulations of the world that you can—for the first time ever—hear. Starchild-1 represents a big step towards a general-purpose world simulator.
Introducing Starchild-1 from @odysseyml, the first ever real-time multimodal world model.
This a model that can generate interactive simulations of the world that you can—for the first time ever—hear.
Starchild-1 represents a big step towards a general-purpose world simulator.
Starchild-1 is an early step beyond world models that learn only from visual observation, toward systems that learn from richer multimodal interaction with the world.
Introducing Starchild-1 from @odysseyml, the first ever real-time multimodal world model. This a model that can generate interactive simulations of the world that you can—for the first time ever—hear. Starchild-1 represents a big step towards a general-purpose world simulator.
My dream is that models like Starchild-1 unlock entirely new forms of education, gaming, companionship, robotics, and brand new computing devices.
We're so early on this journey, but I'm so excited. The team did truly great work here.
Starchild-1 is an early step beyond world models that learn only from visual observation, toward systems that learn from richer multimodal interaction with the world.
We're also releasing an accompanying technical report of Starchild-1, to share our learnings and encourage further research in this direction!

My dream is that models like Starchild-1 unlock entirely new forms of education, gaming, companionship, robotics, and brand new computing devices. We're so early on this journey, but I'm so excited. The team did truly great work here. https://odyssey.ml/introducing-starchild-1