Odyssey-2 Pro
Odyssey-2 Pro is a frontier general-purpose world model that generates continuous, interactive simulations you can integrate into products via the Odyssey API. It is presented as a pivotal moment for world models, similar to what GPT-2 was for language models. The model is trained on large amounts of video and interaction data to learn how the world evolves frame by frame, and it outputs minutes-long simulations that users can interact with in real time rather than fixed short clips. Odyssey-2 Pro streams 720p video at up to ~22 FPS that responds instantly to prompts and actions, with improved physics, richer dynamics, more authentic behaviors, and sharper visuals. It supports embedding interactive streams, viewable streams, and parameterized simulations into applications through simple SDKs in JavaScript and Python. Developers can integrate the model in under ten lines of code to create open-ended, interactive video experiences in which users’ inputs shape evolving scenes.
NVIDIA Cosmos
NVIDIA Cosmos is a developer-first platform of state-of-the-art generative World Foundation Models (WFMs), advanced video tokenizers, guardrails, and an accelerated data processing and curation pipeline designed to speed up physical AI development. Its models are trained on an immense dataset that includes 20 million hours of real-world and simulated video. Developers working on autonomous vehicles, robotics, and video analytics AI agents can use it to generate photorealistic, physics-aware synthetic video data, rapidly simulate future scenarios, train world models, and fine-tune custom behaviors. The platform includes three core WFM types: Cosmos Predict, which generates up to 30 seconds of continuous video from multimodal inputs; Cosmos Transfer, which adapts simulations across environments and lighting conditions for domain augmentation; and Cosmos Reason, a vision-language model that applies structured reasoning to spatiotemporal data for planning and decision-making.
Project Genie
Project Genie is an experimental AI system from Google that generates interactive worlds in real time. It allows users to create living, explorable environments using simple text or image prompts. As you move through a world, Genie dynamically builds the landscape around you, making each experience unique. Users can design characters and choose how they explore, from walking and driving to flying and riding. The platform supports a wide range of environments, including natural landscapes, fictional worlds, and scenes generated from photos or artwork. Genie reacts to movement, physics, and user actions to create a continuous sense of discovery. Project Genie showcases the future of real-time, AI-generated interactive environments.
Marble
Marble is an experimental AI model, internally tested by World Labs, that is a variant and extension of the company’s Large World Model technology. Offered as a web service, it turns a single 2D image into a navigable spatial environment. Marble has two generation modes: a smaller, fast model that produces rough previews for quick iteration, and a larger, high-fidelity model that takes longer (around ten minutes in one example) but produces a significantly more convincing result. The value proposition is instant, photogrammetry-like image-to-world creation without a full capture rig: a single shot becomes an explorable space for memory capture, mood boards, archviz previews, or creative experiments.