Description
Genie 3 is Google DeepMind's general-purpose world model. From a text prompt, it generates interactive 3D environments in real time at 720p resolution and 24 frames per second, and keeps them consistent for several minutes. Users and embodied agents can explore these environments and interact with natural phenomena from different viewpoints, including first-person and isometric perspectives. An emergent long-horizon visual memory keeps environmental details stable over long interactions: off-screen elements persist, and a location stays spatially coherent when it is revisited. Genie 3 also supports “promptable world events,” which let users change a scene on the fly, for example by altering the weather or adding new objects. It is designed for embodied-agent research and works with systems such as SIMA, supporting goal-directed navigation and the execution of complex tasks.
Description
Odyssey-2 Pro is a general-purpose world model that generates continuous, interactive simulations and can be built into products through the Odyssey API. The vendor likens its significance to the impact GPT-2 had on language processing. Trained on large video and interaction datasets, the model learns how events unfold frame by frame and produces simulations that run for minutes rather than brief, static clips. With improved physics, richer dynamics, more lifelike behavior, and sharper visuals, Odyssey-2 Pro streams 720p video at approximately 22 frames per second and responds immediately to user prompts and actions. SDKs for JavaScript and Python let developers embed interactive streams, viewable streams, and parameterized simulations in their applications, and an integration takes fewer than ten lines of code. The resulting open-ended video experiences change in response to user interaction.
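To put the frame rates quoted in the two descriptions in perspective, the short Python sketch below works out the time available to produce each frame and the resulting pixel throughput. It is illustrative arithmetic only and uses neither product's API. It assumes that 720p means the conventional 1280x720 frame size, which neither description states, and it treats Odyssey-2 Pro's "approximately 22" frames per second as exactly 22. The function and constant names are made up for this example.

```python
# Illustrative arithmetic only; no vendor SDK or API is used here.
# Assumption: "720p" is the conventional 1280x720 frame size.
PIXELS_720P = 1280 * 720


def frame_budget_ms(fps: float) -> float:
    """Time available to generate a single frame, in milliseconds."""
    return 1000.0 / fps


def frames_in(minutes: float, fps: float) -> int:
    """Number of frames generated over the given duration."""
    return round(minutes * 60 * fps)


# Frame rates as quoted in the descriptions (22 fps is approximate).
quoted_fps = [("Genie 3", 24.0), ("Odyssey-2 Pro", 22.0)]

for name, fps in quoted_fps:
    megapixels_per_second = PIXELS_720P * fps / 1e6
    print(
        f"{name}: {frame_budget_ms(fps):.1f} ms per frame, "
        f"{frames_in(1, fps)} frames per minute, "
        f"{megapixels_per_second:.1f} megapixels per second"
    )
```

Under these assumptions, each model has a budget of a few tens of milliseconds per frame, and a session lasting several minutes involves thousands of generated frames that must stay mutually consistent.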
API Access
Has API
API Access
Has API
Integrations
Gemini
Gemini Enterprise
JavaScript
Odyssey
OpenAI
Project Genie
Python
Integrations
Gemini
Gemini Enterprise
JavaScript
Odyssey
OpenAI
Project Genie
Python
Pricing Details
No price information available.
Free Trial
Free Version
Pricing Details
No price information available.
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
Google DeepMind
Country
United Kingdom
Website
deepmind.google/discover/blog/genie-3-a-new-frontier-for-world-models/
Vendor Details
Company Name
Odyssey ML
Founded
2023
Country
United States
Website
odyssey.ml/the-gpt-2-moment-for-world-models