Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
Hunyuan Motion, often referred to as HY-Motion 1.0, represents an advanced AI model designed for transforming text into 3D motion, utilizing a billion-parameter Diffusion Transformer combined with flow matching techniques to create high-quality, skeleton-based animations in mere seconds. This innovative system comprehends detailed descriptions in both English and Chinese, allowing it to generate fluid and realistic motion sequences that can easily integrate into typical 3D animation workflows by exporting into formats like SMPL, SMPLH, FBX, or BVH, which are compatible with software such as Blender, Unity, Unreal Engine, and Maya. Its sophisticated training approach includes a three-phase pipeline: extensive pre-training on thousands of hours of motion data, meticulous fine-tuning on selected sequences, and reinforcement learning informed by human feedback, all of which significantly boost its capacity to interpret intricate commands and produce motion that is not only realistic but also temporally coherent. This model stands out for its ability to adapt to various animation styles and requirements, making it a versatile tool for creators in the gaming and film industries.
Description
Molmo 2 represents a cutting-edge suite of open vision-language models that come with completely accessible weights, training data, and code, thereby advancing the original Molmo series' capabilities in grounded image comprehension to encompass video and multiple image inputs. This evolution enables sophisticated video analysis, including pointing, tracking, dense captioning, and question-answering functionalities, all of which demonstrate robust spatial and temporal reasoning across frames. The suite consists of three distinct models: an 8 billion-parameter variant tailored for comprehensive video grounding and QA tasks, a 4 billion-parameter model that prioritizes efficiency, and a 7 billion-parameter model backed by Olmo, which features a fully open end-to-end architecture that includes the foundational language model. Notably, these new models surpass their predecessors on key benchmarks, setting unprecedented standards for open-model performance in image and video comprehension tasks. Furthermore, they often rival significantly larger proprietary systems while being trained on a much smaller dataset compared to similar closed models, showcasing their efficiency and effectiveness in the field. This impressive achievement marks a significant advancement in the accessibility and performance of AI-driven visual understanding technologies.
API Access
Has API
API Access
Has API
Integrations
Hugging Face
Ai2 OLMoE
Blender
Bluesky
GitHub
Imagen3D
Maya
Olmo 2
Threads
Unity
Integrations
Hugging Face
Ai2 OLMoE
Blender
Bluesky
GitHub
Imagen3D
Maya
Olmo 2
Threads
Unity
Pricing Details
No price information available.
Free Trial
Free Version
Pricing Details
No price information available.
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
Tencent Hunyuan
Founded
1998
Country
China
Website
hunyuan.tencent.com
Vendor Details
Company Name
Ai2
Founded
2014
Country
United States
Website
allenai.org/blog/molmo2