Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
MiniMax‑M1, released by MiniMax AI under the Apache 2.0 license, is a reasoning model built on a hybrid-attention architecture. It supports a 1 million-token context window and can generate up to 80,000 tokens of output, which suits it to in-depth analysis of long texts. The model was trained with large-scale reinforcement learning using the CISPO algorithm; the run completed on 512 H800 GPUs in approximately three weeks. MiniMax‑M1 is described as matching or surpassing leading models in mathematics, programming, software development, tool use, and long-context understanding. Two variants are available, with thinking budgets of 40K and 80K tokens respectively, and the model weights and deployment instructions are published on GitHub and Hugging Face. It is aimed at both developers and researchers.
Description
SubQ 1.1 Small is the second iteration of Subquadratic’s long-context AI model, built to help enterprises solve problems that require reasoning across entire artifacts rather than isolated chunks. The model is designed for use cases involving large code repositories, document libraries, legal agreements, financial reports, contracts, and other complex information sets. Its Subquadratic Sparse Attention architecture reduces the compute burden of traditional dense attention, making it more practical to process multi-million-token contexts. SubQ 1.1 Small achieves near-perfect performance on needle-in-a-haystack retrieval tests up to 12M tokens, despite being trained primarily at 1M tokens. It also performs strongly on RULER, GPQA Diamond, LiveCodeBench, and AutomationBench Finance, showing a balance between long-context retrieval and general reasoning ability. At 1M tokens, the model uses 64.5x less compute than dense attention and runs 56x faster than FlashAttention-2 on a single attention layer. This efficiency makes long-context training and inference more scalable for enterprise AI applications. SubQ 1.1 Small is especially valuable for teams that need to analyze relationships across full documents, trace logic across codebases, or connect information across extensive collections. The model is intended to help organizations reduce dependence on complex retrieval workarounds and reason more directly over large-scale data.
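As a rough illustration of why dense attention becomes costly at these context lengths, the sketch below counts the query-key pairs a single dense self-attention layer scores. It assumes only the standard property that dense attention compares every token with every other token; the function names are hypothetical, and the code says nothing about how Subquadratic Sparse Attention itself is implemented.

```python
def dense_attention_pairs(num_tokens: int) -> int:
    """Query-key pairs scored by one dense self-attention layer (per head)."""
    # Every token attends to every token, so the count grows quadratically.
    return num_tokens * num_tokens


def growth_factor(short_context: int, long_context: int) -> float:
    """How much the dense pair count grows when the context is lengthened."""
    return dense_attention_pairs(long_context) / dense_attention_pairs(short_context)


ONE_MILLION = 1_000_000
TWELVE_MILLION = 12_000_000

# Pair count at the 1M-token context length mentioned in the description.
print(dense_attention_pairs(ONE_MILLION))

# Growth in dense pair count when moving from 1M to 12M tokens.
print(growth_factor(ONE_MILLION, TWELVE_MILLION))
```

Under this simplified count, a 12x longer context means 144x more pairs for dense attention, which is the kind of scaling a subquadratic design aims to avoid.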
API Access
Has API
API Access
Has API
Integrations
Anuma
Claude Code
GitHub
Hugging Face
OpenAI
OpenAI Codex
SiliconFlow
SubQ
Integrations
Anuma
Claude Code
GitHub
Hugging Face
OpenAI
OpenAI Codex
SiliconFlow
SubQ
Pricing Details
No price information available.
Free Trial
Free Version
Pricing Details
No price information available.
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
MiniMax
Founded
2021
Country
Singapore
Website
www.minimax.io/news/minimaxm1
Vendor Details
Company Name
Subquadratic
Founded
2026
Country
United States
Website
subq.ai/subq-1-1-small-technical-report