Pixtral Large is Mistral AI’s open-weight multimodal model with 124 billion parameters: a 123-billion-parameter multimodal decoder paired with a 1-billion-parameter vision encoder. This design targets documents, charts, and natural images while retaining strong text-only comprehension. Its 128,000-token context window can hold up to 30 high-resolution images in a single prompt. The model has posted state-of-the-art results on benchmarks such as MathVista, DocVQA, and VQAv2, with reported scores ahead of GPT-4o and Gemini-1.5 Pro. The weights are available under the Mistral Research License for non-commercial use and under the Mistral Commercial License for enterprise applications.
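
To put the context-window figure in perspective, the following is a rough back-of-the-envelope sketch of how many tokens each image could occupy on average when 30 images share the 128,000-token window. The helper `per_image_budget` and its `text_tokens` parameter are hypothetical names introduced for illustration, and the even split between images is an assumption; the model's actual image tokenization may allocate tokens differently depending on resolution.

```python
# Figures taken from the description above.
CONTEXT_TOKENS = 128_000  # context window size in tokens
MAX_IMAGES = 30           # high-resolution images per prompt


def per_image_budget(context_tokens: int, num_images: int, text_tokens: int = 0) -> int:
    """Average token budget per image after reserving room for text.

    Assumption (not from the source): images and text share a single
    context window, and the image share is divided evenly.
    """
    if num_images <= 0:
        raise ValueError("num_images must be positive")
    if not 0 <= text_tokens <= context_tokens:
        raise ValueError("text_tokens must be between 0 and context_tokens")
    return (context_tokens - text_tokens) // num_images


if __name__ == "__main__":
    # Whole window devoted to images.
    print(per_image_budget(CONTEXT_TOKENS, MAX_IMAGES))
    # Reserving 8,000 tokens for instructions and the answer.
    print(per_image_budget(CONTEXT_TOKENS, MAX_IMAGES, text_tokens=8_000))
```

Under these assumptions, each image gets a little over 4,000 tokens on average, so prompts that also carry long text leave less room per image or for fewer images.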