Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
Chinchilla is an advanced language model that operates with a compute budget comparable to Gopher while having 70 billion parameters and utilizing four times the amount of data. This model consistently and significantly surpasses Gopher (280 billion parameters), as well as GPT-3 (175 billion), Jurassic-1 (178 billion), and Megatron-Turing NLG (530 billion), across a wide variety of evaluation tasks. Additionally, Chinchilla's design allows it to use significantly less computational power during the fine-tuning and inference processes, which greatly enhances its applicability in real-world scenarios. Notably, Chinchilla achieves a remarkable average accuracy of 67.5% on the MMLU benchmark, marking over a 7% enhancement compared to Gopher, showcasing its superior performance in the field. This impressive capability positions Chinchilla as a leading contender in the realm of language models.
Description
In honor of Archimedes, whose 2311th anniversary we celebrate this year, we are excited to introduce our inaugural Mathstral model, a specialized 7B architecture tailored for mathematical reasoning and scientific exploration. This model features a 32k context window and is released under the Apache 2.0 license. Our intention behind contributing Mathstral to the scientific community is to enhance the pursuit of solving advanced mathematical challenges that necessitate intricate, multi-step logical reasoning. The launch of Mathstral is part of our wider initiative to support academic endeavors, developed in conjunction with Project Numina. Much like Isaac Newton during his era, Mathstral builds upon the foundation laid by Mistral 7B, focusing on STEM disciplines. It demonstrates top-tier reasoning capabilities within its category, achieving remarkable results on various industry-standard benchmarks. Notably, it scores 56.6% on the MATH benchmark and 63.47% on the MMLU benchmark, showcasing the performance differences by subject between Mathstral 7B and its predecessor, Mistral 7B, further emphasizing the advancements made in mathematical modeling. This initiative aims to foster innovation and collaboration within the mathematical community.
API Access
Has API
API Access
Has API
Screenshots View All
No images available
Integrations
1min.AI
AI-FLOW
Codestral
Continue
Graydient AI
Hugging Face
Le Chat
Literal AI
Ministral 8B
Motific.ai
Integrations
1min.AI
AI-FLOW
Codestral
Continue
Graydient AI
Hugging Face
Le Chat
Literal AI
Ministral 8B
Motific.ai
Pricing Details
No price information available.
Free Trial
Free Version
Pricing Details
Free
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
Google DeepMind
Country
United States
Website
arxiv.org/abs/2203.15556
Vendor Details
Company Name
Mistral AI
Founded
2023
Country
France
Website
mistral.ai/news/mathstral/