Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
Introducing DeepSeek-V3.2-Exp, our newest experimental model derived from V3.1-Terminus, featuring the innovative DeepSeek Sparse Attention (DSA) that enhances both training and inference speed for lengthy contexts. This DSA mechanism allows for precise sparse attention while maintaining output quality, leading to improved performance for tasks involving long contexts and a decrease in computational expenses. Benchmark tests reveal that V3.2-Exp matches the performance of V3.1-Terminus while achieving these efficiency improvements. The model is now fully operational across app, web, and API platforms. Additionally, to enhance accessibility, we have slashed DeepSeek API prices by over 50% effective immediately. During a transition period, users can still utilize V3.1-Terminus via a temporary API endpoint until October 15, 2025. DeepSeek encourages users to share their insights regarding DSA through our feedback portal. Complementing the launch, DeepSeek-V3.2-Exp has been made open-source, with model weights and essential technology—including crucial GPU kernels in TileLang and CUDA—accessible on Hugging Face. We look forward to seeing how the community engages with this advancement.
Description
Phi-4-reasoning-plus is an advanced reasoning model with 14 billion parameters, enhancing the capabilities of the original Phi-4-reasoning. It employs reinforcement learning for better inference efficiency, processing 1.5 times the number of tokens compared to its predecessor, which results in improved accuracy. Remarkably, this model performs better than both OpenAI's o1-mini and DeepSeek-R1 across various benchmarks, including challenging tasks in mathematical reasoning and advanced scientific inquiries. Notably, it even outperforms the larger DeepSeek-R1, which boasts 671 billion parameters, on the prestigious AIME 2025 assessment, a qualifier for the USA Math Olympiad. Furthermore, Phi-4-reasoning-plus is accessible on platforms like Azure AI Foundry and HuggingFace, making it easier for developers and researchers to leverage its capabilities. Its innovative design positions it as a top contender in the realm of reasoning models.
API Access
Has API
API Access
Has API
Pricing Details
Free
Free Trial
Free Version
Pricing Details
No price information available.
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
DeepSeek
Founded
2023
Country
China
Website
deepseek.com
Vendor Details
Company Name
Microsoft
Founded
1975
Country
United States
Website
azure.microsoft.com/en-us/blog/one-year-of-phi-small-language-models-making-big-leaps-in-ai/