Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
Qwen3-Coder is a versatile coding model that comes in various sizes, prominently featuring the 480B-parameter Mixture-of-Experts version with 35B active parameters, which naturally accommodates 256K-token contexts that can be extended to 1M tokens. This model achieves impressive performance that rivals Claude Sonnet 4, having undergone pre-training on 7.5 trillion tokens, with 70% of that being code, and utilizing synthetic data refined through Qwen2.5-Coder to enhance both coding skills and overall capabilities. Furthermore, the model benefits from post-training techniques that leverage extensive, execution-guided reinforcement learning, which facilitates the generation of diverse test cases across 20,000 parallel environments, thereby excelling in multi-turn software engineering tasks such as SWE-Bench Verified without needing test-time scaling. In addition to the model itself, the open-source Qwen Code CLI, derived from Gemini Code, empowers users to deploy Qwen3-Coder in dynamic workflows with tailored prompts and function calling protocols, while also offering smooth integration with Node.js, OpenAI SDKs, and environment variables. This comprehensive ecosystem supports developers in optimizing their coding projects effectively and efficiently.
Description
StarCoder and StarCoderBase represent advanced Large Language Models specifically designed for code, developed using openly licensed data from GitHub, which encompasses over 80 programming languages, Git commits, GitHub issues, and Jupyter notebooks. In a manner akin to LLaMA, we constructed a model with approximately 15 billion parameters trained on a staggering 1 trillion tokens. Furthermore, we tailored the StarCoderBase model with 35 billion Python tokens, leading to the creation of what we now refer to as StarCoder.
Our evaluations indicated that StarCoderBase surpasses other existing open Code LLMs when tested against popular programming benchmarks and performs on par with or even exceeds proprietary models like code-cushman-001 from OpenAI, the original Codex model that fueled early iterations of GitHub Copilot. With an impressive context length exceeding 8,000 tokens, the StarCoder models possess the capability to handle more information than any other open LLM, thus paving the way for a variety of innovative applications. This versatility is highlighted by our ability to prompt the StarCoder models through a sequence of dialogues, effectively transforming them into dynamic technical assistants that can provide support in diverse programming tasks.
API Access
Has API
API Access
Has API
Integrations
OpenAI
Alibaba Cloud
Brokk
ChatGPT
CodeQwen
Gemini
Gemini Enterprise
Git
GitHub
LM Studio
Integrations
OpenAI
Alibaba Cloud
Brokk
ChatGPT
CodeQwen
Gemini
Gemini Enterprise
Git
GitHub
LM Studio
Pricing Details
Free
Free Trial
Free Version
Pricing Details
Free
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
Qwen
Founded
2023
Country
China
Website
qwenlm.github.io/blog/qwen3-coder/
Vendor Details
Company Name
BigCode
Founded
2023
Website
huggingface.co/blog/starcoder