Average Ratings 1 Rating
Average Ratings 0 Ratings
Description
Introducing CodeGeeX, a powerful multilingual code generation model boasting 13 billion parameters, which has been pre-trained on an extensive code corpus covering over 20 programming languages. Leveraging the capabilities of CodeGeeX, we have created a VS Code extension (search 'CodeGeeX' in the Extension Marketplace) designed to support programming in various languages. In addition to its proficiency in multilingual code generation and translation, CodeGeeX can serve as a personalized programming assistant through its few-shot learning capability. This means that by providing a handful of examples as prompts, CodeGeeX can mimic the showcased patterns and produce code that aligns with those examples. This functionality enables the implementation of exciting features such as code explanation, summarization, and generation tailored to specific coding styles. For instance, users can input code snippets reflecting their unique style, and CodeGeeX will generate similar code accordingly. Moreover, experimenting with different prompt formats can further inspire CodeGeeX to develop new coding skills and enhance its versatility. Thus, CodeGeeX stands out as a versatile tool for developers looking to streamline their coding processes.
Description
StarCoder and StarCoderBase represent advanced Large Language Models specifically designed for code, developed using openly licensed data from GitHub, which encompasses over 80 programming languages, Git commits, GitHub issues, and Jupyter notebooks. In a manner akin to LLaMA, we constructed a model with approximately 15 billion parameters trained on a staggering 1 trillion tokens. Furthermore, we tailored the StarCoderBase model with 35 billion Python tokens, leading to the creation of what we now refer to as StarCoder.
Our evaluations indicated that StarCoderBase surpasses other existing open Code LLMs when tested against popular programming benchmarks and performs on par with or even exceeds proprietary models like code-cushman-001 from OpenAI, the original Codex model that fueled early iterations of GitHub Copilot. With an impressive context length exceeding 8,000 tokens, the StarCoder models possess the capability to handle more information than any other open LLM, thus paving the way for a variety of innovative applications. This versatility is highlighted by our ability to prompt the StarCoder models through a sequence of dialogues, effectively transforming them into dynamic technical assistants that can provide support in diverse programming tasks.
API Access
Has API
API Access
Has API
Integrations
Python
Visual Studio Code
C
C++
ChatGPT
CodeQwen
Git
GitHub
Go
Java
Integrations
Python
Visual Studio Code
C
C++
ChatGPT
CodeQwen
Git
GitHub
Go
Java
Pricing Details
Free
Free Trial
Free Version
Pricing Details
Free
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
AMiner
Country
China
Website
codegeex.cn/
Vendor Details
Company Name
BigCode
Founded
2023
Website
huggingface.co/blog/starcoder