Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Description

We are thrilled to present DBRX, a versatile open LLM developed by Databricks. This innovative model achieves unprecedented performance on a variety of standard benchmarks, setting a new benchmark for existing open LLMs. Additionally, it equips both the open-source community and enterprises crafting their own LLMs with features that were once exclusive to proprietary model APIs; our evaluations indicate that it outperforms GPT-3.5 and competes effectively with Gemini 1.0 Pro. Notably, it excels as a code model, outperforming specialized counterparts like CodeLLaMA-70B in programming tasks, while also demonstrating its prowess as a general-purpose LLM. The remarkable quality of DBRX is complemented by significant enhancements in both training and inference efficiency. Thanks to its advanced fine-grained mixture-of-experts (MoE) architecture, DBRX elevates the efficiency of open models to new heights. In terms of inference speed, it can be twice as fast as LLaMA2-70B, and its total and active parameter counts are approximately 40% of those in Grok-1, showcasing its compact design without compromising capability. This combination of speed and size makes DBRX a game-changer in the landscape of open AI models.

Description

StarCoder and StarCoderBase represent advanced Large Language Models specifically designed for code, developed using openly licensed data from GitHub, which encompasses over 80 programming languages, Git commits, GitHub issues, and Jupyter notebooks. In a manner akin to LLaMA, we constructed a model with approximately 15 billion parameters trained on a staggering 1 trillion tokens. Furthermore, we tailored the StarCoderBase model with 35 billion Python tokens, leading to the creation of what we now refer to as StarCoder. Our evaluations indicated that StarCoderBase surpasses other existing open Code LLMs when tested against popular programming benchmarks and performs on par with or even exceeds proprietary models like code-cushman-001 from OpenAI, the original Codex model that fueled early iterations of GitHub Copilot. With an impressive context length exceeding 8,000 tokens, the StarCoder models possess the capability to handle more information than any other open LLM, thus paving the way for a variety of innovative applications. This versatility is highlighted by our ability to prompt the StarCoder models through a sequence of dialogues, effectively transforming them into dynamic technical assistants that can provide support in diverse programming tasks.

API Access

Has API

API Access

Has API

Screenshots View All

Screenshots View All

Integrations

ChatGPT
CodeQwen
Double
EPIC
GPT-3.5
GPT-4
Git
GitHub
LM Studio
OpenAI
Python
Rayven
Tabby
Taylor AI
Visual Studio Code
ZenML

Integrations

ChatGPT
CodeQwen
Double
EPIC
GPT-3.5
GPT-4
Git
GitHub
LM Studio
OpenAI
Python
Rayven
Tabby
Taylor AI
Visual Studio Code
ZenML

Pricing Details

No price information available.
Free Trial
Free Version

Pricing Details

Free
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name

Databricks

Country

United States

Website

www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm

Vendor Details

Company Name

BigCode

Founded

2023

Website

huggingface.co/blog/starcoder

Product Features

Alternatives

DeepSeek-V2 Reviews

DeepSeek-V2

DeepSeek

Alternatives

CodeQwen Reviews

CodeQwen

Alibaba
FLIP Reviews

FLIP

Kanerika
CodeGemma Reviews

CodeGemma

Google
Qwen2 Reviews

Qwen2

Alibaba
CodeGen Reviews

CodeGen

Salesforce
Ai2 OLMoE Reviews

Ai2 OLMoE

The Allen Institute for Artificial Intelligence
DeepSeek Coder Reviews

DeepSeek Coder

DeepSeek