Best Large Language Models of 2024

Find and compare the best Large Language Models in 2024

Use the comparison tool below to compare the top Large Language Models on the market. You can filter results by user reviews, pricing, features, platform, region, support options, integrations, and more.

  • 1
    CodeGen Reviews

    CodeGen

    Salesforce

    Free
    CodeGen is an open-source model for program synthesis. It was trained on TPU v4 and is competitive with OpenAI Codex.
  • 2
    GPT-NeoX Reviews

    GPT-NeoX

    EleutherAI

    Free
    An implementation of model-parallel autoregressive transformers on GPUs, based on the DeepSpeed library. This repository contains EleutherAI's library for training large language models on GPUs. The current framework is based on NVIDIA's Megatron Language Model and has been augmented with techniques from DeepSpeed as well as some novel optimizations. The repo is intended to be a central, accessible place to gather techniques for training large-scale autoregressive language models and to accelerate research into large-scale training.
  • 3
    GPT-J Reviews

    GPT-J

    EleutherAI

    Free
    GPT-J is a cutting-edge language model developed by EleutherAI. Its performance is comparable to OpenAI's GPT-3 on a variety of zero-shot tasks, and it has been shown to surpass GPT-3 at tasks related to code generation. The latest version, GPT-J-6B, was trained on The Pile, a publicly available dataset containing 825 gibibytes of language data organized into 22 subsets. GPT-J has some similarities with ChatGPT, but it is not intended to be a chatbot: its primary function is to predict text. In March 2023, Databricks introduced Dolly, an Apache-licensed instruction-following model.
  • 4
    Pythia Reviews

    Pythia

    EleutherAI

    Free
    Pythia combines interpretability analysis and scaling laws to understand how knowledge develops and evolves during training in autoregressive transformers.
  • 5
    Dolly Reviews

    Dolly

    Databricks

    Free
    Dolly is an inexpensive LLM that demonstrates a surprising amount of the capabilities of ChatGPT. Whereas the work from the Alpaca team showed that state-of-the-art models could be coaxed into high-quality instruction-following behavior, we find that even years-old open-source models with much earlier architectures exhibit striking behaviors when fine-tuned on a small corpus of instruction training data. Dolly takes an open-source 6-billion-parameter model from EleutherAI and modifies it to elicit capabilities, such as brainstorming and text generation, that were not present in the original.
  • 6
    mT5 Reviews

    mT5

    Google

    Free
    Multilingual T5 (mT5) is a massively multilingual pretrained text-to-text transformer model, trained using a recipe similar to T5. This repo can be used to reproduce the experiments described in the mT5 paper. The mC4 corpus covers 101 languages, including Afrikaans, Albanian, Amharic, Arabic, Armenian, Azerbaijani, Basque, Belarusian, Bengali, Bulgarian, Burmese, Catalan, Cebuano, Chichewa, Chinese, Corsican, Czech, Danish, Dutch, English, Esperanto, Estonian, Filipino, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hausa, Hawaiian, Hebrew, Hindi, Hmong, Hungarian, Icelandic, Igbo, Indonesian, Irish, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Korean, Kurdish, Kyrgyz, Lao, Latin, Latvian, Lithuanian, Luxembourgish, Macedonian, Malagasy, Malay, Malayalam, Maltese, Maori, Marathi, Mongolian, Nepali, Norwegian, Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Samoan, Scottish Gaelic, Serbian, Shona, Sindhi, and more.
  • 7
    Cerebras-GPT Reviews
    Training state-of-the-art language models is extremely difficult. It requires large compute budgets, complex distributed computing techniques, and deep ML expertise, so few organizations are able to train large language models from scratch. Among those that do have the expertise and resources, a growing number do not open source their results. We at Cerebras believe in open access to the latest models. Cerebras is proud to announce that Cerebras-GPT, a family of GPT models ranging from 111 million to 13 billion parameters, has been released to the open-source community. These models are trained using the Chinchilla formula and provide the highest accuracy for a given compute budget. Cerebras-GPT has faster training times and lower training costs, and it consumes less power than any other publicly available model.
  • 8
    Falcon-40B Reviews

    Falcon-40B

    Technology Innovation Institute (TII)

    Free
    Falcon-40B is a 40B-parameter causal decoder-only model built by TII. It was trained on 1,000B tokens of RefinedWeb enhanced with curated corpora. Why use Falcon-40B? It is the best open-source model available, outperforming LLaMA, StableLM, RedPajama, MPT, and others on the OpenLLM Leaderboard. Its architecture is optimized for inference, with FlashAttention and multiquery attention. It is available under an Apache 2.0 license that allows commercial use without any restrictions or royalties. This is a raw pretrained model that should be fine-tuned for most use cases. If you're looking for a model that can take generic instructions in a chat format, we suggest Falcon-40B-Instruct.
  • 9
    Falcon-7B Reviews

    Falcon-7B

    Technology Innovation Institute (TII)

    Free
    Falcon-7B is a 7B-parameter causal decoder-only model built by TII. It was trained on 1,500B tokens of RefinedWeb enhanced with curated corpora. Why use Falcon-7B? Thanks to that training data, it outperforms comparable open-source models such as MPT-7B, StableLM, and RedPajama on the OpenLLM Leaderboard. Its architecture is optimized for inference, with FlashAttention and multiquery attention. It is available under an Apache 2.0 license that allows commercial use without any restrictions or royalties.
  • 10
    RedPajama Reviews

    RedPajama

    RedPajama

    Free
    GPT-4 and other foundation models have accelerated AI's development. The most powerful models, however, are either closed commercial models or only partially open. RedPajama aims to create a set of leading, fully open-source models. Today, we're excited to announce that the first phase of this project is complete: the reproduction of LLaMA's training dataset of more than 1.2 trillion tokens. The most capable foundation models are currently closed behind commercial APIs, which limits research, customization, and their use with sensitive data. If the open community can bridge the quality gap between closed and open models, fully open-source models could be the answer to these limitations, and recent progress has been made in this area. AI is in many ways having its Linux moment. Stable Diffusion demonstrated that open-source software can not only compete with commercial offerings such as DALL-E, but also lead to incredible creative results from community participation.
  • 11
    Vicuna Reviews

    Vicuna

    lmsys.org

    Free
    Vicuna-13B is an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. A preliminary evaluation using GPT-4 as a judge shows that Vicuna-13B achieves more than 90% of the quality of OpenAI ChatGPT and Google Bard, and that it outperforms other models such as LLaMA and Stanford Alpaca. Training Vicuna-13B cost around $300. The code and weights, along with an online demo, are available for non-commercial use.
  • 12
    MPT-7B Reviews

    MPT-7B

    MosaicML

    Free
    Introducing MPT-7B, the latest entry in our MosaicML Foundation Series. MPT-7B is a transformer trained from scratch on 1T tokens of text and code. It is open source, available for commercial use, and matches the quality of LLaMA-7B. MPT-7B was trained on the MosaicML platform in 9.5 days with zero human intervention, at a cost of $200k. You can now train, fine-tune, and deploy your own private MPT models, either starting from one of our checkpoints or training from scratch. For inspiration, we are also releasing three fine-tuned models in addition to the base MPT-7B: MPT-7B-Instruct, MPT-7B-Chat, and MPT-7B-StoryWriter-65k+, the last of which uses a context length of 65k tokens.
  • 13
    OpenLLaMA Reviews

    OpenLLaMA

    OpenLLaMA

    Free
    OpenLLaMA is a permissively licensed open-source reproduction of Meta AI's LLaMA 7B, trained on the RedPajama dataset. Our model weights can serve as a drop-in replacement for LLaMA 7B in existing implementations. We also offer a smaller 3B version of the model.
  • 14
    Baichuan-13B Reviews

    Baichuan-13B

    Baichuan Intelligent Technology

    Free
    Baichuan-13B is an open-source, commercially usable large language model with 13 billion parameters, developed by Baichuan Intelligent Technology as the successor to Baichuan-7B. It achieves the best results among models of the same size on authoritative Chinese and English benchmarks. This release includes two versions: a pretrained model (Baichuan-13B-Base) and an aligned model (Baichuan-13B-Chat). Baichuan-13B is larger and uses more data: building on Baichuan-7B, it expands the parameter count to 13 billion and is trained on 1.4 trillion tokens of high-quality corpus, 40% more than LLaMA-13B. This currently makes it the open-source 13B model with the most training data. It supports both Chinese and English, uses ALiBi positional encoding, and has a context window of 4096 tokens.
  • 15
    FreeWilly Reviews

    FreeWilly

    Stability AI

    Free
    Stability AI, in collaboration with its CarperAI lab, is proud to announce FreeWilly1 and its successor FreeWilly2, two powerful new Large Language Models. Both models show exceptional reasoning abilities across a variety of benchmarks. FreeWilly1 leverages the original LLaMA 65B foundation model and was carefully fine-tuned on a new synthetically generated dataset using Supervised Fine-Tuning (SFT) in standard Alpaca format. FreeWilly2 leverages the LLaMA 2 70B foundation model and reaches performance comparable to GPT-3.5 on some tasks. The FreeWilly models were inspired by Microsoft's paper "Orca: Progressive Learning from Complex Explanation Traces of GPT-4." While our data generation process is similar, our data sources differ.
  • 16
    ChatGLM-6B Reviews

    ChatGLM-6B

    Zhipu AI

    Free
    ChatGLM-6B is a Chinese-English bilingual dialogue model based on the General Language Model (GLM) architecture, with 6.2 billion parameters. Using model quantization, users can deploy it locally on consumer-grade graphics cards (only 6 GB of video memory is required at the INT4 quantization level). ChatGLM-6B uses technology similar to ChatGPT and is optimized for Chinese dialogue and Q&A. After training on approximately 1T tokens of Chinese and English text, supplemented by supervised fine-tuning, feedback bootstrapping, and reinforcement learning from human feedback, ChatGLM-6B is able to generate answers that are in line with human preferences.
  • 17
    Code Llama Reviews
    Code Llama is a large language model (LLM) that can generate code from text prompts. As the most advanced publicly available LLM for code tasks, Code Llama has the potential to improve workflows for developers and lower the barrier to entry for people learning to code. It can be used as a productivity and educational tool to help programmers write more robust, well-documented software. Code Llama is a state-of-the-art LLM capable of generating both code and natural language about code, from both code and natural language prompts. It is free for research and commercial use. Code Llama is built on top of Llama 2 and is available in three models: Code Llama, the foundational code model; Code Llama-Python, specialized for Python; and Code Llama-Instruct, fine-tuned to understand natural language instructions.
  • 18
    LongLLaMA Reviews

    LongLLaMA

    LongLLaMA

    Free
    This repository contains a research preview of LongLLaMA, a large language model capable of handling contexts of up to 256k tokens. LongLLaMA is built on the foundation of OpenLLaMA and fine-tuned with the Focused Transformer method. LongLLaMA-Code is built on the foundation of Code Llama. We release a smaller base variant of LongLLaMA (not instruction-tuned) under a permissive license (Apache 2.0), along with inference code that supports longer contexts on Hugging Face. Our model weights can serve as a drop-in replacement for LLaMA in existing implementations (for short contexts of up to 2048 tokens). We also provide evaluation results and comparisons with the original OpenLLaMA models.
  • 19
    Grok Reviews
    Grok is an AI modeled after The Hitchhiker's Guide to the Galaxy. It is intended to answer almost any question and, far harder, even to suggest which questions to ask. Grok is designed to answer with a bit of wit and has a rebellious streak, so please don't use it if you dislike humor! A unique and fundamental advantage of Grok is that it has real-time knowledge of the world through the X platform. It will also answer questions that most other AI systems reject.
  • 20
    Lemonfox.ai Reviews

    Lemonfox.ai

    Lemonfox.ai

    $5 per month
    Our models are deployed around the world for the best possible response times. Our OpenAI-compatible API integrates seamlessly into your application, so you can start in minutes and scale to serve millions of users. Thanks to extensive performance and scale optimizations, our API is 4 times cheaper than the OpenAI GPT-3.5 API. Our AI model generates text and chat at ChatGPT-level performance for a fraction of the cost. You can also use one of the most powerful AI image models to create stunning images, graphics, and illustrations.
  • 21
    Inflection-2 Reviews

    Inflection-2

    Inflection

    Free
    We are proud to announce that we have completed training of Inflection-2. It is the best model in the world for its compute class and the second most capable LLM. Inflection's mission is to create a personal AI for everyone. Inflection-2 is significantly more capable than Inflection-1, with better factual knowledge, better stylistic control, and dramatically improved reasoning. Inflection-2 was trained on 5,000 NVIDIA GPUs in fp8 mixed precision for about 10^25 FLOPs. This puts it in the same training compute class as Google's flagship PaLM 2 Large model, which Inflection-2 outperforms on the majority of standard AI performance benchmarks, including the well-known MMLU, TriviaQA, HellaSwag, and GSM8k. Designed with serving efficiency in mind, Inflection-2 will soon power Pi. We were able to reduce serving costs by switching from A100 to H100 GPUs and by optimizing our inference implementation.
  • 22
    JinaChat Reviews

    JinaChat

    Jina AI

    $9.99 per month
    Experience JinaChat, an LLM service designed for professionals. JinaChat is a multimodal chat service that goes beyond text to include images. Short interactions of fewer than 100 tokens are free. Our API lets developers build complex applications that leverage long conversation histories. JinaChat is the future of LLM services: multimodal, long-memory, and affordable. Modern LLM applications often rely on long prompts or large memory, which leads to high costs when the same prompts are sent to the server repeatedly. The JinaChat API solves this by letting you carry forward previous conversations without resending the entire prompt. This saves both time and money when developing complex applications such as AutoGPT.
  • 23
    Ferret Reviews

    Ferret

    Apple

    Free
    An MLLM (multimodal large language model) that can refer to and ground anything, anywhere, at any granularity. Ferret Model: a hybrid region representation combined with a spatial-aware visual sampler enables fine-grained, open-vocabulary referring and grounding. GRIT Dataset: a large-scale, hierarchical, robust ground-and-refer instruction tuning dataset. Ferret-Bench: a multimodal evaluation benchmark that jointly requires referring/grounding, semantics, knowledge, and reasoning.
  • 24
    Jan Reviews

    Jan

    Jan

    Free
    Customizable AI assistants, global hotkeys, and in-line AI help you double your productivity. Elegant features integrate seamlessly into your mobile workflows. Conversations, preferences, and model usage stay on your computer: secure, exportable, and deletable at any time.
  • 25
    Mixtral 8x7B Reviews

    Mixtral 8x7B

    Mistral AI

    Free
    Mixtral 8x7B is a high-quality sparse mixture-of-experts (SMoE) model with open weights, licensed under Apache 2.0. Mixtral outperforms Llama 2 70B on most benchmarks with 6x faster inference. It is the strongest open-weight model with a permissive license and the best model overall in terms of cost/performance trade-offs. It matches or exceeds GPT-3.5 on most standard benchmarks.
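
The sparse mixture-of-experts idea mentioned in the Mixtral 8x7B entry can be illustrated with a toy router. This is a minimal sketch, not Mistral AI's implementation: it assumes 8 experts with top-2 routing per token, and the function names, toy experts, and router logits are all hypothetical. The point it shows is why inference can be cheap: only the selected experts run for a given input.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(router_logits, k=2):
    """Pick the k highest-scoring experts and normalize their weights.

    Returns a list of (expert_index, weight) pairs, best expert first.
    """
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i],
                 reverse=True)[:k]
    weights = softmax([router_logits[i] for i in top])
    return list(zip(top, weights))


def moe_forward(x, experts, router_logits, k=2):
    """Weighted sum of the outputs of only the k selected experts."""
    return sum(w * experts[i](x) for i, w in top_k_route(router_logits, k))


# Hypothetical toy experts: expert n multiplies its input by n + 1.
experts = [lambda x, s=s: s * x for s in range(1, 9)]

# Hypothetical router scores for one token (assumption: 8 experts).
logits = [0.1, 2.0, -1.0, 0.5, 1.0, 0.0, -0.5, 0.3]

routes = top_k_route(logits)
output = moe_forward(3.0, experts, logits)
print(routes, output)
```

In this toy setup only 2 of the 8 experts are evaluated per input, which is the source of the speedup that sparse models offer relative to a dense model of the same total parameter count.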