OPT Description

Large language models, often requiring extensive computational resources for training over long periods, have demonstrated impressive proficiency in zero- and few-shot learning tasks. Due to the high investment needed for their development, replicating these models poses a significant challenge for many researchers. Furthermore, access to the few models available via API is limited, as users cannot obtain the complete model weights, complicating academic exploration. In response to this, we introduce Open Pre-trained Transformers (OPT), a collection of decoder-only pre-trained transformers ranging from 125 million to 175 billion parameters, which we intend to share comprehensively and responsibly with interested scholars. Our findings indicate that OPT-175B exhibits performance on par with GPT-3, yet it is developed with only one-seventh of the carbon emissions required for GPT-3's training. Additionally, we will provide a detailed logbook that outlines the infrastructure hurdles we encountered throughout the project, as well as code to facilitate experimentation with all released models, ensuring that researchers have the tools they need to explore this technology further.

Integrations

No Integrations at this time

Reviews

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Company Details

Company:
Meta
Year Founded:
2004
Headquarters:
United States
Website:
www.meta.com

Media

Get Started
Recommended Products
Error to trace to log to deploy. One click. No SSH. Icon
Error to trace to log to deploy. One click. No SSH.

Catch the cause before the pager goes off.

AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.
Free 30 days.

Product Details

Platforms
Web-Based
On-Premises
Types of Training
Training Docs

OPT Features and Options

OPT User Reviews

Write a Review
  • Previous
  • Next