Average Ratings 0 Ratings
Description: ERNIE 3.0 Titan
Pre-trained language models have achieved state-of-the-art results across many Natural Language Processing (NLP) tasks, and GPT-3 showed that scaling these models up can further exploit their potential. ERNIE 3.0 is a framework for pre-training large-scale, knowledge-enhanced models; a 10-billion-parameter model trained with it outperformed the leading models of the time on a variety of NLP tasks. To explore the effects of further scaling, its developers at Baidu trained ERNIE 3.0 Titan, a model with up to 260 billion parameters built on the PaddlePaddle platform. They also designed a self-supervised adversarial loss and a controllable language modeling loss so that ERNIE 3.0 Titan generates text that is both credible and controllable.
Description: GPT-NeoX
This repository contains an implementation of model-parallel autoregressive transformers on GPUs, built on the DeepSpeed library. It records EleutherAI's framework for training large-scale language models on GPUs. The framework is currently based on NVIDIA's Megatron Language Model, augmented with techniques from DeepSpeed and some novel optimizations. The project aims to be a central place for collecting techniques for training large-scale autoregressive language models and to accelerate research into large-scale training.
API Access
Has API
API Access
Has API
Pricing Details
No price information available.
Free Trial
Free Version
Pricing Details
Free
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
Baidu
Founded
2000
Country
China
Website
research.baidu.com/Public/uploads/61c4362c79ee8.pdf
Vendor Details
Company Name
EleutherAI
Founded
2020
Website
github.com/EleutherAI/gpt-neox