PanGu-Σ Description

The expansion of large language model has led to significant advancements in natural language processing, understanding and generation. This study introduces a new system that uses Ascend 910 AI processing units and the MindSpore framework in order to train a language with over one trillion parameters, 1.085T specifically, called PanGu-Sigma. This model, which builds on the foundation laid down by PanGu-alpha transforms the traditional dense Transformer model into a sparse model using a concept called Random Routed Experts. The model was trained efficiently on a dataset consisting of 329 billion tokens, using a technique known as Expert Computation and Storage Separation. This led to a 6.3 fold increase in training performance via heterogeneous computer. The experiments show that PanGu-Sigma is a new standard for zero-shot learning in various downstream Chinese NLP tasks.

Integrations

Reviews

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Company Details

Company:
Huawei
Year Founded:
1987
Headquarters:
China
Website:
huawei.com

Media

Get Started
Recommended Products
Manage Properties Better For Free Icon
Manage Properties Better For Free

For small to mid-sized landlords and property managers

Innago is a free and easy-to-use property management solution. Whether you have 1 unit or 1000, student housing, or commercial properties, Innago is built for you. Our software is designed to save you time and money, so you can spend more time doing the things that matter most.
Learn More

Product Details

Platforms
SaaS
On-Premises
Type of Training
Documentation

PanGu-Σ Features and Options

PanGu-Σ User Reviews

Write a Review
  • Previous
  • Next