Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
MindSpore, an open-source deep learning framework created by Huawei, is engineered to simplify the development process, ensure efficient execution, and enable deployment across various environments such as cloud, edge, and device. The framework accommodates different programming styles, including object-oriented and functional programming, which empowers users to construct AI networks using standard Python syntax. MindSpore delivers a cohesive programming experience by integrating both dynamic and static graphs, thereby improving compatibility and overall performance. It is finely tuned for a range of hardware platforms, including CPUs, GPUs, and NPUs, and exhibits exceptional compatibility with Huawei's Ascend AI processors. The architecture of MindSpore is organized into four distinct layers: the model layer, MindExpression (ME) dedicated to AI model development, MindCompiler for optimization tasks, and the runtime layer that facilitates collaboration between devices, edge, and cloud environments. Furthermore, MindSpore is bolstered by a diverse ecosystem of specialized toolkits and extension packages, including offerings like MindSpore NLP, making it a versatile choice for developers looking to leverage its capabilities in various AI applications. Its comprehensive features and robust architecture make MindSpore a compelling option for those engaged in cutting-edge machine learning projects.
Description
Recent breakthroughs in natural language processing, comprehension, and generation have been greatly influenced by the development of large language models. This research presents a system that employs Ascend 910 AI processors and the MindSpore framework to train a language model exceeding one trillion parameters, specifically 1.085 trillion, referred to as PanGu-{\Sigma}. This model enhances the groundwork established by PanGu-{\alpha} by converting the conventional dense Transformer model into a sparse format through a method known as Random Routed Experts (RRE). Utilizing a substantial dataset of 329 billion tokens, the model was effectively trained using a strategy called Expert Computation and Storage Separation (ECSS), which resulted in a remarkable 6.3-fold improvement in training throughput through the use of heterogeneous computing. Through various experiments, it was found that PanGu-{\Sigma} achieves a new benchmark in zero-shot learning across multiple downstream tasks in Chinese NLP, showcasing its potential in advancing the field. This advancement signifies a major leap forward in the capabilities of language models, illustrating the impact of innovative training techniques and architectural modifications.
API Access
Has API
API Access
Has API
Screenshots View All
No images available
Integrations
Ascend Cloud Service
Docker
Huawei Cloud
Huawei Cloud ModelArts
PanGu Chat
Python
Integrations
Ascend Cloud Service
Docker
Huawei Cloud
Huawei Cloud ModelArts
PanGu Chat
Python
Pricing Details
Free
Free Trial
Free Version
Pricing Details
No price information available.
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
MindSpore
Founded
2019
Country
China
Website
www.mindspore.cn/
Vendor Details
Company Name
Huawei
Founded
1987
Country
China
Website
huawei.com