Description (GLM-5)
GLM-5 is an open-source foundation model from Z.ai built for agentic engineering and complex task execution. Compared with earlier versions, it increases both parameter count and training data, and it adopts DeepSeek Sparse Attention to make inference more efficient. It is trained with slime, an asynchronous reinforcement learning framework that raises training throughput and enables more effective post-training alignment. GLM-5 reports leading performance among open-source models on reasoning, coding, and general agent benchmarks, with strong results on SWE-bench, BrowseComp, and Vending Bench 2. Its handling of long-horizon simulations reflects planning, resource allocation, and operational decision-making skills. Beyond benchmarks, GLM-5 can generate fully formatted documents such as .docx, .pdf, and .xlsx files. It integrates with coding agents such as Claude Code and OpenClaw, enabling cross-application automation and collaborative agent workflows. Developers can access GLM-5 through Z.ai’s API, deploy it locally with frameworks such as vLLM or SGLang, or use it through an interactive GUI environment. The model is released under the MIT License, which permits broad experimentation and adoption. Overall, GLM-5 targets practical, work-oriented AI systems that move beyond chat into full task execution.
Description (Hy3 preview)
Hy3 preview is Tencent Hy's most advanced model in the Hy series to date. It is a Mixture-of-Experts model with 295 billion total parameters, of which 21 billion are activated, plus 3.8 billion parameters in the MTP (multi-token prediction) layer, and it supports a context window of up to 256,000 tokens. It is the first model built on Tencent Hy's rebuilt infrastructure, which targets practical use in complex reasoning, instruction following, in-context learning, coding, and inference. The model combines fast and slow thinking: it answers simple queries directly and reasons in depth on harder math, programming, and reasoning problems. It is designed for long-context understanding, instruction adherence, tool use, and agent workflows, and it is evaluated both on conventional benchmarks and in real-world business and development settings. Its design is intended to adapt to a wide range of scenarios and applications.
API Access (GLM-5 and Hy3 preview)
Has API
Integrations (GLM-5 and Hy3 preview)
APIFree
Cherry Studio
Claude Code
Claw Code
Cline
Dessix
GLM Coding Plan
GLM-5-Turbo
Kilo Code
Ollama
Pricing Details (GLM-5 and Hy3 preview)
Free
Free Trial
Free Version
Deployment (GLM-5 and Hy3 preview)
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support (GLM-5 and Hy3 preview)
Business Hours
Live Rep (24/7)
Online Support
Types of Training (GLM-5 and Hy3 preview)
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details (GLM-5)
Company Name
Zhipu AI
Founded
2019
Country
China
Website
z.ai/
Vendor Details (Hy3 preview)
Company Name
Tencent
Founded
1998
Country
China
Website
hy.tencent.com/research/hy3-preview