Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Description (DiffusionGemma)

DiffusionGemma is an open model from Google that explores text diffusion as a faster way to generate text. Released under the Apache 2.0 license, this 26-billion-parameter Mixture of Experts (MoE) model does not generate tokens one at a time as autoregressive models do. Instead, it produces entire blocks of text at once, which makes generation up to four times faster on GPUs. It builds on the parameter efficiency of the Gemma 4 family and on Gemini Diffusion research, and adds a diffusion head that speeds up generation. The model is aimed at researchers and developers working on speed-sensitive, interactive local workflows, including in-line editing, rapid iteration, and non-linear narrative forms. By shifting the decode bottleneck from memory bandwidth to compute, it can produce over 1,000 tokens per second on a single NVIDIA H100 and more than 700 tokens per second on an NVIDIA GeForce RTX 5090.
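To put the quoted throughput figures in perspective, the short sketch below converts tokens per second into wall-clock time for one response. The 2,000-token response length is an assumption chosen for illustration, and the "baseline" line simply applies the "up to four times faster" claim as an upper bound; neither number is a measurement.

```python
# Rough latency arithmetic from the throughput figures quoted in the description.
# RESPONSE_TOKENS and the use of SPEEDUP as a baseline are illustrative assumptions.

QUOTED_TOKENS_PER_SECOND = {
    "NVIDIA H100": 1000,              # "over 1,000 tokens per second"
    "NVIDIA GeForce RTX 5090": 700,   # "more than 700 tokens per second"
}
RESPONSE_TOKENS = 2000  # assumed response length
SPEEDUP = 4             # "up to four times faster", treated as an upper bound


def generation_seconds(num_tokens, tokens_per_second):
    """Time to emit num_tokens at a steady tokens_per_second rate."""
    return num_tokens / tokens_per_second


for gpu, tps in QUOTED_TOKENS_PER_SECOND.items():
    fast = generation_seconds(RESPONSE_TOKENS, tps)
    # If the full 4x applied, an autoregressive baseline would take 4x as long.
    slow = fast * SPEEDUP
    print(f"{gpu}: {fast:.2f} s (vs. up to {slow:.2f} s at 1/{SPEEDUP} the rate)")
```

Because the quoted rates are lower bounds ("over", "more than") and the speedup is an upper bound ("up to"), the printed times are best read as rough estimates.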

Description (Mercury Edit 2)

Mercury Edit 2 is an AI model from Inception Labs, part of the Mercury suite, built for fast reasoning, coding, and editing on an architecture that differs from that of typical large language models. It builds on Mercury 2, a diffusion-based model that generates and refines complete outputs in parallel rather than producing text one token at a time, which yields markedly higher speeds and more responsive editing. Rather than working like a linear “typewriter,” it works like an editor: it starts from a rough draft and refines many tokens at once, supporting real-time interaction and fast iteration in tasks such as code editing, content creation, and agent-based workflows. It reaches a throughput of up to approximately 1,000 tokens per second, significantly outpacing traditional models while remaining competitive on reasoning benchmarks.
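The "editor rather than typewriter" idea can be illustrated with a toy loop that fills several masked positions per step instead of one. This is only a sketch of why parallel refinement needs fewer steps than token-by-token decoding; it is not Inception's algorithm. The function name `toy_parallel_refine`, the `tokens_per_step` parameter, and the "model" that copies from a known target are all hypothetical stand-ins for a real neural denoiser.

```python
import random

MASK = "_"


def toy_parallel_refine(target, tokens_per_step, seed=0):
    """Toy illustration: fill several masked positions in each step.

    A real diffusion model predicts tokens with a neural network; here the
    'model' just copies from a known target so that the step count is visible.
    """
    rng = random.Random(seed)
    draft = [MASK] * len(target)  # start from a fully masked draft
    steps = 0
    while MASK in draft:
        masked = [i for i, tok in enumerate(draft) if tok == MASK]
        # Choose up to tokens_per_step positions and fill them all at once.
        for i in rng.sample(masked, min(tokens_per_step, len(masked))):
            draft[i] = target[i]
        steps += 1
    return draft, steps


target = "the quick brown fox jumps over the lazy dog".split()
draft, steps = toy_parallel_refine(target, tokens_per_step=3)
print(" ".join(draft))
print(f"{steps} parallel steps vs. {len(target)} token-by-token steps")
```

With nine tokens and three positions filled per step, the loop finishes in three steps, whereas a strictly sequential decoder would need nine.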

API Access

Has API

Integrations

Cline
Cursor
ElevenLabs
Gemini Enterprise Agent Platform
Gemma
Inception Labs
Kilo Code
LangChain
NVIDIA NIM
OpenClaw
OpenCode
Roo Code
Vapi AI
Zed

Pricing Details (DiffusionGemma)

Free
Free Trial
Free Version

Pricing Details (Mercury Edit 2)

$0.25 per 1M input tokens
Free Trial
Free Version
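
As a worked example of the $0.25 per 1M input tokens rate listed above, the sketch below estimates the input-side cost of a request. Only the input price appears in this listing, so output-token pricing is deliberately left out; the function name and the example token counts are assumptions for illustration.

```python
# Input-side cost at the listed rate. Output-token pricing is not given in
# this listing, so it is not modeled here.

PRICE_PER_MILLION_INPUT_TOKENS_USD = 0.25  # from the pricing details above


def input_cost_usd(input_tokens):
    """Cost in USD of sending input_tokens at the listed per-million rate."""
    return input_tokens / 1_000_000 * PRICE_PER_MILLION_INPUT_TOKENS_USD


# Hypothetical workloads: one large prompt, and a day of smaller edit requests.
single_prompt = input_cost_usd(40_000)
daily_edits = input_cost_usd(5_000 * 2_000)  # 5,000 requests of 2,000 tokens

print(f"40,000-token prompt: ${single_prompt:.4f}")
print(f"5,000 requests x 2,000 tokens: ${daily_edits:.2f}")
```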

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details (DiffusionGemma)

Company Name: Google
Founded: 1998
Country: United States
Website: blog.google/innovation-and-ai/technology/developers-tools/diffusion-gemma-faster-text-generation/

Vendor Details (Mercury Edit 2)

Company Name: Inception
Country: United States
Website: www.inceptionlabs.ai/blog/introducing-mercury-edit-2

Alternatives (DiffusionGemma)

Mercury Coder (Inception Labs)

Alternatives (Mercury Edit 2)

GPT-5.4 (OpenAI)
Gemini Diffusion (Google DeepMind)
Mercury Coder (Inception Labs)
GPT-5.4 Pro (OpenAI)
ByteDance Seed (ByteDance)
MiniMax M2.5 (MiniMax)