Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Average Ratings 0 Ratings

Total
ease
features
design
support

No User Reviews. Be the first to provide a review:

Write a Review

Description

The Google AI Edge Gallery is an innovative, open-source Android application designed to showcase various applications of on-device machine learning and generative AI, allowing users to download and utilize models offline once installed. This app features a range of functionalities, such as AI Chat for engaging in multi-turn conversations, Ask Image for uploading images to inquire about objects or obtain descriptions, Audio Scribe for transcribing or translating audio files, and Prompt Lab for performing single-turn tasks like summarization and code generation. Additionally, it provides performance insights, offering metrics on aspects like latency and decode speed. Users have the flexibility to switch between compatible models, including options like Gemma 3n and models from Hugging Face, as well as the ability to incorporate their own LiteRT models while accessing model cards and source code for increased transparency. By processing all data locally on the device, the app prioritizes user privacy, requiring no internet connection for core functionalities after the initial model load, which ultimately minimizes latency and bolsters data security. Overall, the Google AI Edge Gallery empowers users to explore cutting-edge AI capabilities while maintaining their privacy and control over their data.

Description

Mistral AI has introduced Voxtral Transcribe 2, an advanced suite of speech-to-text models that provides remarkably fast, high-quality audio transcription and speaker identification, supporting a diverse range of languages. This collection features Voxtral Mini Transcribe V2, which is tailored for batch transcription and includes functionalities like word-level timestamps, context biasing, and compatibility with 13 different languages, alongside Voxtral Realtime, which is optimized for live speech recognition with adjustable latency that can drop below 200 ms for immediate use cases. Both models excel in transcription accuracy while maintaining efficiency and cost-effectiveness; Mini Transcribe V2 is noted for its exceptional performance and minimal error rates, while Realtime is made available as open-source under the Apache 2.0 license, enabling developers to implement it on edge devices or within secure environments. Furthermore, the innovative technology embedded in these models represents a significant leap forward in transcription solutions, catering to various applications across industries.

API Access

Has API

API Access

Has API

Screenshots View All

Screenshots View All

Integrations

Gemma 3n
Hugging Face
LiteRT

Integrations

Gemma 3n
Hugging Face
LiteRT

Pricing Details

Free
Free Trial
Free Version

Pricing Details

$14.99 per month
Free Trial
Free Version

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Deployment

Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Customer Support

Business Hours
Live Rep (24/7)
Online Support

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Types of Training

Training Docs
Webinars
Live Training (Online)
In Person

Vendor Details

Company Name

Google

Country

United States

Website

github.com/google-ai-edge/gallery/

Vendor Details

Company Name

Mistral AI

Founded

2023

Country

France

Website

mistral.ai/news/voxtral-transcribe-2

Product Features

Product Features

Alternatives

Alternatives

Gemma 3n Reviews

Gemma 3n

Google DeepMind
Scribe Reviews

Scribe

ElevenLabs
LiteRT Reviews

LiteRT

Google
LFM2 Reviews

LFM2

Liquid AI