Average Ratings 0 Ratings
Average Ratings 0 Ratings
Description
Introducing Gemma 3n, our cutting-edge open multimodal model designed specifically for optimal on-device performance and efficiency. With a focus on responsive and low-footprint local inference, Gemma 3n paves the way for a new generation of intelligent applications that can be utilized on the move. It has the capability to analyze and respond to a blend of images and text, with plans to incorporate video and audio functionalities in the near future. Developers can create smart, interactive features that prioritize user privacy and function seamlessly without an internet connection. The model boasts a mobile-first architecture, significantly minimizing memory usage. Co-developed by Google's mobile hardware teams alongside industry experts, it maintains a 4B active memory footprint while also offering the flexibility to create submodels for optimizing quality and latency. Notably, Gemma 3n represents our inaugural open model built on this revolutionary shared architecture, enabling developers to start experimenting with this advanced technology today in its early preview. As technology evolves, we anticipate even more innovative applications to emerge from this robust framework.
Description
WebLLM serves as a robust inference engine for language models that operates directly in web browsers, utilizing WebGPU technology to provide hardware acceleration for efficient LLM tasks without needing server support. This platform is fully compatible with the OpenAI API, which allows for smooth incorporation of features such as JSON mode, function-calling capabilities, and streaming functionalities. With native support for a variety of models, including Llama, Phi, Gemma, RedPajama, Mistral, and Qwen, WebLLM proves to be adaptable for a wide range of artificial intelligence applications. Users can easily upload and implement custom models in MLC format, tailoring WebLLM to fit particular requirements and use cases. The integration process is made simple through package managers like NPM and Yarn or via CDN, and it is enhanced by a wealth of examples and a modular architecture that allows for seamless connections with user interface elements. Additionally, the platform's ability to support streaming chat completions facilitates immediate output generation, making it ideal for dynamic applications such as chatbots and virtual assistants, further enriching user interaction. This versatility opens up new possibilities for developers looking to enhance their web applications with advanced AI capabilities.
API Access
Has API
API Access
Has API
Integrations
Gemma
Dolly
Gemini
Gemini Enterprise
Gemini Nano
Google AI Edge Gallery
Google AI Studio
Hugging Face
JAX
Llama 3
Integrations
Gemma
Dolly
Gemini
Gemini Enterprise
Gemini Nano
Google AI Edge Gallery
Google AI Studio
Hugging Face
JAX
Llama 3
Pricing Details
No price information available.
Free Trial
Free Version
Pricing Details
Free
Free Trial
Free Version
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Deployment
Web-Based
On-Premises
iPhone App
iPad App
Android App
Windows
Mac
Linux
Chromebook
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Customer Support
Business Hours
Live Rep (24/7)
Online Support
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Types of Training
Training Docs
Webinars
Live Training (Online)
In Person
Vendor Details
Company Name
Google DeepMind
Founded
2010
Country
United Kingdom
Website
deepmind.google/models/gemma/gemma-3n/
Vendor Details
Company Name
WebLLM
Website
webllm.mlc.ai/