Yesterday at 5:39 AM

Google launches new open Gemma 4 12B multimodal model for laptops with 16 GB of RAM

Google DeepMind has introduced Gemma 4 12B, a new 12 billion parameter open AI model designed to run multimodal tasks directly on standard laptops. It processes text, images, and audio together without separate encoders, reducing processing time, memory use, and latency. The model can run locally on devices with 16 GB of system RAM or VRAM, making it practical for many consumer and enterprise laptops.

According to Google, Gemma 4 12B has about half the memory footprint of Gemma 4 26B while matching much of its benchmark performance. It is also the first mid-sized Gemma model with native audio processing, supporting speech recognition, code generation, image understanding, and video analysis. In one test, it analyzed a five-minute keynote by processing 313 frames alongside the audio.

Gemma 4 12B also includes Multi-Token Prediction drafters by default, improving generation speed and efficiency. Google says the model supports complex multistep reasoning and agentic workflows that previously required larger Gemma models. It is available through Hugging Face, Kaggle, Ollama, Google AI Edge Gallery and LM Studio under the Apache 2.0 license, allowing commercial use.

Yesterday by Mauricio B. Holguin

justarandom found this interesting

MORE ABOUT: #AI Chatbots #Large Language Model (LLM) Tools #AI Writing Tools #Google Gemma

Google Gemma

AI Chatbot
Free Personal
Open Source

Google Gemma is an AI chatbot leveraging advanced research and technology from Google's Gemini models. Part of a family of lightweight, state-of-the-art open models, it offers AI-powered interactions. Rated 5, it stands out for its sophisticated capabilities in natural language processing and user engagement, with top alternatives yet to be specified.

Related news

All news about Google Gemma »

External links

Introducing Gemma 4 12B
Google The Keyword • Official source
Google's new open source Gemma 4 12B analyzes audio, video — and runs entirely locally on a typical 16GB enterprise laptop
VentureBeat
Google Deepmind's Gemma 4 12B squeezes multimodal AI onto a laptop with just 16 GB of RAM
The Decoder
Google's new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM
Ars Technica

Comments

xdillfrescott

CommentJun 4, 2026

This is actually a pretty cool one as it can take in audio too which is a bit rare