Gemma 4 12B: Google’s Encoder-Free Multimodal Model for Text and Vision10. June 2026AI Models, GoogleGemma 4 12B integrates text and vision capabilities in a single, encoder-free architecture, reducing deployment complexity while improving resource efficiency. Share on: