Introducing Gemma 3n, available in early preview → https://goo.gle/4dAmnqk The model uses a cutting-edge architecture optimized for on-device usage. It brings multimodality, super fast inference, and more. Key features deliver: ✨ Expanded multimodal understanding to process audio alongside text, images, and video ✨ Optimized on-device efficiency for 1.5x faster response on mobile compared to Gemma 3 4B ✨ Privacy-first, offline-ready function ✨ And more Start building with live, interactive apps and sophisticated audio-centric experiences, including real-time speech transcription, translation, and rich voice-driven interactions. We’re working to roll out full multimodal features, and are collaborating with partners to bring Gemma 3n to the open-source community in the coming weeks. Explore the model’s text and image capabilities in AI Edge, and text capabilities in Google AI Studio: ai.studio
Google AI for Developers
Technology, Information and Internet
AI for every developer. So what will you build?
About us
Our goal is to equip developers with the most advanced models to build new applications, helpful tools to write better and faster code, and make it easy to integrate across platforms and devices.
- Website
-
https://goo.gle/ai-devs
External link for Google AI for Developers
- Industry
- Technology, Information and Internet
- Company size
- 10,001+ employees
Updates
-
🎧🎉 AI Release Notes: #GoogleIO Edition → https://goo.gle/4dPOodH Unpack the latest AI news including Gemini 2.5 Pro Deep Think, Veo 3, and more with Josh Woodward, Tulsee Doshi, and host Logan Kilpatrick.
-
🆕 Gemini API updates from #GoogleIO → https://goo.gle/4jkZEQa Start building with the improved 2.5 Flash Preview, advanced text-to-speech (TTS), native audio dialog, and new tools like URL Context and Thought Summaries for better debugging and context.
-
-
See Native Audio in action 🤠🦊 Our "Mumble Jumble" demo in Google AI Studio showcases the Live API's advanced voice capabilities: natural flow, distinct tone, emotion, and multilingual support. Try the demo and then explore the code to integrate these features yourself: https://goo.gle/4jjz5em
-
Watch as Logan Kilpatrick shares what we shipped this year at #GoogleIO.
-
BIG news for Gemma this week. Gemma 3n, MedGemma, and more. Watch speakers Gus Martins and Omar Sanseviero to learn what’s new 🚀 Watch the full session → https://goo.gle/43u1msD
-
The experimental demo of Gemini Diffusion generates content significantly faster than our fastest model so far, while matching its coding performance. Sign up for the waitlist: https://goo.gle/3F5Alnl
-
Learn how to build your own AI agents with Gemini models using popular open-source frameworks such as LangGraph (from LangChain), CrewAI, LlamaIndex and Composio 🤖 → https://goo.gle/3GZfhiP
-
-
SignGemma is a sign language understanding model that’s coming later this year to the Gemma family 🤟🏼 Share your feedback and interest in early testing: https://goo.gle/SignGemma It’s a massively multilingual model that’s best at translating ASL into English text, enabling further development of tech access for Deaf and Hard of Hearing users 🧏