Google has announced the launch of Gemini 3.1 Flash-Lite model and it comes with thinking levels in AI Studio and Vertex AI.
Google has announced the launch of Gemini 3.1 Flash-Lite and says it is the company's fastest and most cost-efficient Gemini 3 series model.
Priced at $0.25/1M input tokens and $1.50/1M output tokens, 3.1 Flash-Lite delivers enhanced performance at a fraction of the cost of larger models, said Google.
Gemini 3.1 Flash-Lite outperforms 2.5 Flash with a 2.5X faster Time to First Answer Token and 45% increase in output speed, according to the Artificial Analysis benchmark while maintaining similar or better quality, noted Google. This low Latency is needed for high-frequency workflows, making it an ideal model for developers to build responsive, real-time experiences.
Further, 3.1 Flash-Lite achieves an Elo score of 1432 on the Arena.ai Leaderboard and outperforms other models of similar tier across reasoning and multimodal understanding benchmarks, including 86.9% on GPQA Diamond and 76.8% on MMMU Pro-even surpassing larger Gemini models from prior generations like 2.5 Flash.
Then, Gemini 3.1 Flash-Lite comes standard with thinking levels in AI Studio and Vertex AI, giving developers the control and flexibility to select how much the model "thinks" for a task. 3.1 Flash-Lite can further also tackle tasks at scale, like high-volume translation and content moderation, where cost is a priority. And it can also handle more complex workloads where more in-depth reasoning is needed, like generating user interfaces and dashboards, creating simulations or following instructions.
Starting today, 3.1 Flash-Lite is rolling out in preview to developers via the Gemini API in Google AI Studio and for enterprises via Vertex AI.
Meanwhile, Google recently announced Registered App Stores program for Android, where it will now offer a streamlined installation flow for app stores that meet certain quality and safety benchmarks.
Once this change has rolled out, app stores that choose to participate in this optional program will have registered with Google and so users who sideload them will have a more simplified installation flow.
