Summarize

Meet Google's fastest, cheapest AI model yet—Gemini 2.5 Flash‑Lite

By Mudit Dube

Jun 18, 2025

09:02 am

What's the story

Google has expanded its Gemini AI model family with the stable release of Gemini 2.5 Pro and Flash. The new models are now available for developers to build on, marking a major milestone in Google's AI journey. Gemini 2.5 builds on Google DeepMind's hybrid-reasoning architecture, aiming to deliver powerful AI reasoning capabilities with improved efficiency and speed across multiple applications. The tech giant also previewed its upcoming high-efficiency model, dubbed Gemini 2.5 Flash-Lite. This cost-friendly model was earlier in experimental stage.

Technological advancement

Gemini 2.5 Pro takes on OpenAI's GPT models

The Gemini 2.5 Pro mod, now available in stable form, is a major upgrade over its predecessors, making Google more competitive with OpenAI and its popular GPT models. It is positioned as the flagship model, offering enhanced creative, conversational, and coding capabilities. It now includes a "configurable thinking budget" feature that lets developers balance speed and computational resources. This addresses previous issues in long-form coherence and formatting, making the model more reliable for production use.

Cost efficiency

Preview of Gemini 2.5 Flash-Lite model

Google is also introducing Gemini 2.5 Flash-Lite, an experimental AI model. Currently in preview stage, it is the fastest and most cost-effective model in the 2.5 family. It maintains key capabilities—multimodal reasoning, tool access, and reasoning "thinking" mode—while dramatically reducing latency and pricing. It costs about $0.1 per million input tokens and $0.4 per million output tokens. In comparison, 2.5 Pro costs $1.25 per million input tokens and $10 per million output tokens.

Search integration

Custom versions of Flash models now in search

These updates mark Gemini 2.5's widespread availability across Google AI Studio, Vertex AI, and the consumer Gemini app. Custom versions already power parts of Google Search. The release solidifies Gemini's position in the competitive AI landscape alongside OpenAI, Anthropic, and others, as Google expands its "thinking" model arsenal for developers and users.

Meet Google's fastest, cheapest AI model yet—Gemini 2.5 Flash‑Lite

Gemini 2.5 Pro takes on OpenAI's GPT models

Preview of Gemini 2.5 Flash-Lite model

Custom versions of Flash models now in search

Meet Google's fastest, cheapest AI model yet—Gemini 2.5 Flash‑Lite