Google's new Gemini model is its fastest yet

Technology Mar 04, 2026

Google just launched Gemini 3.1 Flash-Lite—the fastest and most cost-efficient Gemini 3 series model.
Available now for preview, it can handle text, images, audio, video, and documents with a massive 1 million input token capacity.

It's cheaper and faster than the previous Pro version

Flash-Lite is priced at $0.25 per million input tokens and $1.50 per million output tokens—cheaper than the previous Pro version.
It's seriously quick too: outputs arrive at 363 tokens per second, and it has a 2.5X faster Time to First Answer Token compared with Gemini 2.5 Flash.

You can adjust how 'deep' it thinks for certain tasks

You can adjust how "deep" it thinks for tasks like translation or data extraction.
Early testers (like Latitude and Whering) say its accuracy on tough problems rivals much bigger models.

It beats Google's own older models in speed and quality

Gemini 3.1 Flash-Lite is designed for high efficiency and performance—and beats Google's own older models in speed and quality across different types of content.