Google: Gemini 3.1 Flash Lite Preview
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across key capabilities. Improvements span audio input/ASR, RAG snippet ranking, translation, data extraction, and code completion. Supports full thinking levels (minimal, low, medium, high) for fine-grained cost/performance trade-offs. Priced at half the cost of Gemini 3 Flash.
Undisclosed
Parameters
1.0M tokens
Context Window
Proprietary
License
Mar 3, 2026
Released
๐ฐ Pricing
Input
$0.25
per 1M tokens
Output
$1.50
per 1M tokens
API Available
This model is accessible via API for integration into your applications.
โญ Related Models
GPT-5
OpenAI
OpenAI's latest flagship. Advanced multimodal capabilities with native tool use, improved reasoning, and massive knowledge.
GPT-4o
OpenAI
Omni model supporting text, vision, and audio natively. Fast and capable with strong multimodal understanding.
Gemini 2.5 Pro
Google's most capable model with 1M+ token context, native multimodal understanding, and deep research capabilities.