Gemini 2.5 Flash-Lite

Google Vertex AI

Google's most cost-efficient multimodal model on Vertex AI, optimized for low-latency, high-volume tasks.

Pricing

per 1M tokens

Input / 1M: $0.10
Output / 1M: $0.40
Cache Read / 1M: $0.01
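
As a rough illustration of how these per-1M-token rates translate into a bill, the sketch below estimates the cost of a request or workload. The function name `estimate_cost` and the assumption that cached tokens are a subset of the input tokens, billed at the cache-read rate instead of the input rate, are illustrative and not taken from the listing.

```python
# Prices in USD per 1M tokens, copied from the listing above.
PRICE_PER_MILLION = {"input": 0.10, "output": 0.40, "cache_read": 0.01}


def estimate_cost(input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of a request or workload (hypothetical helper).

    Assumption: cached_tokens are part of input_tokens and are billed at the
    cache-read rate instead of the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 10,000 requests, each with 2,000 input tokens and 500 output tokens.
    cost = estimate_cost(10_000 * 2_000, 10_000 * 500)
    print(f"Estimated cost: ${cost:.2f}")
```

Under these assumptions, 20M input tokens and 5M output tokens come to about $2.00 each, roughly $4.00 in total. Actual billing may differ if other charges or pricing tiers apply.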

Specifications

API Model ID: gemini-2.5-flash-lite
Context Window: 1M tokens
Max Output: 65K tokens
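
A pre-flight check against these limits can catch oversized requests before they are sent. The sketch below is a minimal example; it assumes the rounded figures from the listing (1,000,000 and 65,000 tokens, which may be slightly below the exact limits) and treats the two limits independently. The name `check_request` is hypothetical.

```python
# Limits taken from the listing above, using the rounded figures as
# conservative assumptions; the exact limits may be slightly higher.
MODEL_ID = "gemini-2.5-flash-lite"
CONTEXT_WINDOW_TOKENS = 1_000_000  # "1M tokens"
MAX_OUTPUT_TOKENS = 65_000         # "65K tokens"


def check_request(prompt_tokens, requested_output_tokens):
    """Return a list of human-readable problems; an empty list means OK.

    Hypothetical helper: the caller is expected to supply token counts
    obtained elsewhere (for example, from a tokenizer or a counting API).
    """
    problems = []
    if prompt_tokens > CONTEXT_WINDOW_TOKENS:
        problems.append(
            f"prompt has {prompt_tokens} tokens, above the "
            f"{CONTEXT_WINDOW_TOKENS}-token context window"
        )
    if requested_output_tokens > MAX_OUTPUT_TOKENS:
        problems.append(
            f"requested {requested_output_tokens} output tokens, above the "
            f"{MAX_OUTPUT_TOKENS}-token output cap"
        )
    return problems


if __name__ == "__main__":
    for issue in check_request(1_200_000, 8_000):
        print(f"{MODEL_ID}: {issue}")
```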

Modalities

text, image, audio, video

Capabilities

tool-use, streaming, json-mode, vision, code

Other Google Vertex AI Text / Chat Models

Model              Input / 1M  Output / 1M  Cache Read / 1M  Cache Write / 1M
Gemini 2.5 Pro     $1.25       $10.00       $0.13
Gemini 2.5 Flash   $0.30       $2.50        $0.03
Claude Opus 4.6    $5.00       $25.00
Claude Sonnet 4.6  $3.00       $15.00
Claude Haiku 4.5   $1.00       $5.00
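
To compare the listed models on one workload, the input and output rates can be applied to the same token counts. The sketch below is illustrative only: it uses the input and output prices shown on this page, ignores cache pricing and any pricing tiers not shown here, and the names `workload_cost` and `rank_by_cost` are hypothetical.

```python
# (input, output) prices in USD per 1M tokens, copied from this page.
PRICES = {
    "Gemini 2.5 Flash-Lite": (0.10, 0.40),
    "Gemini 2.5 Flash": (0.30, 2.50),
    "Gemini 2.5 Pro": (1.25, 10.00),
    "Claude Haiku 4.5": (1.00, 5.00),
    "Claude Sonnet 4.6": (3.00, 15.00),
    "Claude Opus 4.6": (5.00, 25.00),
}


def workload_cost(model, input_tokens, output_tokens):
    """USD cost of a workload on one model, ignoring caching."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_by_cost(input_tokens, output_tokens):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    costs = [
        (model, workload_cost(model, input_tokens, output_tokens))
        for model in PRICES
    ]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    # Example workload: 50M input tokens and 10M output tokens.
    for model, cost in rank_by_cost(50_000_000, 10_000_000):
        print(f"{model:<22} ${cost:>9.2f}")
```

Price is only one axis of comparison; the models differ in capability, so the cheapest row is not necessarily the right choice for a given task.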