Gemini 2.5 Flash

Google Vertex AI

Fast, cost-efficient model on Vertex AI with strong reasoning, multimodal input, and sub-second response times

Pricing

per 1M tokens

Input / 1M: $0.30
Output / 1M: $2.50
Cache Read / 1M: $0.03
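
The per-token rates above can be turned into a rough per-request estimate. The sketch below is illustrative only: the function name `estimate_cost` and its parameters are hypothetical, and it assumes that cached input tokens are billed at the cache read rate instead of the regular input rate. Check the provider's billing documentation before relying on it.

```python
# Prices in USD per 1M tokens, copied from the pricing list above.
PRICE_PER_MILLION = {
    "input": 0.30,
    "output": 2.50,
    "cache_read": 0.03,
}


def estimate_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate the USD cost of one request.

    cached_tokens is the portion of input_tokens served from the cache.
    Assumption: those tokens are billed at the cache read rate and the
    remaining input tokens at the regular input rate.
    """
    if min(input_tokens, output_tokens, cached_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * PRICE_PER_MILLION["input"]
        + cached_tokens * PRICE_PER_MILLION["cache_read"]
        + output_tokens * PRICE_PER_MILLION["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # 200K input tokens (150K of them cached) and 4K output tokens.
    cost = estimate_cost(200_000, 4_000, cached_tokens=150_000)
    print(f"${cost:.4f}")
```

Under these assumptions, output tokens dominate the bill for long generations, since the output rate is more than eight times the input rate.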

Specifications

API Model ID: gemini-2.5-flash
Context Window: 1M tokens
Max Output: 65K tokens

Modalities

text, image, audio, video

Capabilities

tool-use, streaming, json-mode, vision, code, extended-thinking

Other Google Vertex AI Text / Chat Models

Model                   Input / 1M   Output / 1M   Cache Read / 1M   Cache Write / 1M
Gemini 2.5 Pro          $1.25        $10.00        $0.13             -
Gemini 2.5 Flash-Lite   $0.10        $0.40         $0.01             -
Claude Opus 4.6         $5.00        $25.00        -                 -
Claude Sonnet 4.6       $3.00        $15.00        -                 -
Claude Haiku 4.5        $1.00        $5.00         -                 -