Question 1

How much does Gemini 2.5 Flash cost per 1M tokens?

Accepted Answer

Gemini 2.5 Flash costs $0.30 per 1M input tokens and $2.50 per 1M output tokens. Cached input tokens cost $0.03 per 1M, a 90% discount off the base input price.

Question 2

Is Gemini 2.5 Flash cheap or expensive?

Accepted Answer

Gemini 2.5 Flash is one of the more affordable LLM APIs at $0.30 per 1M input tokens and $2.50 per 1M output tokens. It competes with other budget models for high-volume workloads.

Question 3

What is the context window of Gemini 2.5 Flash?

Accepted Answer

Gemini 2.5 Flash supports a context window of 1M tokens. This determines how much text you can send in a single API call, including system prompts, conversation history, and the actual query.
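
As a rough illustration of budgeting against that window, the sketch below adds up the three input components named above and checks them against the limit. It assumes the FAQ's "1M tokens" means exactly 1,000,000, and that you already have token counts from your tokenizer or API; the function names are hypothetical.

```python
# Assumption: the FAQ's "1M tokens" is taken as exactly 1,000,000 tokens.
CONTEXT_WINDOW_TOKENS = 1_000_000


def remaining_tokens(system_tokens: int, history_tokens: int, query_tokens: int) -> int:
    """Tokens left in the window after the input is counted (negative if over)."""
    used = system_tokens + history_tokens + query_tokens
    return CONTEXT_WINDOW_TOKENS - used


def fits_in_context(system_tokens: int, history_tokens: int, query_tokens: int) -> bool:
    """True if system prompt + conversation history + query fit in one call."""
    return remaining_tokens(system_tokens, history_tokens, query_tokens) >= 0


if __name__ == "__main__":
    # Hypothetical request: 2,000-token system prompt, 50,000 tokens of history,
    # and a 500-token query.
    print(fits_in_context(2_000, 50_000, 500))
    print(remaining_tokens(2_000, 50_000, 500))
```

When the check fails, the usual options are to trim or summarize older conversation history before sending the request.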

Question 4

Does Gemini 2.5 Flash support prompt caching?

Accepted Answer

Yes. Google offers cached input at $0.03 per 1M tokens, a 90% discount off the base input price of $0.30 per 1M. This helps most with repeated system prompts and few-shot examples.
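
To see what the discount means for a single request, here is a minimal sketch that prices the cached and uncached parts of the input separately. It uses only the two per-token rates quoted in this FAQ and ignores any storage fees or minimum cache sizes that may apply; the function name and the example token counts are hypothetical.

```python
# Rates quoted in this FAQ, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.30
CACHED_INPUT_PRICE_PER_M = 0.03


def input_cost(total_input_tokens: int, cached_tokens: int = 0) -> float:
    """Input cost in USD when `cached_tokens` of the input are billed at the cached rate."""
    if not 0 <= cached_tokens <= total_input_tokens:
        raise ValueError("cached_tokens must be between 0 and total_input_tokens")
    uncached_tokens = total_input_tokens - cached_tokens
    total = (uncached_tokens * INPUT_PRICE_PER_M
             + cached_tokens * CACHED_INPUT_PRICE_PER_M)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 2,000 input tokens, of which a 1,500-token
    # system prompt is served from cache.
    print(f"without cache: ${input_cost(2_000):.6f}")
    print(f"with cache:    ${input_cost(2_000, cached_tokens=1_500):.6f}")
```

The larger the share of each request that is a repeated prefix, the closer the input bill gets to the full 90% saving.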

Question 5

How can I reduce Gemini 2.5 Flash API costs?

Accepted Answer

Three strategies: (1) Enable prompt caching if your provider supports it, which saves up to 90% on repeated input. (2) Route simple queries to cheaper models. (3) Reduce output tokens by asking for concise responses; output is billed at $2.50 per 1M tokens, more than eight times the input rate.
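
A quick way to compare strategies (1) and (3) at volume is to price a whole workload before and after the change. The sketch below uses the rates quoted in this FAQ; the request volume, token counts, and function name are hypothetical, and routing to cheaper models is left out because it depends on prices not covered here.

```python
# Rates quoted in this FAQ, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.30
CACHED_INPUT_PRICE_PER_M = 0.03
OUTPUT_PRICE_PER_M = 2.50


def workload_cost(requests: int, input_tokens: int, output_tokens: int,
                  cached_tokens: int = 0) -> float:
    """Total USD cost of `requests` identical calls; token counts are per call."""
    per_request = (
        (input_tokens - cached_tokens) * INPUT_PRICE_PER_M
        + cached_tokens * CACHED_INPUT_PRICE_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000
    return requests * per_request


if __name__ == "__main__":
    # Hypothetical workload: 1M requests, 500 input and 300 output tokens each.
    baseline = workload_cost(1_000_000, 500, 300)
    # Same workload with 400 input tokens cached and output trimmed to 200 tokens.
    optimized = workload_cost(1_000_000, 500, 200, cached_tokens=400)
    print(f"baseline:  ${baseline:.2f}")
    print(f"optimized: ${optimized:.2f}")
```

In this made-up scenario most of the saving comes from the shorter responses, because output tokens dominate the bill.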

Question 6

How much does one Gemini 2.5 Flash API call cost?

Accepted Answer

A typical request with 500 input tokens and 300 output tokens costs approximately $0.0009: about $0.00015 for the input plus $0.00075 for the output. The exact cost depends on your prompt length and response length. Use the cost calculator above to estimate for your specific usage pattern.
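
The same arithmetic can be written as a small helper for other request sizes. This is a minimal sketch using the per-token rates quoted in this FAQ, with no caching applied; the function name is hypothetical.

```python
# Rates quoted in this FAQ, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.30
OUTPUT_PRICE_PER_M = 2.50


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """USD cost of one call: each token count times its per-1M-token rate."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # The example from the answer above: 500 input tokens, 300 output tokens.
    print(f"${request_cost(500, 300):.4f}")
```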

Gemini 2.5 Flash API Pricing

FAQ