Question 1

How much does Gemini 2.5 Flash-Lite cost per 1M tokens?

Accepted Answer

Gemini 2.5 Flash-Lite costs $0.1 per 1M input tokens and $0.4 per 1M output tokens. Cached input tokens are available at $0.01 per 1M, a 90% discount.

Question 2

Is Gemini 2.5 Flash-Lite cheap or expensive?

Accepted Answer

Gemini 2.5 Flash-Lite is one of the more affordable LLM APIs at $0.1/1M input tokens. It competes with other budget models for high-volume workloads.

Question 3

What is the context window of Gemini 2.5 Flash-Lite?

Accepted Answer

Gemini 2.5 Flash-Lite supports a context window of 1M tokens. This determines how much text you can send in a single API call — including system prompts, conversation history, and the actual query.

Question 4

Does Gemini 2.5 Flash-Lite support prompt caching?

Accepted Answer

Yes. Google offers cached input at $0.01/1M tokens — a 90% discount over the base input price. This helps with repeated system prompts and few-shot examples.

Question 5

How to reduce Gemini 2.5 Flash-Lite API costs?

Accepted Answer

Three strategies: (1) Enable prompt caching if your provider supports it — savings of up to 90% on repeated input. (2) Route simple queries to cheaper models. (3) Reduce output tokens with concise instructions.

Question 6

How much does one Gemini 2.5 Flash-Lite API call cost?

Accepted Answer

A typical request with 500 input tokens and 300 output tokens costs approximately $0.00017. The exact cost depends on your prompt length and desired response length. Use the cost calculator above to estimate for your specific usage pattern.

Gemini 2.5 Flash-Lite API Pricing

FAQ