Google Gemini Flash Latest

Google Gemini Flash Latest is a multimodal model from Google that accepts text, image, video, audio, and file inputs and returns text. Its context window is 1,048,576 tokens, with up to 65,536 output tokens per response, which accommodates very long documents or extended conversations. The listing marks tool calling, structured output, and reasoning mode as supported. At $1.50 per million input tokens and $9.00 per million output tokens, pricing sits in a mid-range tier; the output rate is six times the input rate, so it warrants attention for high-volume generation workloads. There is currently no independent benchmark coverage to reference, so quality comparisons against rivals rest entirely on your own testing. Teams with multimodal pipelines, large-context requirements, or tool-calling workflows should shortlist it; buyers who rely on third-party benchmark data before committing will need to wait for that coverage to emerge.
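
To make the pricing concrete, the following is a minimal sketch of a per-request cost estimate using the two listed rates. The function name `estimate_cost` and the example token counts are illustrative assumptions, and the sketch assumes every token is billed at the flat listed rate (no caching discounts, tiered pricing, or separate charges for reasoning tokens).

```python
# Listed rates in USD per 1M tokens (from the listing above).
INPUT_PRICE_PER_M = 1.50
OUTPUT_PRICE_PER_M = 9.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at flat listed rates.

    Hypothetical helper: assumes no discounts or extra billing categories.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 200,000-token document summarized into 4,000 tokens.
    print(f"${estimate_cost(200_000, 4_000):.4f}")
```

Because output tokens cost six times as much as input tokens, a workload that generates long responses can be dominated by the output term even when prompts are large.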

Quality Score: 100/100 (price + capability + benchmarks)
Input Price: $1.50 per 1M tokens
Output Price: $9.00 per 1M tokens
Context Window: 1,048,576 tokens
Model ID: ~google/gemini-flash-latest
Vendor: ~google
Tokenizer: Router
Input Modalities: text, image, video, file, audio
Output Modalities: text
Max Output: 65,536 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: ✓ supported
Vision: ✓ accepts images
Audio: ✓ accepts audio
Moderated: no
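
The context window and max output figures can be combined into a rough prompt budget. The sketch below assumes that prompt and response tokens share the 1,048,576-token context window, which this listing does not confirm; the helper names `max_input_tokens` and `fits` are hypothetical.

```python
# Limits taken from the listing above.
CONTEXT_WINDOW = 1_048_576
MAX_OUTPUT = 65_536


def max_input_tokens(reserved_output: int = MAX_OUTPUT) -> int:
    """Largest prompt that still leaves room for `reserved_output` tokens.

    Assumption: input and output tokens count against the same window.
    """
    if not 0 <= reserved_output <= MAX_OUTPUT:
        raise ValueError("reserved_output must be between 0 and MAX_OUTPUT")
    return CONTEXT_WINDOW - reserved_output


def fits(prompt_tokens: int, reserved_output: int = MAX_OUTPUT) -> bool:
    """Return True if the prompt fits alongside the reserved output budget."""
    return 0 <= prompt_tokens <= max_input_tokens(reserved_output)


if __name__ == "__main__":
    print(max_input_tokens())      # budget when reserving the full max output
    print(fits(1_000_000, 8_192))  # large prompt with a short reserved reply
```

Token counts here would come from whichever tokenizer the serving endpoint uses, so treat the result as an estimate and leave some headroom.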
