Google Gemini Flash Latest
Google Gemini Flash Latest is a multimodal model from Google that accepts text, image, video, audio, and file inputs, making it one of the broader input options available. Its context window reaches 1,048,576 tokens, which accommodates very long documents or extended conversations. The model supports tool use and reasoning, though structured output support is unconfirmed from available data. At $1.50 per million input tokens and $9.00 per million output tokens, the pricing sits in a mid-range tier; the input cost is competitive, but the output cost warrants attention for high-volume generation workloads. There is currently no independent benchmark coverage to reference, so quality comparisons against rivals rest entirely on your own testing. Teams with multimodal pipelines, large-context requirements, or tool-calling workflows should shortlist it, but buyers who rely on third-party benchmark data before committing will need to wait for that coverage to emerge.
- Model ID
- ~google/gemini-flash-latest
- Vendor
- Tokenizer
- Router
- Input Modalities
- text, image, video, file, audio
- Output Modalities
- text
- Max Output
- 65,536 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- ✓ accepts images
- Audio
- ✓ accepts audio
- Moderated
- no