OpenAI: GPT-4o (2024-11-20)
GPT-4o (2024-11-20) is a multimodal model from OpenAI that accepts text, images, and files as input, with a 128,000-token context window and a 16,384-token output ceiling. It supports tool use, making it suitable for agentic workflows, but it does not include a built-in reasoning mode. Structured output support is unconfirmed in available data. At $2.50 per million input tokens and $10.00 per million output tokens, it sits in the mid-tier pricing range for capable frontier models. Its blended benchmark score of 17.3 is drawn from only one benchmark (aider_polyglot at 18.2), so cross-task performance comparisons are limited and should be treated cautiously. Teams that need reliable multimodal input handling and tool integration, and who already work within the OpenAI ecosystem, are the most natural fit, but buyers wanting broad, independently verified benchmark coverage should weigh that gap before committing.
- Model ID
- openai/gpt-4o-2024-11-20
- Vendor
- openai
- Tokenizer
- GPT
- Input Modalities
- text, image, file
- Output Modalities
- text
- Max Output
- 16,384 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- not supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- yes