openai

OpenAI: GPT-4o (2024-11-20)

GPT-4o (2024-11-20) is a multimodal model from OpenAI that accepts text, images, and files as input, with a 128,000-token context window and a 16,384-token output ceiling. It supports tool use, making it suitable for agentic workflows, but it does not include a built-in reasoning mode. Structured output support is unconfirmed in available data. At $2.50 per million input tokens and $10.00 per million output tokens, it sits in the mid-tier pricing range for capable frontier models. Its blended benchmark score of 17.3 is drawn from only one benchmark (aider_polyglot at 18.2), so cross-task performance comparisons are limited and should be treated cautiously. Teams that need reliable multimodal input handling and tool integration, and who already work within the OpenAI ecosystem, are the most natural fit, but buyers wanting broad, independently verified benchmark coverage should weigh that gap before committing.

Quality Score
89/100
price + capability + benchmarks
Input Price
$2.50
per 1M tokens
Output Price
$10.00
per 1M tokens
Context Window
128,000
tokens
Model ID
openai/gpt-4o-2024-11-20
Vendor
openai
Tokenizer
GPT
Input Modalities
text, image, file
Output Modalities
text
Max Output
16,384 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
not supported
Vision
✓ accepts images
Audio
no
Moderated
yes

Similar models