OpenAI: GPT-5.4 Image 2
GPT-5.4 Image 2 is a paid model from OpenAI that accepts text, images, and files as input. It offers a 272,000-token context window and can produce up to 128,000 tokens per completion, making it suited for long-document work. The model supports reasoning but does not support tool calling, and structured output availability is unconfirmed. At $8.00 per million input tokens and $15.00 per million output tokens, it sits at a premium price point relative to many alternatives. There is currently no independent benchmark coverage to validate its performance, so buyers cannot compare it against other models on objective measures. Teams that specifically need image and file understanding alongside a large context window may want to shortlist it, but given the unproven standing on benchmarks, cost-sensitive users or those needing tool use should weigh other options carefully before committing.
- Model ID
- openai/gpt-5.4-image-2
- Vendor
- openai
- Tokenizer
- GPT
- Input Modalities
- image, text, file
- Output Modalities
- image, text
- Max Output
- 128,000 tokens
- Tool Calling
- not supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- yes