Nous: Hermes 3 405B Instruct
Nous: Hermes 3 405B Instruct is a text-only model from Nous Research with a 131,072-token context window and a maximum completion length of 16,384 tokens. It does not support tool use or a native reasoning mode, so workflows that depend on function calling will need to look elsewhere. Structured output is listed as supported. Pricing is symmetrical at $1.00 per million tokens for both input and output, which makes cost estimation easy. The harder issue is that no independent benchmark coverage is currently available, so performance relative to similarly priced models is unverified. Builders who already have experience with the Hermes series and want a large-context text generation model at a flat, predictable rate may find it worth evaluating. Teams that need benchmark evidence before committing should treat the model as unproven until third-party scores become available.
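Because the rate is the same for input and output, a cost estimate reduces to one multiplication over the total token count. The following is a minimal sketch using the figures quoted above; the function names are hypothetical, and it assumes that prompt and completion tokens share the 131,072-token context window, which the listing does not state explicitly.

```python
# Figures quoted in the listing.
CONTEXT_WINDOW = 131_072        # tokens
MAX_OUTPUT = 16_384             # tokens
PRICE_PER_MILLION_USD = 1.00    # same rate for input and output


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost of one request in US dollars."""
    total_tokens = input_tokens + output_tokens
    return total_tokens * PRICE_PER_MILLION_USD / 1_000_000


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the output cap and the context window.

    Assumption: input and output tokens count against the same window.
    """
    within_output_cap = output_tokens <= MAX_OUTPUT
    within_window = input_tokens + output_tokens <= CONTEXT_WINDOW
    return within_output_cap and within_window
```

Under these assumptions, a 100,000-token prompt with a full 16,384-token completion fits the window and costs roughly $0.12, while a 120,000-token prompt with the same completion length would exceed it.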
- Model ID: nousresearch/hermes-3-llama-3.1-405b
- Vendor: nousresearch
- Tokenizer: Llama3
- Input Modalities: text
- Output Modalities: text
- Max Output: 16,384 tokens
- Tool Calling: not supported
- Structured Output: supported
- Reasoning Mode: not supported
- Vision: text only
- Audio: no
- Moderated: no