How much does Meta: Llama 3.3 70B Instruct cost?

$0.13 per million input tokens and $0.40 per million output tokens.

What is Meta: Llama 3.3 70B Instruct's context window?

131,072 tokens, roughly 196 pages of text in a single request.

Does Meta: Llama 3.3 70B Instruct support tool calling?

It supports tool calling, structured output.

Can Meta: Llama 3.3 70B Instruct process images?

No, it is text-only on the input side.

meta-llama

Meta: Llama 3.3 70B Instruct

Meta's Llama 3.3 70B Instruct is designed for text-based tasks with a context length of 131,072 tokens and supports tool integration but lacks reasoning capabilities or structured output support. It processes inputs and generates responses exclusively in text format. This model should be considered by users requiring extensive input contexts or those who frequently integrate tools into their workflows, given its $0.13/M input and $0.4/M output token pricing. Its blended benchmark score of 10.4 across three independent benchmarks indicates solid performance but may not yet match the leading models in specialized areas like coding or agentic tasks.

Query via API → View on meta-llama → Estimate cost

Quality Score

86/100

price + capability + benchmarks

Input Price

$0.13

per 1M tokens

Output Price

$0.40

per 1M tokens

Context Window

131,072

tokens

Benchmark results

Independent, published benchmarks. Blended score 10.4 across 3 benchmarks, last refreshed 2026-07-29. How scoring works →

Benchmark	Measures	Score
AI Index	broad capability composite	15.5
AI Index Coding	software engineering tasks	19.7
AI Index Agentic	multi-step tool-using tasks	0.6

Model ID: meta-llama/llama-3.3-70b-instruct
Vendor: meta-llama
Released: December 2024
Tokenizer: Llama3
Input Modalities: text
Output Modalities: text
Max Output: 128,000 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: text only
Audio: no
Moderated: no

What it costs in practice

Computed from the current $0.13/M input and $0.40/M output rates. Run your own numbers →

Job	Tokens	Cost
Summarize a 50-page report	30k in / 1.5k out	under $0.01
Classify 1,000 customer emails	500k in / 50k out	$0.09
A month of a busy support chatbot	5M in / 2M out	$1.45

Similar models

meta-llama

Quick answers

How much does Meta: Llama 3.3 70B Instruct cost?: $0.13 per million input tokens and $0.40 per million output tokens.
What is Meta: Llama 3.3 70B Instruct's context window?: 131,072 tokens, roughly 196 pages of text in a single request.
Does Meta: Llama 3.3 70B Instruct support tool calling?: It supports tool calling, structured output.
Can Meta: Llama 3.3 70B Instruct process images?: No, it is text-only on the input side.

Meta: Llama 3.3 70B Instruct

Benchmark results

What it costs in practice

Similar models

Meta: Llama 3.1 70B Instruct

Meta: Llama 3.1 8B Instruct

Meta: Llama Guard 4 12B

Meta: Llama 3.2 3B Instruct

Meta: Llama 4 Maverick

Meta: Llama 4 Scout

Quick answers