Video · best for
Best AI model for Video Auto-Tagging (2026)
Bulk video metadata generation. Ranked from 350 live models on the OpenRouter catalog, weighted for video input, low latency.
| # | Model | Score | In / 1M | Out / 1M | Context | |
|---|---|---|---|---|---|---|
| 1 | Xiaomi: MiMo-V2.5xiaomi/mimo-v2.5 | 123 | $0.40 | $2.00 | 1,048,576 | Try → |
| 2 | Google: Gemma 4 26B A4B (free)google/gemma-4-26b-a4b-it:free | 123 | Free | Free | 262,144 | Try → |
| 3 | Google: Gemma 4 26B A4B google/gemma-4-26b-a4b-it | 123 | $0.06 | $0.33 | 262,144 | Try → |
| 4 | Google: Gemma 4 31B (free)google/gemma-4-31b-it:free | 123 | Free | Free | 262,144 | Try → |
| 5 | Google: Gemma 4 31Bgoogle/gemma-4-31b-it | 123 | $0.13 | $0.38 | 262,144 | Try → |
| 6 | Qwen: Qwen3.6 Plusqwen/qwen3.6-plus | 123 | $0.33 | $1.95 | 1,000,000 | Try → |
| 7 | Xiaomi: MiMo-V2-Omnixiaomi/mimo-v2-omni | 123 | $0.40 | $2.00 | 262,144 | Try → |
| 8 | ByteDance Seed: Seed-2.0-Litebytedance-seed/seed-2.0-lite | 123 | $0.25 | $2.00 | 262,144 | Try → |
| 9 | Qwen: Qwen3.5-9Bqwen/qwen3.5-9b | 123 | $0.10 | $0.15 | 262,144 | Try → |
| 10 | Google: Gemini 3.1 Flash Lite Previewgoogle/gemini-3.1-flash-lite-preview | 123 | $0.25 | $1.50 | 1,048,576 | Try → |
| 11 | ByteDance Seed: Seed-2.0-Minibytedance-seed/seed-2.0-mini | 123 | $0.10 | $0.40 | 262,144 | Try → |
| 12 | Qwen: Qwen3.5-35B-A3Bqwen/qwen3.5-35b-a3b | 123 | $0.16 | $1.30 | 262,144 | Try → |
| 13 | Qwen: Qwen3.5-27Bqwen/qwen3.5-27b | 123 | $0.20 | $1.56 | 262,144 | Try → |
| 14 | Qwen: Qwen3.5-122B-A10Bqwen/qwen3.5-122b-a10b | 123 | $0.26 | $2.08 | 262,144 | Try → |
| 15 | Qwen: Qwen3.5-Flashqwen/qwen3.5-flash-02-23 | 123 | $0.07 | $0.26 | 1,000,000 | Try → |
How we ranked these
For Video Auto-Tagging, we weight models on video input, low latency. Higher means better. Scores combine OpenRouter's model metadata (context length, modality support, tool calling, structured output, reasoning capability) with public pricing. See full methodology →