Top picks for Medical Note Summarization (2026)

Distills patient notes into concise summaries; not a substitute for a doctor. The shortlist is drawn from 357 live models in the OpenRouter catalog, weighted for reasoning quality, context window, and structured output.

What this is: a capability-matched shortlist, not a benchmark-tested winner. Models are scored by how well their declared specs (structured output, reasoning, context, modality, price) fit Medical Note Summarization. Pair this list with benchmark sources such as Artificial Analysis or LMSys Arena before you ship.
| # | Model | Model ID | Score | Input ($ / 1M tokens) | Output ($ / 1M tokens) | Context (tokens) |
|---|-------|----------|-------|-----------------------|------------------------|------------------|
| 1 | Qwen: Qwen3.5 Plus 2026-04-20 | qwen/qwen3.5-plus-20260420 | 128 | $0.40 | $2.40 | 1,000,000 |
| 2 | Qwen: Qwen3.6 Flash | qwen/qwen3.6-flash | 128 | $0.25 | $1.50 | 1,000,000 |
| 3 | Xiaomi: MiMo-V2.5 | xiaomi/mimo-v2.5 | 128 | $0.40 | $2.00 | 1,048,576 |
| 4 | Qwen: Qwen3.6 Plus | qwen/qwen3.6-plus | 128 | $0.33 | $1.95 | 1,000,000 |
| 5 | xAI: Grok 4.20 | x-ai/grok-4.20 | 128 | $2.00 | $6.00 | 2,000,000 |
| 6 | OpenAI: GPT-5.4 Nano | openai/gpt-5.4-nano | 128 | $0.20 | $1.25 | 400,000 |
| 7 | OpenAI: GPT-5.4 Mini | openai/gpt-5.4-mini | 128 | $0.75 | $4.50 | 400,000 |
| 8 | OpenAI: GPT-5.4 | openai/gpt-5.4 | 128 | $2.50 | $15.00 | 1,050,000 |
| 9 | Google: Gemini 3.1 Flash Lite Preview | google/gemini-3.1-flash-lite-preview | 128 | $0.25 | $1.50 | 1,048,576 |
| 10 | Qwen: Qwen3.5-Flash | qwen/qwen3.5-flash-02-23 | 128 | $0.07 | $0.26 | 1,000,000 |
| 11 | Google: Gemini 3.1 Pro Preview Custom Tools | google/gemini-3.1-pro-preview-customtools | 128 | $2.00 | $12.00 | 1,048,576 |
| 12 | OpenAI: GPT-5.3-Codex | openai/gpt-5.3-codex | 128 | $1.75 | $14.00 | 400,000 |
| 13 | Google: Gemini 3.1 Pro Preview | google/gemini-3.1-pro-preview | 128 | $2.00 | $12.00 | 1,048,576 |
| 14 | Qwen: Qwen3.5 Plus 2026-02-15 | qwen/qwen3.5-plus-02-15 | 128 | $0.26 | $1.56 | 1,000,000 |
| 15 | Google: Gemini 3 Flash Preview | google/gemini-3-flash-preview | 128 | $0.50 | $3.00 | 1,048,576 |
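
Because every listed model has the same score, per-note cost and context headroom are the practical ways to compare them. The sketch below estimates both from the prices and context lengths listed above. The token counts in the usage note (a 6,000-token note, a 500-token summary) are hypothetical, the helper names are ours, and the context check assumes the window must hold input and output together; real billing and limits may differ by provider.

```python
from decimal import Decimal
from typing import NamedTuple


class Listing(NamedTuple):
    price_in: Decimal    # USD per 1M input tokens
    price_out: Decimal   # USD per 1M output tokens
    context: int         # context window, in tokens


# A few rows copied from the table above (not the full list).
LISTINGS = {
    "qwen/qwen3.5-flash-02-23": Listing(Decimal("0.07"), Decimal("0.26"), 1_000_000),
    "qwen/qwen3.5-plus-20260420": Listing(Decimal("0.40"), Decimal("2.40"), 1_000_000),
    "openai/gpt-5.4-nano": Listing(Decimal("0.20"), Decimal("1.25"), 400_000),
    "openai/gpt-5.4": Listing(Decimal("2.50"), Decimal("15.00"), 1_050_000),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(model_id: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimated USD cost of one summarization request."""
    row = LISTINGS[model_id]
    total = row.price_in * input_tokens + row.price_out * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


def fits_context(model_id: str, input_tokens: int, output_tokens: int) -> bool:
    """Assumption: the context window must hold input and output together."""
    return input_tokens + output_tokens <= LISTINGS[model_id].context


if __name__ == "__main__":
    # Hypothetical workload: a 6,000-token note and a 500-token summary.
    for model_id in LISTINGS:
        cost = estimate_cost(model_id, 6_000, 500)
        print(f"{model_id}: ${cost:.6f} per note")
```

Under those assumed token counts, one note comes to $0.0036 on qwen/qwen3.5-plus-20260420 and $0.0225 on openai/gpt-5.4. Both share the same score, so for this workload the choice comes down to price and to the benchmark results the page recommends checking.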

How we ranked these

For Medical Note Summarization, we weight models on reasoning quality, context window, and structured output. A higher score means a better fit. Scores combine each model's public metadata (context length, modality support, tool calling, structured output, reasoning capability) with live pricing.
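
A capability-matched score of this kind can be sketched as a weighted sum over declared specs. The page does not publish its formula here, so everything below is an assumption made for illustration: the weights, the context cap, the price tie-breaker, the `ModelSpec` fields, and the two example models are all hypothetical and do not reproduce the site's scores.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    model_id: str
    context_tokens: int
    reasoning: bool
    structured_output: bool
    tool_calling: bool
    price_in: float    # USD per 1M input tokens
    price_out: float   # USD per 1M output tokens


# Hypothetical weights: reasoning, structured output, and context dominate,
# mirroring the three criteria the task emphasizes.
W_REASONING = 40
W_STRUCTURED = 30
W_CONTEXT = 30
W_TOOLS = 10
CONTEXT_CAP = 400_000  # hypothetical: full context credit at or above this size


def fit_score(spec: ModelSpec) -> float:
    """Weighted sum of declared capabilities; higher means a better fit."""
    score = 0.0
    if spec.reasoning:
        score += W_REASONING
    if spec.structured_output:
        score += W_STRUCTURED
    if spec.tool_calling:
        score += W_TOOLS
    # Context credit grows linearly and saturates at the cap.
    score += W_CONTEXT * min(spec.context_tokens / CONTEXT_CAP, 1.0)
    return score


def rank(specs: list[ModelSpec]) -> list[ModelSpec]:
    """Sort by score (descending); break ties on combined price (ascending)."""
    return sorted(specs, key=lambda s: (-fit_score(s), s.price_in + s.price_out))


if __name__ == "__main__":
    # Made-up models, not entries from the ranking.
    examples = [
        ModelSpec("example/small", 200_000, False, True, True, 0.10, 0.40),
        ModelSpec("example/large", 1_000_000, True, True, True, 2.00, 8.00),
    ]
    for spec in rank(examples):
        print(spec.model_id, fit_score(spec))
```

In a scheme like this one, the capped components mean that any model meeting every threshold receives the same maximum score, so the tie-breaker decides the final order.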

Related tasks