Skip to main content
Refrase
  • Pricing
Star
← All models

Magistral Medium

Mistral · mistral family · Official docs

Magistral Medium 1.2 is Mistral's answer to the reasoning model trend — a dedicated chain-of-thought model with explicit [THINK]/[/THINK] tokens that make reasoning traces inspectable and debuggable. This transparency is a significant advantage for Refrase's prompt optimization workflow: users can see exactly how the model reasons and tune prompts to guide that reasoning. The 131K max output tokens is generous for reasoning-heavy tasks. The key limitation is the proprietary nature — unlike Mistral Large 3, this model cannot be self-hosted, locking users into Mistral's API. The 40K optimal context threshold is notably lower than the 128K advertised maximum, so Refrase should flag this discrepancy in recommendations. At $2/$5 per million tokens, it sits in the premium tier — best reserved for high-value reasoning tasks where the thinking trace visibility justifies the cost premium over alternatives.

Try Refrase on a Magistral Medium prompt

Paste any prompt — Refrase rewrites it using Magistral Medium's documentation as context. 4–7 seconds end-to-end.

Open in /enhanceTry Guided mode

Specifications

128K
Context window
131K
Max output
$2 / $5
Per 1M tokens (in/out)
Mistral API (La Plateforme) 'Premier' tier pricing. Not open-weight — proprietary model. Magistral Small 1.2 is available as a cheaper open-weight alternative at $0.50 input / $1.50 output. (source: Mistral Docs, Models overview page)

Strengths

analysiscode

Key capabilities

  • ✓Frontier-class reasoning model with dedicated [THINK] and [/THINK] tokens for chain-of-thought traces (source: apidog.com, 'Magistral Small 1.2 and Magistral Medium 1.2 are here')
  • ✓Multimodal: visual encoder for image analysis and understanding (source: Mistral Docs, Magistral Medium 1.2 page)
  • ✓Native function calling, structured outputs, and agent conversations with built-in tools (source: Mistral Docs, Magistral Medium 1.2 page)
  • ✓Document AI: OCR with structured annotations, bounding box extraction, and document Q&A (source: Mistral Docs, Magistral Medium 1.2 page)
  • ✓Audio transcription with timestamps (source: Mistral Docs, Magistral Medium 1.2 page)
  • ✓Fill-in-the-Middle (FIM) for code generation and predicted outputs (source: Mistral Docs, Magistral Medium 1.2 page)
  • ✓25+ language support including French, German, Arabic, Japanese, and Chinese (source: apidog.com, 'Magistral Small 1.2 and Magistral Medium 1.2 are here')

Known limitations

  • ⚠Proprietary model — weights not publicly available, no self-hosting option (unlike Mistral Large 3 and Magistral Small) (source: Mistral Docs, Models overview page)
  • ⚠Context window optimal performance under 40K tokens despite 128K maximum — quality may degrade at longer contexts (source: apidog.com, 'Magistral Small 1.2 and Magistral Medium 1.2 are here')
  • ⚠Exact parameter count and architecture undisclosed — limited transparency compared to open-weight alternatives (source: apidog.com, 'Magistral Small 1.2 and Magistral Medium 1.2 are here')
  • ⚠Premium pricing ($2/1M input, $5/1M output) — 4x the cost of Mistral Large 3 input and 3.3x output, reflecting reasoning compute overhead (source: Mistral Docs, Models overview page)

How to prompt Magistral Medium

Preferred instruction format

Standard chat format with system/user/assistant roles. Reasoning traces enclosed in [THINK] and [/THINK] tokens for developer inspection. API model name: magistral-medium-2509 or magistral-medium-latest.

Recommended practices

  • Use temperature=0.7, top_p=0.95, max_tokens=131072 as recommended sampling parameters (source: apidog.com, 'Magistral Small 1.2 and Magistral Medium 1.2 are here')
  • Leverage [THINK]/[/THINK] tokens to inspect reasoning traces for debugging and quality assurance (source: apidog.com, 'Magistral Small 1.2 and Magistral Medium 1.2 are here')
  • Use for reasoning-intensive tasks where chain-of-thought quality matters more than raw speed (source: Mistral Docs, Models overview page)
  • Apply the same Mistral prompt engineering best practices: explicit role definition, hierarchical structure, Markdown/XML formatting (source: Mistral Docs, Prompting Capabilities)
  • Keep context under 40K tokens for optimal reasoning quality (source: apidog.com, 'Magistral Small 1.2 and Magistral Medium 1.2 are here')

Anti-patterns to avoid

  • Do not use for simple tasks where reasoning overhead is unnecessary — Mistral Large 3 or smaller models are more cost-effective (source: Mistral Docs, Models overview page)
  • Avoid same anti-patterns as Mistral Large 3: vague quantifiers, ambiguous descriptors, contradictory rules, numeric scales (source: Mistral Docs, Prompting Capabilities)
  • Do not exceed 40K tokens of context if reasoning quality is critical — performance is optimal below this threshold (source: apidog.com, 'Magistral Small 1.2 and Magistral Medium 1.2 are here')

Sources

  • https://docs.mistral.ai/models/magistral-medium-1-2-25-09
  • https://docs.mistral.ai/getting-started/models
  • https://docs.mistral.ai/capabilities/completion/prompting_capabilities
  • https://apidog.com/blog/magistral-small-1-2-magistral-medium-1-2/

Skip the manual application.

Refrase reads everything above and applies it for you. Try it on one of your own prompts.

Open /enhance with Magistral Medium
Refrase

Your prompts, upgraded.

Product

  • Enhance
  • Extension
  • API
  • MCP

Research

  • Papers
  • Methodology
  • Benchmarks
  • Models

Company

  • Blog
  • Changelog
  • Pricing
  • Docs
  • GitHub
Privacy Policy·Terms of Service·All Systems Operational

© 2026 Refrase. All rights reserved.