Name: MiniMax M3 API
Brand: MiniMax

MiniMax M3

MiniMax M3 is a flagship reasoning model from MiniMax with a 1M token context window. Designed for agentic workflows, large document analysis, coding, and complex multi-step reasoning tasks.

What MiniMax M3 Is

MiniMax M3 is the flagship reasoning model from MiniMax, built for agentic workflows, long-document processing, coding tasks, and complex multi-step reasoning at scale. It supports a context window of up to 1 million tokens — one of the largest available in production — and introduces multimodal input support alongside a reasoning architecture designed for frontier-tier performance.

Where previous MiniMax models excelled at conversational tasks and general instruction-following, M3 was purpose-built for environments where context depth, tool-use reliability, and sustained reasoning across long sessions actually matter. It's the first MiniMax model explicitly positioned at the agentic frontier.

Key Specs

Model ID: minimax/minimax-m3
Context window: 1,000,000 tokens
Model type: Chat + Reasoning
Multimodal: Yes (text + image input)
Tool calling: OpenAI-compatible function calling

MiniMax M3 Pricing

MiniMax M3 (≤512K input tokens):

Input: $0.39 per 1M tokens
Output: $1.56 per 1M tokens
Cache Read: $0.078 per 1M tokens

Note: Current pricing is a MiniMax promo price (7-day 50% off).

MiniMax M3 (>512K input tokens):

Input: $1.56 per 1M tokens
Output: $6.24 per 1M tokens
Cache Read: $0.312 per 1M tokens

Note: Input tokens above 512K are currently available in limited mode. Public availability is expected later.

What M3 Is Actually Built For

The 1M context window is not a marketing number — it changes what kinds of tasks are actually feasible. M3 was designed around workflows where the bottleneck is context capacity, not model intelligence: entire codebases, full research papers, lengthy conversation histories, large tool-call chains.

Agentic Workflows

Plan, execute, and iterate across multi-step tasks without losing coherence over long sessions. M3's reasoning architecture handles task decomposition, tool selection, and self-correction loops across hundreds of turns — the kind of sustained performance that breaks down in shorter-context models.

Code Generation & Engineering

Generate, review, and refactor code with full awareness of large codebases. Feed in entire repositories or multi-file projects without chunking. Useful for code migration, security audits, architecture reviews, and CI-integrated generation pipelines.

Long-Document Understanding

Feed in full contracts, research papers, financial reports, or technical specifications — not summaries, the actual documents. M3's 1M token window makes it one of the few models where "just send the whole thing" is a real strategy, not a workaround.

Multimodal Reasoning

Accepts image input alongside text, enabling document parsing with embedded figures, diagram interpretation, screenshot-based debugging, and visual context in agentic pipelines.

How It Stacks Up

M3 is positioned against Claude Opus 4.6, GPT-5, and Gemini 2.5 Pro in the reasoning-and-agentic tier. Its primary differentiators are context length and cost: at $0.39/$1.56 input/output for the base tier, it offers frontier-class context at a lower price point than most comparable models. Where it may trail is in narrow domain depth — specialized medicine, law, and finance benchmarks favor models with more training weight in those verticals.

Who Should Use M3 via API?

// 01 Agent Framework DevelopersIf you're building multi-step agent systems that need to hold large tool call histories, system prompts, and conversation context simultaneously — M3's 1M window removes the context management overhead that breaks most agent loops at scale.

// 02 Legal & Contract AnalysisOrganizations processing high-volume contracts, compliance documents, or regulatory filings benefit from sending full documents rather than chunking. M3 handles full-length legal documents as single-pass inputs.

// 03 Research & Data PipelinesTeams running evaluation pipelines, literature reviews, or large-scale data extraction workflows can feed M3 entire datasets or document collections in a single request.

// 04 Coding Infrastructure TeamsCodebases that are too large for standard context windows become tractable with M3. Repository-level code review, cross-file refactoring, and full-codebase Q&A are practical use cases at the 1M token level.

// 05 Cost-Conscious Frontier UsersFor teams currently paying Opus 4.6 or GPT-5 rates for reasoning-heavy workloads, M3's base tier pricing at $0.39 input / $1.56 output makes it worth a direct comparison before committing to higher-cost alternatives.

What You Should Know Before Committing

M3's >512K token tier is currently available in limited mode — if your use case depends on consistently sending inputs above 512K tokens, confirm availability before building a production dependency. Tool calling support is expected to be OpenAI-compatible but should be validated in your specific environment before relying on it in automated pipelines.

‍

Example H2

Try it now

What MiniMax M3 Is

Key Specs

Model ID: minimax/minimax-m3
Context window: 1,000,000 tokens
Model type: Chat + Reasoning
Multimodal: Yes (text + image input)
Tool calling: OpenAI-compatible function calling

MiniMax M3 Pricing

MiniMax M3 (≤512K input tokens):

Input: $0.39 per 1M tokens
Output: $1.56 per 1M tokens
Cache Read: $0.078 per 1M tokens

Note: Current pricing is a MiniMax promo price (7-day 50% off).

MiniMax M3 (>512K input tokens):

Input: $1.56 per 1M tokens
Output: $6.24 per 1M tokens
Cache Read: $0.312 per 1M tokens

Note: Input tokens above 512K are currently available in limited mode. Public availability is expected later.

What M3 Is Actually Built For

Agentic Workflows

Code Generation & Engineering

Long-Document Understanding

Multimodal Reasoning

Accepts image input alongside text, enabling document parsing with embedded figures, diagram interpretation, screenshot-based debugging, and visual context in agentic pipelines.

How It Stacks Up

Who Should Use M3 via API?

What You Should Know Before Committing

‍

Try it now

MiniMax M3

MiniMax M3

500+ AI Models

The Best Growth Choice
for Enterprise

Our Clients' Voices

MiniMax M3

MiniMax M3

500+ AI Models

The Best Growth Choice for Enterprise

Our Clients' Voices

The Best Growth Choice
for Enterprise