MiniMax M3

MiniMax M3 is MiniMax's first model with a 1.0M-token context window and native multimodal input. It targets software engineering, terminal-based tool use, and agentic web browsing, with a maximum output of 1.0M tokens per request. Your use is subject to MiniMax's Terms & Privacy Policies.

Implicit Caching, Reasoning, Tool Use, Vision (Image)

Use with AI Gateway

index.ts

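import { streamText } from 'ai'

// The model is addressed by its AI Gateway ID string.
const result = streamText({
  model: 'minimax/minimax-m3',
  prompt: 'Why is the sky blue?'
})

// Print the response incrementally as text chunks arrive.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}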


More models by MiniMax

Model	Context	Latency	Throughput	Input	Output	Cache Read	Cache Write	Web Search	Release Date
minimax/minimax-m2.7	205K	0.8s	163 tps	$0.30/M (Fast: $0.60/M)	$1.20/M (Fast: $2.40/M)	$0.06/M	$0.38/M	—	03/18/2026
minimax/minimax-m2.7-highspeed	205K	1.1s	53 tps	$0.60/M	$2.40/M	$0.06/M	$0.38/M	—	03/18/2026
minimax/minimax-m2.5		0.6s	105 tps	$0.07/M (Fast: $0.60/M)	$0.57/M (Fast: $2.40/M)	$0.03/M	$0.38/M	—	02/12/2026
minimax/minimax-m2.5-highspeed	205K	1.1s	73 tps	$0.60/M	$2.40/M	$0.03/M	$0.38/M	—	02/12/2026
minimax/minimax-m2.1	205K	0.7s	113 tps	$0.30/M	$1.20/M	$0.03/M	$0.38/M	—	12/23/2025
minimax/minimax-m2	205K	1.1s	62 tps	$0.30/M	$1.20/M	$0.03/M	$0.38/M	—	10/27/2025
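
Because the prices above are quoted per million tokens, a rough per-request cost can be estimated with simple arithmetic. The sketch below uses the standard (non-Fast) minimax/minimax-m2.7 rates from the listing. The `Pricing` and `Usage` types and the `estimateCostUSD` helper are hypothetical names introduced for illustration, and the billing model is an assumption: cached prompt tokens are charged at the cache read rate instead of the input rate, and cache write charges are ignored. Actual billing may differ.

```typescript
// Per-million-token prices in USD (hypothetical shape, values from the listing).
interface Pricing {
  inputPerM: number
  outputPerM: number
  cacheReadPerM: number
}

// Token counts for one request (hypothetical shape, not an SDK type).
interface Usage {
  inputTokens: number       // prompt tokens not served from cache
  cachedInputTokens: number // prompt tokens served from cache
  outputTokens: number
}

// Standard-tier rates for minimax/minimax-m2.7 as listed above.
const M27_STANDARD: Pricing = {
  inputPerM: 0.30,
  outputPerM: 1.20,
  cacheReadPerM: 0.06
}

// Assumption: cached tokens are billed at the read rate only;
// cache write charges are left out of this estimate.
function estimateCostUSD(usage: Usage, p: Pricing): number {
  const perToken = (pricePerM: number) => pricePerM / 1_000_000
  return (
    usage.inputTokens * perToken(p.inputPerM) +
    usage.cachedInputTokens * perToken(p.cacheReadPerM) +
    usage.outputTokens * perToken(p.outputPerM)
  )
}

const cost = estimateCostUSD(
  { inputTokens: 100_000, cachedInputTokens: 50_000, outputTokens: 10_000 },
  M27_STANDARD
)
console.log(`Estimated cost: $${cost.toFixed(4)}`)
```

For this example the three terms are $0.03 (input), $0.003 (cache reads), and $0.012 (output), about $0.045 in total.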
