MiniMax M2.7 Ultraspeed
400+ Tokens/Second. Fully EU Sovereign.
The latest 229B parameter frontier model from MiniMax, running on Infercom's fully EU sovereign infrastructure in Germany. No US hyperscalers. No CLOUD Act exposure. Native multi-agent support, 30% coding improvement over M2.5.
New in M2.7
M2.7 adds built-in self-critique for better first-attempt results, plus native multi-agent orchestration as a core capability.
Built-in Self-Critique
New in M2.7: The model automatically reviews and refines its outputs before responding. Better first-attempt results on complex coding and reasoning tasks.
Native Agent Teams
New in M2.7: Multi-agent collaboration as a core model capability - not just prompting. Stable role boundaries, adversarial reasoning, and behavioral differentiation built in.
Improved: Software Engineering
56.22% on SWE-Pro (matching GPT-5.3-Codex), 55.6% on VIBE-Pro for full project delivery, 57.0% on Terminal Bench 2 for complex system understanding.
Improved: Professional Skills
97% skill compliance across 40+ complex professional skills. Enhanced Excel, PowerPoint, and Word editing with multi-round revision support.
400+
Tokens per second on EU infrastructure
30%
Coding performance improvement over M2.5
Frontier Benchmark Performance
M2.7 achieves top-tier scores across coding, engineering, and ML benchmarks - matching or exceeding GPT-5.3-Codex on SWE-Pro.
SWE-Pro
56.22%
Matches GPT-5.3-Codex
SWE Multilingual
76.5%
Real-world engineering
VIBE-Pro
55.6%
Near Opus 4.6 level
Terminal Bench 2
57.0%
Deep system understanding
MLE Bench Lite
66.6%
Medal rate (9 gold, 5 silver)
Multi SWE Bench
52.7%
Complex codebases
Built for Agentic Workflows
M2.7 excels at long-horizon agent tasks that require autonomous decision-making, tool use, and multi-step reasoning across complex professional domains.
Financial Workflows
End-to-end research, Excel modeling, report generation
SRE & DevOps
Log analysis, incident response, production debugging
Document Processing
Word, Excel, PowerPoint with multi-round editing
ML Competitions
66.6% medal rate on MLE Bench Lite (9 gold, 5 silver)
Agentic Coding
56.22% on SWE-Pro
Matches GPT-5.3-Codex on the most demanding software engineering benchmark. Works with Aider, OpenCode, Cline, Cursor, Continue, Goose, Windsurf, and Claude Code.
Set up agentic codingMulti-Agent Teams
Native Agent Orchestration
Internalized multi-agent collaboration as a native capability. Stable identity across roles, enhanced emotional intelligence, and robust causal reasoning for production decisions.
Frontier Performance, Fraction of the Cost
Run a 229B parameter frontier model at open-source pricing.
| Model | Input (per 1M) | Output (per 1M) | Relative Cost |
|---|---|---|---|
| Claude Opus 4.6 | $15.00 | $75.00 | ~30x more |
| Claude Sonnet 4.6 | $3.00 | $15.00 | ~6x more |
| MiniMax M2.7 Ultraspeed | €0.60 | €2.40 | Baseline |
Pricing as of May 2026.
Why EU Hosting Matters
Running MiniMax through Infercom means full EU sovereignty - the fastest inference in Europe with zero foreign jurisdiction exposure:
- Your data is processed exclusively in Germany - never leaves EU jurisdiction
- No US CLOUD Act exposure - no American hyperscaler involvement
- Full GDPR compliance with EU-based Data Processing Agreement
- ISO 27001 certified infrastructure owned and operated by Infercom
- Zero data retention - we never train on your data
Start Building in Minutes
from openai import OpenAI
client = OpenAI(
api_key="your-infercom-key",
base_url="https://api.infercom.ai/v1"
)
response = client.chat.completions.create(
model="MiniMax-M2.7",
messages=[{"role": "user", "content": "Your prompt here"}],
max_tokens=4096
)
print(response.choices[0].message.content)