Published on 27.02.2026
TL;DR: Across three autonomous coding tasks (bug hunting, legacy refactoring, and greenfield API implementation), GLM-5 scored 90.5 out of 100 and MiniMax M2.5 scored 88.5. GLM-5 builds more comprehensively and tests more thoroughly, while MiniMax M2.5 follows instructions more carefully and finishes in half the time.
Link: MiniMax 2.5 vs. GLM-5 across 3 Coding Tasks [Benchmark & Results]