Z.ai just announced GLM-5.2, the newest entry in its rapidly iterating GLM model family and its current coding flagship. The model is already live for all GLM Coding Plan subscribers, with a full API launch and open-source release under the MIT License arriving next week. For a company that only went public on the Hong Kong Stock Exchange in January 2026, the pace of releases is striking.

A lineage built for engineering, not just chat

To understand what GLM-5.2 is, you need to know what came before it. GLM-5 is a 744B-parameter Mixture-of-Experts model with 40B active parameters per token, roughly a 2x scale-up from GLM-4.5. It was trained entirely on Huawei Ascend chips using the MindSpore framework, with zero dependency on NVIDIA hardware. That model set the tone: open-weight, aggressively priced, and laser-focused on agentic coding tasks.

GLM-5.1, released in April 2026, was a post-training upgrade to GLM-5, built on the same 744B-parameter MoE architecture with a 200K token context window. The key improvement was sustained productivity in long-running tasks: where GLM-5 and many other models produce final output within a certain token budget, GLM-5.1 cycles through planning, execution, evaluation of intermediate results, and evaluation of its approach until it judges the task to be complete. On SWE-Bench Pro, GLM-5.1 achieved a score of 58.4, outperforming GPT-5.4, Claude Opus 4.6, and Gemini 3.1 Pro.

GLM-5.2 is the next step in that progression, and it brings a headline feature that the previous models lacked: a true 1M-token context window.

What's actually new

Alpha Signal

Don't miss what's next in AI

Join 300,000+ engineers and researchers who get the signal, not the noise.

  • Full access to in-depth AI research breakdowns
  • Be the first to know what's trending before it hits mainstream
  • Daily curated papers, repos, and industry moves