Z.ai Ships GLM-5.2 Coding Model With 1M Context, No Benchmarks

Z.ai released GLM-5.2, a flagship coding model with a usable 1-million-token context, shipping it to paying subscribers first while withholding benchmarks, an API and open weights.

By Daniel Mercer Edited by Maria Konash Published: Jun 15, 2026 at 12:55 pm UTC

Z.ai releases GLM-5.2, a flagship coding model with a usable 1-million-token context, to its paid Coding Plan. Image: Compagnons / Unsplash

Z.ai released GLM-5.2, calling it the new flagship of its GLM coding family and making it available the same day to every tier of its paid GLM Coding Plan. Z.ai is the international brand of Zhipu AI, a Beijing company spun out of Tsinghua University that listed in Hong Kong in January and has been shipping models at a fast clip. The headline feature is a usable 1-million-token context window, addressed through the glm-5.2[1m] model id, with maximum output of 131,072 tokens. The model offers two reasoning-effort levels, High and Max, and Z.ai recommends Max for coding.

What stands out is the order of the rollout. The model reached developers first, but the standalone API, the Z.ai chatbot and the open-source weights were all promised for next week, and Z.ai published no benchmarks at launch. That reverses the usual sequence, where proof precedes distribution.

The release is aimed squarely at existing subscribers, who can switch to GLM-5.2 in coding agents like Claude Code, Cline and OpenClaw with a single change to their model settings. Z.ai says the model was trained with a new asynchronous agent reinforcement-learning method built for stability across long chains of reasoning, and tested across more than 10,000 environments in nine programming languages.

Without numbers, the performance claims are vendor descriptions, not measured results. Any figure circulating now almost certainly belongs to earlier models. GLM-5, the family’s base, is credited with 77.8% on SWE-bench Verified, the top open-source score at the time, while GLM-5.1 self-reported roughly 94.6% of Claude Opus 4.6’s coding score.

That track record buys GLM-5.2 some benefit of the doubt, though Z.ai has not confirmed its architecture or parameter count. The company also pledged to open-source the weights next week under the permissive MIT License, a step up in openness from GLM-5’s Apache-2.0, and framed the launch with the line that “the future of AI is open, and it belongs to the people.”

What the Launch Reveals

The distribution-first approach tells you who the release is for. The GLM Coding Plan starts around $18 a month and meters usage in prompts rather than tokens, with each prompt estimated to trigger 15 to 20 model calls. Every tier, from Lite to Team, gets GLM-5.2; the tiers differ in how much usage they allow before hitting rolling and weekly caps.

For a current subscriber, trying the model costs an environment-variable change and an evening, which is the point. Some developers, though, called the paid-plan-first rollout at odds with the open framing, since the free web chat still runs the older GLM-5.1 and outsiders have nothing to test yet.

A Crowded Field

GLM-5.2 lands in one of the busiest stretches the coding-model market has seen, and notably on the same day the US barred foreign access to Anthropic’s most advanced models, sharpening the appeal of open weights.

Anthropic’s Claude Opus 4.8 anchors the closed frontier with a mature agent ecosystem and a 1M context. Moonshot shipped the open-weight Kimi K2.7-Code a day earlier, and Alibaba’s Qwen 3.7 Max posted closed-model benchmark wins in May. Z.ai’s bet is that shipping capable models under permissive licenses, fast, pressures closed rivals to justify premium pricing. Whether GLM-5.2 lives up to its billing will not be clear until the API, weights and independent benchmarks arrive.