GLM5.2 on AMD MI355X at 2626 tok/s/node at over 2x lower cost than Blackwell

July 4, 2026

2 min read 2

The AMD MI355X just delivered a major blow to the competition in the open-source large language model (LLM) space, as GLM5.2 achieved a mind-boggling 2626 tokens per second (tok/s) per node. That’s more than double the performance of Blackwell, a previously popular open-source LLM. What’s more, this AMD-powered behemoth comes with a price tag that’s over two times lower than its rival.

Open-Source Showdown

The LLM landscape has changed dramatically in recent months. Models like Claude, Fable, and Minimax M3 have raised the bar for enterprise applications, and companies are struggling to keep up with the demand for high-performance inference capabilities. This is where AMD’s MI355X comes in – a highly-efficient GPU that’s perfectly suited for the most compute-intensive tasks.

Achieving Unprecedented Performance

GLM5.2, developed by a team of anonymous contributors, has been pushing the boundaries of what’s possible with open-source LLMs. Its impressive performance is largely due to the MI355X’s optimized architecture, which provides a significant boost to the model’s throughput. The results are nothing short of astonishing: 2626 tok/s per node is an impressive feat, making GLM5.2 one of the fastest open-source LLMs currently available.

What This Means

What this means for businesses and developers is that they can now access high-performance LLM capabilities without breaking the bank. The reduced cost of GLM5.2 is a significant advantage, especially for smaller organizations that might have been priced out of the market by more expensive alternatives. With the demand for inference continuing to rise, it’s likely that we’ll see even more innovative solutions emerge in the coming months.

Open-Source Showdown

Achieving Unprecedented Performance

What This Means

Related Articles

SNAP restrictions could change what shoppers buy — and food giants are watching

ZTE wins three Selular Award 2026 honors for AI-powered network innovation

Heavy AI Spenders Are Adding Workers, Not Cutting Them

EyePromise Introduces Heyedrate Clinical, Gamechanger in Dry Eye Hydration

Leave a Comment Cancel Reply