In a bold move reshaping the global AI landscape, Chinese startup Z.ai has launched GLM-4.5, an open-source model touted as cheaper, smaller, and more efficient than rivals like DeepSeek.
The announcement, made at the World Artificial Intelligence Conference in Shanghai, has sent ripples across the tech sector.
What sets GLM-4.5 apart is its lean architecture. Requiring just eight Nvidia H20 chips—custom-built to comply with U.S. export restrictions—it slashes operating costs dramatically.
By comparison, DeepSeek’s model demands nearly double the compute power, making GLM-4.5 a tantalising alternative for cost-conscious developers and enterprises.
But the savings don’t stop there. Z.ai revealed that it will charge just $0.11 per million input tokens and $0.28 per million output tokens. In contrast, DeepSeek R1 costs $0.14 for input and a hefty $2.19 for output, putting Z.ai firmly in the affordability lead.
Functionally, GLM-4.5 leverages ‘agentic’ AI—meaning it can deconstruct tasks into subtasks autonomously, delivering more accurate results with minimal human intervention.
This approach marks a shift from traditional logic-based models and promises smarter integration into coding, design, and editorial workflows.
Z.ai, formerly known as Zhipu, boasts an impressive funding roster including Alibaba, Tencent, and state-backed municipal tech funds.
With IPO ambitions on the horizon, its momentum mirrors China’s broader push to dominate the next wave of AI innovation.
While the U.S. has placed Z.ai on its entity list, stifling some Western partnerships, the firm insists it has adequate computing resources to scale.
As AI becomes a battleground for technological and geopolitical influence, GLM-4.5 may prove to be a powerful competitor.
But it has some way yet to go.