BTC

ETH

HTX

SOL

BNB

ดูตลาด

简中

繁中

English

日本語

한국어

ภาษาไทย

Tiếng Việt

หน้าแรก

ข่าวด่วน

เจาะลึก

บทความ

ประเด็นร้อน

หัวข้อพิเศษ

กิจกรรม

มุมมอง

เครื่องมือ

ดาวน์โหลด

การตั้งค่า

More

เข้าสู่ระบบ

智谱 launches GLM-5.1 High-Speed API, achieving an output speed of 400 tokens/s

2026-05-22 03:19

Odaily reported that Zhipu has launched the GLM-5.1 High-Speed API for select enterprise customers, achieving a model output speed of 400 tokens/s, setting a new global record for end-to-end speed in official large model interfaces.

It is understood that this high-speed version, while retaining the capabilities of the original flagship model, is powered by a high-performance inference engine jointly developed by Zhipu and the TileRT team. The engine reduces kernel launch and memory read/write latency in traditional inference by reconstructing the GPU runtime scheduling mechanism, statically organizing the model into persistent engine kernels that reside on the GPU.

In multi-GPU scenarios, TileRT further specializes GPU nodes in an 8-card NVL topology into different functional workers to improve attention layer computation and cross-card communication efficiency.

Currently, this high-speed service has been made available to select enterprise customers of Zhipu's MaaS platform. In the future, the company will continue to optimize FP8 inference and ultra-long context capabilities, providing support for low-latency scenarios such as AI programming, real-time interaction, and real-time voice.

ลิงก์ต้นฉบับ

บทความแนะนำ

对话 CZ：อย่าคิดที่จะออกจากวงการคริปโท การเริ่มต้นใหม่ฉันก็ยังจะทำ exchange

支持率不足1%，BIP-110仍要将比特币推向软分叉？

MSX US Stock Daily Observations: Walsh's Hawkish Opener Rescinds Rate Cut Expectations

CLARITY Act Delay Has Become a Compliance Crisis, Not Merely a Political Stalemate

ค้นหา

ข่าวด่วน 24 ชั่วโมง

2026-07-17 12:31

荷兰加密平台Knaken被法院宣告破产，超700万欧元用户资产失踪引调查

2026-07-17 12:28

ECB warns: Widespread stablecoin adoption could erode bank deposit bases, accelerating the push for a digital euro

2026-07-17 12:22

David Sacks warns: Chinese AI model tops code test, US regulatory constraints may weaken AI competitiveness

2026-07-17 12:16

中际旭创获港交所批准拟上市募资80亿美元

2026-07-17 12:04

Abraxas Capital withdrew 12,477 ETH from CEX in the past 3 hours

2026-07-17 12:04

Peter Brandt: Nasdaq Futures Form Diamond Top Pattern, Bitcoin May Dip to $40,000 to Form a Bottom

ดาวน์โหลดแอพ Odaily พลาเน็ตเดลี่

ให้คนบางกลุ่มเข้าใจ Web3.0 ก่อน

Android

เกี่ยวกับเรา

ติดต่อเรา

ร่วมงานกับเรา

ลิงก์มิตรภาพ

ความร่วมมือด้านโฆษณา

ข้อตกลงผู้ใช้

นโยบายความเป็นส่วนตัว

ข้อสงวนสิทธิ์

ชุดสื่อแบรนด์ Odaily | โลโก้ทางการและแนวทางภาพลักษณ์

บริษัท ปักกิ่ง รุ่ยเค่อ คัลเจอร์ มีเดีย จำกัด

京ICP备 2026027382号

京公网安备11010502060861号