final GLM 5.2 served stats:

~12000 unique api keys served
~300B tokens total
232 tok/s/gpu output average
431 tok/s/gpu output max sustained
2.1 sec TTFT average (1M ctx)
61 sec p95 TTFT (1M ctx)
81k tok average input size
41% cache hit rate
0 chat logs kept (don't be evil)

thanks again everyone and hopefully you found the service and tokens useful

Today marks the end of the free GLM 5.2 with ncode. I hope y'all enjoyed the tokens and found some of our tools useful.






