Databricks' Yuchen Jin says system-level optimizations pushed GLM-5.2 inference to a leaderboard-topping 392 tokens/s · Digg