Micah Goldblum and collaborators open-source LCLMs, using latent context compression to deliver 8.8x faster long-context inference · Digg