OK, I tried GLM-5.2 and this is a good model.
Probably the first model good enough to eschew closed models from your workflow entirely (except if you need vision).
I know this won't run on your laptop, but what are the best current vllm/sglang serving recipes?



