Gemma 4 Multi-Token Prediction support merges into llama.cpp, offering a 2x speedup for local dense models · Digg