Accelerating Gemma 4: faster inference with multi-token prediction drafters

May 5, 2026 by kamal