[AI]■ STORY TIMELINE
GOOGLE SPEEDS UP GEMMA 4 WITH MULTI-TOKEN PREDICTION
Google has introduced multi-token prediction drafters for Gemma 4, a technique that accelerates inference speed by enabling the model to generate multiple tokens simultaneously rather than one at a time.
Hacker News+0m
Article URL: https://blog.google/innovation-and-ai/technology/developers-tools/multi-token-prediction-gemma-4/ Comments…