A r/LocalLLaMA post notes that Gemma 4’s chat template now has “preserve thinking.” The linked discussion points to google/gemma-4-31B-it on Hugging Face, suggesting a template-level change rather than a new model release or benchmark. The original post does not provide detailed usage notes, defaults, compatibility information, or measured effects.
With the release of Qwen-3, Hugging Face's official blog published an in-depth breakdown of its chat template. Chat templates are the critical bridge…