Google released DiffusionGemma, a 26B MoE experimental open model using text diffusion instead of token-by-token autoregressive decoding. It can generate blocks of text in parallel, reaching up to 4x faster output on dedicated GPUs. The model targets local, speed-sensitive workflows, but Google says its output quality is below standard Gemma 4 and recommends Gemma 4 for quality-critical production use.
Microsoft announced at Computex 2026 that Windows 11 has surpassed one billion users, framing the milestone as a base for its next PC strategy. This fall, AI laptops powered by NVIDIA RTX Spark are expected to arrive, emphasizing local inference. Microsoft also plans broader mainstream hardware upgrades to prepare Windows PCs for future AI agent workflows.