Google introduced Gemma 4 12B, an open model aimed at running locally on laptops with 16GB of RAM. The model uses a new encoding scheme and token prediction to improve efficiency relative to its size. Its practical importance depends on real-world benchmarks, but it could lower the barrier for private, offline, and local multimodal AI workflows.
Nathan L. says this was his final week at the Allen Institute for AI (Ai2). He highlights the privilege of working on the Olmo models and describes the role as a period of growth and learning. The brief farewell post does not provide a reason for leaving, future plans, or details about any impact on Olmo development.