Ars Technica AI · May 28, 2026, 6:30 PM · Ryan Whitwam

Apple working to cram massive Gemini model into iPhone to power new Siri

Apple is reportedly trying to shrink Gemini for iPhone-based Siri, though cloud support may still be unavoidable.

Ars Technica reports that Apple is working to compress Google’s massive Gemini model so it can run on the iPhone and power a new Siri experience. The report stresses one key constraint: even with on-device ambitions, a cloud component is probably unavoidable. Details remain limited, so the report is best read as a signal about Apple’s AI direction rather than a confirmed product launch.

According to the report, Apple is attempting to "distill," or compress, Google's very large Gemini model so that it can run on the iPhone and power a new version of Siri. The headline states the goal directly: squeeze the massive Gemini into the iPhone to give Siri stronger AI capabilities. The article's summary adds an important caveat: even as Apple shrinks the model and moves computation closer to the device, a cloud component is very likely still unavoidable.


Summaries are AI-generated; the original article is authoritative.