Interesting updates on Apple AI: As Apple's WWDC lands next month, and the long-delayed Siri and on-device AI upgrades are expected to be the centerpiece: a smaller, distilled version of Google's Gemini running locally on iPhone silicon, pitched on privacy and lower token costs.
Most of that stack is sourced from elsewhere. The local model is distilled from Gemini. Queries too heavy for the device route to Google Cloud (!), where Apple has now signed off on Nvidia's confidential-compute tech to process them. Apple is also reportedly hunting for small on-device-AI startups to speed up the model-shrinking work, having looked at Liquid AI among others.
One quiet shift from the 2024 rollout: Apple promised then that anything leaving your iPhone would run on Apple silicon inside Private Cloud Compute. It couldn't get the full Gemini running there, so those queries now sit in Google Cloud. The Private Cloud Compute name is staying anyway. Via The Information