Sources

Siri AI at WWDC 2026 simonwillison.net

Given how badly burned anyone who took Apple’s 2024 WWDC Apple Intelligence announcements at face value was, I’m holding to a strict “I’ll believe it when I see it” policy for everything they announced today . The new Siri AI features do at least look feasible with today’s technology, especially since Apple are licensing a custom Gemini-derived model that they can run on their own Private Cloud Compute . It sounds like they’ll be taking advantage of vision LLMs to extract information from the u…

The Open Source Community is backing OpenEnv for Agentic RL huggingface.co

The weather and climate science AI revolution isn’t revolutionary arstechnica.com

Machine learning has its limits—how is it being used?

References

Medium / Creative Compiler citing Ming-Chi Kuo medium.com

By using Gemini, Google effectively ‘sets the ceiling’ for Apple’s AI experience… if Apple cannot innovate on top of the model better than Google does on Android, it risks losing its premium differentiation.

Engadget engadget.com

Siri AI for iPhones and iPads will be delayed indefinitely in the EU… Apple’s proposed ‘Trusted System Agent’ and an 18-month phased rollout were rejected by regulators who insist any system-level AI access must be opened to third-party rivals like Gemini and OpenAI.

The Antitrust Attorney blog theantitrustattorney.com

The Gemini integration creates a second exclusive pipeline that is ‘exclusive in effect,’ even if not formally branded as such — foreclosing rivals from the most lucrative distribution channel, the iPhone’s native assistant.

Daring Fireball (John Gruber) daringfireball.net

Apple had ‘burned its reputation’ over previous years by announcing Siri features that frequently failed to materialize… [Gruber] specifically questioned the absence of ‘Siri AI Extensions’ — rumored tools that would allow users to swap Gemini for other assistants like Claude or ChatGPT.

Neowin neowin.net

Apple utilizes NVIDIA Confidential Computing on Blackwell B200 GPUs, Intel CPUs with TDX, and Google’s Titan security chips… Apple maintains a cryptographically verifiable ledger of all Google Cloud hardware in the PCC fleet to mitigate supply chain attacks.

MacRumors macrumors.com

Users must manually opt-in to a ‘Siri AI’ waitlist after installation… a common point of criticism is the lack of a system-level notification informing users that a waitlist exists; many updated to the beta and were confused when the new Siri interface failed to appear.

Turing blog (Calendar Gym benchmark) turing.com

agents achieved an 89% success rate on tasks with explicit identifiers, [but this] plummeted to 41% when faced with natural language ambiguity

Hugging Face blog — OpenEnv x Turing Calendar Gym huggingface.co

Calendar Gym… exposes over 25 MCP tools for scheduling and coordination tasks, testing an agent’s ability to handle permissions and partial information

Prime Intellect — Environments Hub launch primeintellect.ai

the ‘GitHub for RL environments,’ serving as a centralized marketplace for sharing and discovering train-ready tasks

DeepFabric — Introduction to OpenEnv deepfabric.dev

scaffolding a simple task can take minutes in an in-process framework like Gymnasium but may require multiple iterations and Docker configurations in HTTP-based frameworks like OpenEnv

huggingface/OpenEnv GitHub repo github.com

OpenEnv is in an ‘experimental stage’ with frequent API changes… contributors [should] coordinate significant changes with the technical committee to maintain API compatibility

pashpashpash Substack — ‘A response to everyone bashing evals’ pashpashpash.substack.com

Claude Code reportedly launched without traditional public evaluation frameworks, relying instead on domain-expert feedback and ‘vibes’… elaborate RL hubs can become an ‘echo chamber’ where popularity is prioritized over real-world utility

Physics World physicsworld.com

Physics-based models still beat AI for predicting extreme weather events

Karlsruhe Institute of Technology (Science Advances preprint repository) publikationen.bibliothek.kit.edu

AI models tended to predict less extreme values than what actually occurred… the magnitude of these errors grew larger as the record exceedance increased

University of Chicago Climate Institute — ‘Forecasting the Unseen: AI Weather Models and Gray Swan Extreme Events’ climate.uchicago.edu

AI models default to more moderate, statistically likely outcomes rather than the physically possible but rare extremes often termed ‘gray swans’

Towards Data Science — ‘Rethinking Environmental Costs of Training AI’ towardsdatascience.com

even when accounting for this heavy upfront training cost, AI models are estimated to consume at least 21 times less energy than traditional systems over a one-year operational cycle

Science Media Centre — expert reaction to NeuralGCM staging.sciencemediacentre.org

NeuralGCM demonstrated superior skill in predicting precipitation, particularly for extreme rainfall events in the top 0.1%… its physics-based core prevents drift, allowing realistic 15-day forecasts and multi-decadal climate simulations

Huawei Cloud blog — Pangu-Weather training details huaweicloud.com

Pangu-Weather’s training involved 192 NVIDIA V100 GPUs running for 16 days

Sources

References

Jack Sun, writing.