AI Weekly Digest -- May 17-May 24, 2026
Note: This post was generated by AI. Each week, I use an automated pipeline to collect and synthesize the latest AI news from blogs, newsletters, and podcasts into a single digest. The goal is to keep up with the most important AI developments from the past week. For my own writing, see my other posts. TL;DR OpenAI’s model solved an 80-year-old math problem using original reasoning, not a specialized math tool, suggesting AI is approaching genuine research-level thinking across domains. Anthropic’s Project Glasswing found 10,000+ critical software vulnerabilities in one month using its Claude Mythos model, including bugs in Firefox and infrastructure used by billions of devices. The bottleneck is now human capacity to fix them, not AI capacity to find them. Google I/O delivered a major AI push: Gemini 3.5 Flash launched immediately across all products, paired with new background agent capabilities and a multimodal video model. Google processes 7x more AI tokens than a year ago. Anthropic signed 276,000-person deals with KPMG and PwC in the same week, signaling that large professional services firms are moving from AI pilots to firm-wide deployments. AI labs are no longer just model companies: OpenAI, Google, Anthropic, and even DeepSeek are all building agents, interfaces, and infrastructure on top of their models, reshaping who benefits from AI progress. Story of the Week: AI Finds Security Holes Faster Than Humans Can Patch Them Anthropic’s Project Glasswing crossed a threshold this week that matters to anyone whose organization depends on software. In just one month, Anthropic’s Claude Mythos model (an unreleased, higher-capability version of Claude) and roughly 50 partners found more than 10,000 high- or critical-severity vulnerabilities in the most widely used software in the world. Cloudflare alone found 2,000 bugs, with a false-positive rate better than human testers. Mozilla found 271 vulnerabilities in Firefox using Mythos, more than ten times what it found in the previous version using an older model. ...