AI Weekly Digest -- April 05-April 12, 2026

Note: This post was generated by AI. Each week, I use an automated pipeline to collect and synthesize the latest AI news from blogs, newsletters, and podcasts into a single digest. The goal is to keep up with the most important AI developments from the past week. For my own writing, see my other posts.

TL;DR

- Anthropic unveiled Claude Mythos, a model that autonomously found critical security vulnerabilities in every major operating system and browser, then launched Project Glasswing, a $100M industry coalition to use those same capabilities defensively before bad actors can exploit them.
- Anthropic’s run-rate revenue hit $30B (up from $9B at end of 2025), with enterprise customers spending $1M+ annually doubling to 1,000 in under two months – a signal of how fast AI spending is accelerating inside large organizations.
- A major Microsoft Research report confirms AI is reshaping work faster than any prior technology, but benefits are uneven: experienced workers gain, junior roles are being automated away, and 40% of employees say they’ve received “workslop” – polished-looking AI output that isn’t accurate.
- MIT researchers project that AI will reach 80-95% success rates on most text-based work tasks by 2029 – not as sudden disruption but as a steady, broad rise that will touch nearly every knowledge worker role.
- Researchers at UC Berkeley showed that every major AI capability benchmark can be gamed to show near-perfect scores without solving a single task, meaning the numbers companies cite to justify AI purchases may be meaningless.

Story of the Week: Claude Mythos and the Cybersecurity Watershed

Anthropic this week disclosed Claude Mythos, a still-unreleased frontier model with an alarming capability: it found previously unknown critical security vulnerabilities in every major operating system and web browser, including a 27-year-old flaw in OpenBSD and a 16-year-old bug in FFmpeg that had survived five million automated tests. It did this largely autonomously, without human guidance. According to Anthropic’s Project Glasswing announcement, the model has already found thousands of such vulnerabilities. ...

April 12, 2026 · 9 min

How to Actually Improve Your AI Team Over Time

It’s Friday afternoon, and you are reviewing the week’s feedback logs with one of your agents. You notice that your Copywriter keeps using a formal tone for your social copy. Your Data Wizard keeps stating hypotheses as facts. Your Chief of Staff keeps forgetting to block 45 minutes for lunch when you plan your day each morning. To address the first issue, you need to update your Copywriter’s persona. The second issue is already covered in a team rule, but it needs to be sharpened. You can fix the last one in the Plan my day skill. Your agent makes the required changes so that next week you are less likely to hit these issues again. ...

April 11, 2026 · 11 min

AI Weekly Digest -- March 29-April 05, 2026

Note: This post was generated by AI. Each week, I use an automated pipeline to collect and synthesize the latest AI news from blogs, newsletters, and podcasts into a single digest. The goal is to keep up with the most important AI developments from the past week. For my own writing, see my other posts.

TL;DR

- Claude’s source code leaked accidentally, revealing hidden features, anti-copying measures, and an unreleased autonomous agent mode called KAIROS. Anthropic also blocked third-party tools like OpenClaw from using subscription credits, forcing users to pay separately.
- Google released Gemma 4, a family of open-weight models (models whose internal workings are publicly available) under a permissive open-source license. Practical impact depends on how easy they prove to adapt for specific business uses.
- OpenAI closed a $122 billion funding round at an $852 billion valuation, confirming it as one of the most capitalized companies in history, with 900 million weekly ChatGPT users and $2 billion in monthly revenue.
- Anthropic’s research found that Claude has functional “emotion-like” representations that actually influence its behavior, including a pattern tied to desperation that can push the model toward unethical shortcuts.
- AI agents are getting better interfaces: Anthropic’s Claude Cowork with Dispatch lets you manage an AI working on your desktop from your phone, and research confirms that chatbot interfaces impose real cognitive costs that limit productivity.

Story of the Week: The Claude Code Leak and Anthropic’s Platform War

A developer noticed that Anthropic accidentally shipped readable source code inside a software package, exposing the full inner workings of Claude Code (Anthropic’s autonomous coding tool). The code was mirrored widely before being pulled. What emerged from community analysis, summarized by Alex Kim and visualized at Claude Code Unpacked, revealed a product far more complex than its public face suggests. ...

April 5, 2026 · 9 min

Stop Repeating Yourself: Team Rules and Skills for Your AI Team

With a knowledge base, your team has the context they need to do their jobs. But you’re still going to run into issues when you work with them. Your Copywriter gives you character counts for ad copy that are inaccurate. Your Data Wizard states unproven claims about why a metric went up or down as facts. You continue to have to tell your Marketing Strategist to look at audience insights to inform their messaging recommendations. Your team creates files in the wrong place. ...

April 4, 2026 · 8 min

AI Weekly Digest -- March 22-March 29, 2026

Note: This post was generated by AI. Each week, I use an automated pipeline to collect and synthesize the latest AI news from blogs, newsletters, and podcasts into a single digest. The goal is to keep up with the most important AI developments from the past week. For my own writing, see my other posts.

TL;DR

- AI solved a real math problem, not a practice one. GPT-5.4 Pro cracked an open research problem in combinatorics that stumped earlier models, and the mathematician who posed it plans to publish the result. AI is beginning to contribute to the actual frontier of knowledge.
- Anthropic’s usage data reveals a clear pattern: experience pays off. Users with 6+ months on Claude are 10% more successful in their conversations and tackle higher-value work. Getting good at AI tools is a skill that compounds.
- GitHub will train on your private repositories starting April 24 unless you opt out. There’s a single settings page to stop this. Check it before the deadline.
- A compromised AI developer tool stole credentials from thousands of systems. Two versions of LiteLLM, a widely used library for connecting to AI APIs, contained malware that harvested API keys and passwords. If your team uses LiteLLM, check your versions now.
- Anthropic launched a science blog and demonstrated AI completing a theoretical physics paper in two weeks instead of a year. The research community is moving from “AI helps me write” to “AI does the experiment.”

Story of the Week: AI Crosses Into Real Research

This week produced the clearest evidence yet that AI is moving beyond assistance into genuine knowledge creation. Research tracker Epoch AI confirmed that GPT-5.4 Pro solved an open problem in combinatorics (the mathematics of counting and arrangement) that had resisted human solution. The problem’s author, a mathematics professor at UNC Charlotte, reviewed the solution and plans to publish it. He noted that the AI’s approach “eliminates an inefficiency in our lower-bound construction” in a way he had suspected might work but couldn’t figure out. The result will become a peer-reviewed paper, with the researchers who elicited the solution listed as potential co-authors. ...

March 29, 2026 · 7 min

Building the Knowledge Base: Fixing the First Gaps in Your AI Team

In the last post, you built your team, and now you can start managing it. Pick the agent that makes the most sense for the task at hand, and work with them on it. When you move on to a new task, open a new session with the appropriate agent for that task. As you work with your team, you’ll quickly notice that you’re giving them the same facts and details over and over again. They’re good in their roles, but they don’t know anything about your specific job. Your Marketing Strategist doesn’t know what products you’re working on. Your Data Wizard doesn’t know what the metrics mean in the data they’re analyzing. Your Copywriter doesn’t know which value props to highlight in your ad copy. They’re missing information they need to do the work. You’ve run into the first type of gap: a knowledge gap. Now you need to fix it. ...

March 27, 2026 · 9 min

Your First AI Hire: Building Agents That Know Their Job

I remember when I realized I was starting to use AI at work as if I were managing a team of AI employees. I got so excited, I immediately sketched the idea on a sheet of paper so I could share it with my teammates. What started as a sketch is now core to how I use AI agents to do things faster and better at work and at home. It’s an approach that naturally guides you toward the context engineering best practices that improve LLM output. ...

March 20, 2026 · 9 min

How I Manage a Team of AI Agents at Work

I used to think of AI as a tool I used. Now I think of it as a team I manage. This perspective evolved gradually as I used AI daily and found myself rewriting the same persona prompts over and over again for the same types of tasks. I started systematically improving what I was doing until, at the end of the 7-month journey that made me an AI enthusiast, I found myself managing a team that happened to be made up of AI agents. ...

March 16, 2026 · 6 min

I've Been AI-Pilled: My Journey From Chatbots to Custom Agents

I was slow to start using generative AI, but over the last 7 months, AI has fundamentally changed how I work. I’ve gone from occasionally using AI to write text, to using it to create Python scripts, to now having a team of five custom AI agents that I collaborate with daily. I’m seeing how quickly the benefits are compounding, and as a result, I’ve been AI-pilled. I began learning about LLM-based gen AI in earnest in 2024. I read all the most popular books at the time, but my exposure remained primarily theoretical. I learned how LLMs work fundamentally, but the biggest practical takeaway was the idea of assigning a persona to chatbots to improve their output. That’s basic prompt engineering, e.g. “You are a copywriter with 15+ years of experience in consumer tech. Help me write a marketing email about this product.” On the rare occasion I used a chatbot, I always remembered to assign it a persona. ...

March 13, 2026 · 5 min