Posts

GPT-5.6 Sol and Claude Mythos Show That the AI Race Has Reached a New Level

Image
  GPT-5.6 Sol matters because the performance story is strong. OpenAI says Sol is its strongest cyber model yet . On ExploitBench , it says Sol is competitive with Claude Mythos Preview while using about one-third of the output tokens . On ExploitGym , Sol leads the rest of the GPT-5.6 family as reasoning budgets rise. And on OpenAI's internal capture-the-flag tasks, Sol is close to saturation. That is already enough to make this launch important. But the bigger story starts right after the performance story. In GPT-5.5 Is Not Just a Better Model. It Is a Deployment Story. , I argued that frontier launches are starting to ship as packages: model, safeguards, access policy, and post-launch testing. GPT-5.6 Sol looks like the next step in that pattern. What changed this time is simple: this is no longer only a race for better answers or higher benchmark scores. It is also a race to decide who gets access, how closely that access is monitored, and how much institutional visib...

Anthropic’s Cyber Research Suggests AI Is Reducing the Time Between a Patch and an Exploit

Image
  On May 22, June 3, and June 8, 2026 , Anthropic published three cyber research posts that looked like different stories. One was about exploit benchmarks. One mapped malicious AI use to the MITRE ATT&CK framework, which is a common way security teams describe attacker behavior. One tested how fast models could turn published patches into working exploits. Read together, they point to a simpler operational shift. Back in Claude Mythos Preview: The Most Important AI Release Wasn't a Release , I argued that frontier cyber capability was becoming a deployment problem , not only a benchmark story. These new posts push that idea into plainer English: AI is reducing the time between a patch and an exploit. That sentence is my synthesis, not Anthropic's single official headline. But the pattern is hard to miss. What These Three Posts Actually Show The three posts answer three different questions: Measuring LLMs' ability to develop exploits asks how far models can go...

Why Anthropic and OpenAI’s S-1 Filings Matter

Image
On June 1, 2026 , Anthropic said it had confidentially submitted a draft S-1 to the SEC. On June 8, 2026 , OpenAI said it had recently submitted a confidential S-1 too. For frontier AI , that matters. But this is not really an IPO timing story. The better question is what Anthropic and OpenAI are starting to look like. Last week, I wrote that Anthropic’s Series H and Draft S-1 Point to a Bigger Shift in Frontier AI . OpenAI’s June 8 announcement makes that broader pattern harder to miss. What A Confidential S-1 Actually Means In simple terms, a confidential S-1 means a company has started the process that could let it go public later. It does not mean the IPO is happening now. It does not mean the price is set. And it does not tell us the exact date a listing will happen. Both companies say that clearly. Anthropic says the IPO would depend on market conditions and other factors. OpenAI says it has not decided on timing yet and may still prefer being private for some things...

Claude Fable 5 Shows a New Way to Release Powerful AI

Image
Anthropic has released its strongest public Claude yet , and that alone is big news. Claude Fable 5 is clearly a capability story. Anthropic says Fable 5 is its most capable model ever made widely available, and Google Cloud describes it as strong at multi-step reasoning, software work, and document analysis. That is a real jump. But what caught my attention is not only the model. It is the way Anthropic chose to release it, and what that says about where the most advanced AI models are going. The Interesting Part Is The Release Anthropic did not ship this model like a normal product release. It shipped with clear limits . Powerful models may increasingly reach the public with built-in control layers around them. In simple terms, Fable 5 suggests a new release pattern: better model, but not direct access to everything that model can do. The Public Version Has Limits The first detail is the easiest one to miss. Fable 5 is described as a Mythos-class model. In plain language, that...

Anthropic’s Series H and Draft S-1 Point to a Bigger Shift in Frontier AI

Image
 On May 28, 2026 , Anthropic announced a $65 billion Series H at a $965 billion post-money valuation . On June 1, 2026 , it said it had confidentially submitted a draft S-1 to the SEC, which means it now has the option to go public later if conditions are right. Those look like normal company-building signals. But the more interesting story is not the valuation, and not the IPO option by itself. It is the kind of company Anthropic has had to become first. Read next to Anthropic’s own posts on compute expansion , Responsible Scaling Policy , Constitutional Classifiers , and Project Glasswing , the signal is hard to miss: frontier AI companies are starting to look less like ordinary software companies and more like operators of critical systems. This is less an IPO story than an operating-model story. What Do I Mean By "Operators Of Critical Systems" Here? I do not mean Anthropic is literally becoming a power company, a telecom, or a public utility. I mean somethin...

Why Frontier AI Needs Critics It Can't Buy

Image
  The Vatican was the dramatic setting. The more important signal came from Chris Olah's remarks : every frontier AI lab is trapped inside incentives that can conflict with doing the right thing. That is the part builders should pay attention to. A lab at the frontier is saying, in public, that it cannot be its own final critic. This Is Not Really a Church Story You do not need to be Catholic to find the point here. The Vatican matters as a setting because it sits outside Silicon Valley's incentive stack. It is one example of an institution the labs do not control, cannot hire, and cannot fold into a product roadmap. Anthropic's own follow-up, Widening the conversation on frontier AI , makes the scope even clearer. They are not only talking to clergy. They are talking to philosophers, humanists, lawyers, writers, psychologists, and civic leaders across traditions. The real story is simple: the labs are looking for outside judgment because inside judgment is co...

Project Glasswing Update: The Bottleneck Is Moving From Discovery to Patching

Image
    Last month, I wrote that Project Glasswing mattered less as a model announcement and more as a deployment signal . Anthropic had a frontier model with unusually strong cyber capability, and instead of shipping it broadly, it wrapped it in a controlled defensive program. The new initial update for Project Glasswing makes that decision easier to understand. It does not prove vulnerability research is a solved problem. But it does show something important: AI-assisted vulnerability discovery is scaling faster than the human systems that verify, disclose, patch, and deploy fixes. What Changed In The First Month? Anthropic says its approximately 50 partners have already found more than 10,000 high- or critical-severity vulnerabilities across systemically important software. The public examples are notable: Cloudflare says Mythos Preview helped it find 2,000 bugs , including 400 high- or critical-severity issues, with a false positive rate its team consid...