Ironwood AI

Adam Holter

Claude Mythos Preview: First AI to Complete AISI 32-Step Cyber Range End-to-End
Apr 14, 2026

AISI Cyber Evaluations: Mythos Preview’s Breakthrough

Claude Mythos Preview finished an end-to-end corporate network attack simulation. The AI Security Institute designed this as a 32-step scenario. A human expert would need about 20 hours to work through it. Models from 2023 struggled with basic cyber tasks. Mythos crossed a clear line here. It shows what […]

Model Degradation: Intentional Sabotage or Accidental Slips?
Apr 14, 2026

Claims that AI models degrade over time are not just anecdotal. There’s a real pattern, especially with Anthropic’s…

AI Backlash Hits Home: Sam Altman’s Residence Faces Multiple Attacks
Apr 13, 2026

Sam Altman’s San Francisco home has endured two attacks within days, part of three incidents connected to AI…

ai-leaders ai-tensions industry-drama
OpenAI Super App: ChatGPT, Codex, and Atlas Browser Combined
Apr 10, 2026

OpenAI is developing a desktop super app that unifies ChatGPT, Atlas browser, and Codex into a single interface.…

agentic-tasks chatgpt-atlas-codex coding-assistant
Elon Musk Accidentally Leaked Anthropic’s Model Sizes
Apr 10, 2026

How many parameters does Claude have? Until recently, the honest answer was that nobody outside Anthropic knew. Anthropic…

AI Images 2014 vs 2026: OpenAI Optimized for Style, Google Optimized for Realism
Apr 9, 2026

If you told somebody in 2020 what AI image generation would look like in 2026, they would never…

Muse Spark: Meta’s Efficiency Play
Apr 8, 2026

Meta Superintelligence Labs dropped Muse Spark today. It’s their first reasoning model and the first product out of…

ai-leaders chatgpt-atlas-codex claude-models
OpenAI Spud: Leaked April 16 Release, Mythos-Level Benchmarks, and What GPT-5.5 or GPT-6 Might Mean
Apr 8, 2026

Claude Mythos Preview posted 77.80% on SWE-bench Pro. GPT-5.4 is at 57.70%. OpenAI has been signaling that Spud,…

Did Claude Mythos Hack Linux? Yes!
Apr 7, 2026

Anthropic announced Claude Mythos Preview and Project Glasswing today. The model is not publicly available. It is restricted…

ai-leaders claude-models industry-drama
The SenseMath Paper Tested Budget Models and Called It a Verdict on AI
Apr 7, 2026

A paper out of the University of Notre Dame called SenseMath is making the rounds with the claim…

Someone Made a Whip for Claude Code. Here’s Why That’s a Bad Idea.
Apr 7, 2026

Someone built a tool called Badclaude that gives you a literal animated whip to crack at Claude Code.…

Claude Code Leaked. But Can It Run Doom? And Can Doom Run Codex?
Apr 6, 2026

Claude Code leaked. The entire source. Over 500,000 lines of TypeScript shipped inside an npm package that was…

A Better Golden Mean: The Color Picker Model for Courage
Mar 28, 2026

I want to do something a little different from my normal AI content. I had a logical idea…

Sam Altman Posts First Footage of Steel Beams Going Up at Michigan Stargate Site
Mar 28, 2026

Sam Altman posted an aerial video on March 27, 2026, showing the first steel beams rising at the…

Claude Mythos Leaked: Anthropic’s Biggest Model Yet Has a Cybersecurity Problem
Mar 27, 2026

Anthropic accidentally exposed a draft blog post and roughly 3,000 unpublished assets on March 27, 2026, via an…

Codex Plugins Are Here: Slack, Figma, Google Drive, and More
Mar 27, 2026

OpenAI rolled out plugins for Codex on March 26, 2026. The integrations cover Slack, Figma, Notion, Gmail, and…

ARC-AGI-3 Launch: SOTA Models Score Under 1% and the Human Baseline Is Rigged
Mar 27, 2026

ARC-AGI-3 launched on March 25, 2026, as an interactive reasoning benchmark. Where ARC-AGI-1 and ARC-AGI-2 gave models static…

xAI Should Stop Pretending to Be a Frontier Lab
Mar 27, 2026

xAI has a genuine strength, and it is not competing with Claude Opus 4.5 or GPT-5.2. It is…

ai-leaders cost-efficiency developer-tools
OpenAI Spud: What the Rumors Actually Say
Mar 25, 2026

OpenAI Spud rumors: release date, features, comparison to GPT-5, and what insiders actually know about this potential new…

Every Claude Model Released Since January 2026: The Complete Timeline
Mar 25, 2026

Claude 3.5, Claude 3.7, Claude Opus 4 — track every Anthropic model release since January 2026 with specs,…

claude-models industry-drama models-products
GPT-5.4 Mini and Nano: Benchmarks, Pricing, and What They’re Actually Good For
Mar 17, 2026

OpenAI released GPT-5.4 mini and GPT-5.4 nano on March 17, 2026. These are smaller, faster variants of GPT-5.4…

ai-leaders coding-assistant cost-efficiency
Data Centers Are Ordering Ship Engines for On-Site Power
Mar 16, 2026

Data centers are buying massive ship-derived and industrial engines to generate their own power on-site. This is not…

ai-leaders cost-efficiency data-centers
Anthropic Hits $19B ARR as Apple Runs Its Internal Dev on Claude
Mar 16, 2026

Anthropic is generating $19 billion in annualized revenue. Bloomberg’s Mark Gurman reported that Apple, despite partnering with Google…

ai-leaders anthropic apple
OpenAI Adds Interactive Math and Science Tools to ChatGPT: 70+ Topics Now Live
Mar 16, 2026

OpenAI rolled out interactive math and science modules to ChatGPT on March 10, 2026. When you ask about…

CursorBench-3: How Cursor Evaluates Coding Agents on Real Developer Tasks
Mar 16, 2026

Cursor released CursorBench-3, an updated internal benchmark for evaluating coding agents on tasks that actually look like real…


About Adam

I write about AI, language models, and what's actually happening in the space.

Founder of Ironwood AI.

© 2026 Adam Holter