Ironwood AI

Adam Holter

Cloud AI vs Open Source: Why the Music Industry Analogy Doesn’t Hold
Mar 13, 2026

There’s a YouTube video making the rounds that predicts the giant cloud AI players will crumble the same…

Meta Delays Avocado After Weak Internal Evals While xAI Keeps Losing Ground
Mar 13, 2026

Meta delayed Avocado to at least May after internal evaluations showed it lagging the top models from Google,…

ai-leaders anthropic claude-models
Cost Creep 2026: Gemini Flash Gets Worse While GPT-5.x and Claude Mostly Hold the Line
Mar 12, 2026

Cost creep is being used too loosely. A vendor raising list prices is not enough. The only definition…

ai-tensions cloud-ai coding-assistant
ChatGPT Pro vs Claude Max: Is the Leaked $100 OpenAI Plan Worth It?
Mar 11, 2026

OpenAI's leaked $100 Pro plan finally lets us compare ChatGPT Pro and Claude Max at the same price.…

ai-leaders anthropic chatgpt
Gemini Tool Calling Problems: Why It Feels Nervous in Agents
Mar 9, 2026

Gemini has a tool-calling problem, and I think the best way to describe it is that it seems…

ChinaBench: Open-Source LLM Censorship Benchmark Results Across Qwen, GLM, Kimi, MiniMax, DeepSeek, and GPT-OSS
Mar 9, 2026

I built ChinaBench because standard LLM benchmarks leave out a behavior that many users care about: what a…

GPT-5.4 Fast Mode: What It Changes and When to Use It
Mar 9, 2026

GPT-5.4 Fast Mode explained for Codex and ChatGPT users: what /fast changes, where speed matters, and when token…

agentic-tasks ai-leaders ai-models
AI Fiesta vs aikmind vs TypingMind: Skip the Middleman
Mar 4, 2026

Big thanks to Haneet Grewal for reaching out after reading the AI Fiesta post. Haneet nearly signed up…

Gemini 3.1 Flash-Lite: Cost, Speed, and Intelligence
Mar 3, 2026

Gemini 3.1 Flash-Lite Preview dropped on March 3, 2026, positioned as a drop-in replacement for low-cost models. It…

ai-leaders cloud-ai cost-efficiency
GPT 5.4 Is Already Live for Pro Users and Its SVG Generation Is Something Else
Mar 3, 2026

OpenAI has not officially released GPT 5.4, but if you are a Pro user on the model currently…

BullshitBench v2: Claude and Qwen Are the Only Models That Push Back
Mar 2, 2026

BullshitBench v2 is out. Peter Gostev tested 70+ model variants across 100 questions spanning coding, medical, legal, finance,…

Every AI Model Released in February 2026 — Full List with Specs, Pricing, and Benchmarks
Mar 2, 2026

Complete running list of every AI model launched in February 2026. Covers Claude, GPT, Gemini, Grok, and open-source…

ai-leaders ai-tensions anthropic
OpenAI Raises $110 Billion: Amazon, NVIDIA, and SoftBank Break Down the Round
Feb 27, 2026

OpenAI closed a $110 billion funding round led by Amazon, NVIDIA, and SoftBank, putting its pre-money valuation at…

ai-leaders amazon funding
Build Your Self-Surveillance System: Track Everything Locally on macOS
Feb 20, 2026

Taking notes is a waste of time. I set up a self-surveillance system for myself instead. I gave…

GPT-5.3-Codex-Spark: 1000 Tokens Per Second, But Is It Actually Faster?
Feb 13, 2026

OpenAI released GPT-5.3-Codex-Spark on February 12, 2026, a smaller version of GPT-5.3-Codex built for real-time coding. The headline…

ai-leaders benchmark chatgpt
Elon Says AI Will Generate Binary by 2026. Here’s Why That’s a Terrible Idea.
Feb 12, 2026

Elon Musk claimed AI would write raw binary by 2026. Here's why that's a strange goalpost — and…

ai-trends coding-assistant developer-tools
Claude Opus 4.6 vs GPT-5.3-Codex: Model War Benchmarks and Self-Improvement
Feb 6, 2026

On February 5, 2026, Anthropic and OpenAI did something everyone expected: they turned flagship launches into a direct…

Which AI Model Is Best in 2026? Claude vs GPT-5 vs Gemini 3 vs Grok vs GLM — Ranked
Jan 31, 2026

Side-by-side benchmark results for the top AI models of 2026. See how Claude Opus, GPT-5, Gemini 3, Grok,…

OpenAI Retiring GPT-4o, GPT-4.1, GPT-4.1 mini, and o4-mini from ChatGPT on Feb 13 2026: Good Riddance to GPT-4o
Jan 31, 2026

OpenAI just announced the planned retirement of four ChatGPT models: GPT-4o, GPT-4.1, GPT-4.1 mini, and o4-mini. The hard…

Claude for Healthcare vs ChatGPT Health: Same Week, Different Strategy
Jan 23, 2026

Anthropic and OpenAI both decided that healthcare is a place to put an AI wrapper around messy context…

HeartMuLa (3B) Is the First Local Music Model That Feels Close, But AI Music Needs Editing to Really Take Off
Jan 23, 2026

I do not think AI music takes off in a big way until we get good autoregressive models…

ai-leaders amazon benchmark
Typeless Android Keyboard: Real Voice-to-Text Without the Cleanup
Jan 23, 2026

Typeless Android keyboard review after real use: voice-to-text accuracy, cleanup quality, workflow fit, and whether it is better…

ChatGPT Ads are Coming: The End of the Ad-Free Era for Free Users
Jan 16, 2026

OpenAI is finally pulling the trigger on monetization for the masses. Starting in the next few weeks, they…

ai-leaders chatgpt industry-drama
ChatGPT Health is OpenAI’s smartest wrapper yet, because healthcare is mostly paperwork and missing context
Jan 7, 2026

OpenAI just launched ChatGPT Health, a dedicated space inside ChatGPT for health conversations on mobile and web. The…

I Open-Sourced ai-aggregator: My Daily Dashboard for Tracking New AI Models Across Providers
Jan 7, 2026

I officially open-sourced ai-aggregator, the dashboard I use every day to keep up with new AI model releases…


HIGHLIGHTS

Google I/O 2026 Leaks: Gemini 3.5, Omni, and What to Expect (May 19, 2026)
Google Caught an AI-Generated Zero-Day Exploit (May 13, 2026)
You Shouldn't Use GPT-5.5 Instant (May 5, 2026)
SubQ and Long Context Efficiency (May 5, 2026)
I Should Have Applied to the GPT-5.5 Party Anyway (May 5, 2026)

ABOUT ADAM

I write about AI, language models, and what’s happening in the space.
Adam Holter
Founder of Ironwood AI.


© 2026 Adam Holter. All rights reserved.
