All Articles

AI news, analysis, and takes that actually age well.

Deep dives, model breakdowns, and the stories behind the headlines — written by practitioners, for practitioners.

All
AI News
Models
Workflows
Strategy
Tools
Fire Featured
Anthropic vs. the Pentagon: What the DoD ruling actually means for AI adoption
The DoD labeled Anthropic a supply-chain risk, nearly 40 OpenAI and Google employees signed on in support, and Claude hit #1 in the App Store the same week. We break down what this actually means for enterprise AI procurement in 2026.
March 14, 2026 · 8 min read
Models
GPT-5.4's 1M token context: what it unlocks and when it breaks
We ran 20 real-world tasks across the full context window. The results are more nuanced than the benchmarks suggest.
March 11, 2026 · 6 min read
Strategy
MiniMax M2.5 is the best model you're not using yet
Benchmarks neck-and-neck with Claude Opus 4.6. Cost arbitrage is wide open. Here's the exact prompt delta.
March 9, 2026 · 5 min read
Workflows
Apple rebuilt Siri on Gemini — here's what changed in your stack
iOS 26.4 routes Siri queries through Private Cloud Compute running Gemini. We mapped which tasks go where and how to prompt for max results.
March 7, 2026 · 7 min read
AI News
Yann LeCun raised $1B+ for AMI. Here's why it matters more than the headlines suggest
World models, open source, and a direct challenge to the current transformer paradigm. What LeCun's bet actually means for the next five years.
March 5, 2026 · 6 min read
Tools
OpenAI acquires Promptfoo: red-teaming goes in-house
The leading open-source LLM evaluation tool is now part of OpenAI. We explain what this means for the independent evals ecosystem.
March 3, 2026 · 4 min read
Models
DeepSeek V4 at 1T parameters: the full technical breakdown
Architecture, training data, benchmark results, and a blunt assessment of where it beats frontier models and where it still falls short.
Feb 28, 2026 · 9 min read
Prompting
The chain-of-density prompting technique that actually works
We extracted a buried research finding, tested it across 5 models, and have the exact template ready to copy. Your summaries are about to get sharper.
Feb 25, 2026 · 5 min read
Strategy
Humanity's Last Exam results: what frontier models still can't do
The benchmark designed to stump the best models revealed exactly where the capability ceiling still sits. The gaps are more specific than you'd expect.
Feb 22, 2026 · 7 min read

Get these articles first.

New articles drop in the newsletter every Tuesday before they hit the site.