Tagged: agents
Set the Bar, Hold the Bar
AI

Set the Bar, Hold the Bar

Agent deployments rarely fail because the model is weak. They fail because nobody defined what done means before the run, or nobody checked the result after. The Bar is the two-part framework for the only jobs left on the human side.

Cost per Solved Task, Not Cost per Token
AI

Cost per Solved Task, Not Cost per Token

Uber capped engineers at $1,500 a month after burning its annual AI budget in four months, and Fable 5 costs double Opus yet wins on long migrations. Per-token price stopped being the cost; cost per solved task is, and the lever that controls it is making loops halt.

Rules Are Who You Are, Skills Are What You Know, Prompts Are What You Want
AI

Rules Are Who You Are, Skills Are What You Know, Prompts Are What You Want

Rules, skills, and prompts each have their own cost model, and filing instructions under the wrong layer is why agents feel either bloated or ignorant. A field guide to sorting the pile.

The Judge Does Not Run Your Tests
AI

The Judge Does Not Run Your Tests

The judge model behind agent loops like Claude Code's /goal never runs your tests or reads your repo. It only reads the transcript, so verification is only as real as the receipts your agent produces.

Stop Being the Thing in the Loop
AI

Stop Being the Thing in the Loop

Boris Cherny writes loops that prompt the agent instead of prompting it himself. The job moved from writing code to writing the thing that writes the code, and only two properties make that loop trustworthy: an external check and hard stops.

From RAG to Agentic Memory, a Working Blueprint
AI

From RAG to Agentic Memory, a Working Blueprint

Replace static RAG with a memory-first agent. A working blueprint for episodic, semantic, and working memory.

Gartner Says 40% of Enterprise Apps Will Use AI Agents by December
AI

Gartner Says 40% of Enterprise Apps Will Use AI Agents by December

From under 5% to 40% in one year. Gartner predicts an eightfold increase in AI agent adoption across enterprise apps, while 88% of companies using AI still struggle to show bottom-line impact.

Healthcare AI Agents Have Shipped. The Validation Frameworks Haven't.
AI

Healthcare AI Agents Have Shipped. The Validation Frameworks Haven't.

Epic just put three AI agents on stage at HIMSS 2026. Art writes notes. Penny handles billing. Emmie talks to patients. The validation strategy was absent.

MCP Gave AI Agents Superpowers. Attackers Noticed.
AI

MCP Gave AI Agents Superpowers. Attackers Noticed.

The protocol that lets AI agents use tools also gave attackers a new attack surface. January 2026 showed us how bad it can get.

The CLAUDE.md File Is Your Actual Product Now
AI

The CLAUDE.md File Is Your Actual Product Now

The file that tells your AI agent how to behave has become the highest-leverage artifact in your entire workflow. Not the code. The configuration.

All ai agents coding enterprise rag