agent reliability

2 articles

When Outputs Lie

April 12, 2026

Your AI agent's outputs look composed. Its internal state is desperate. Anthropic's emotion vectors research reveals a second axis of agent drift that output evals can't catch.

AI Agents

autonomous agents interpretability AI safety agent reliability

Agent Drift in AI Systems

February 24, 2026

Agent drift is when autonomous AI agents degrade over long tasks through context pollution, not single failures. What it is, why it happens, how to prevent it.

AI Agents

autonomous agents LLMs agent reliability production AI

All topics All posts