Evaluating Context Compression for AI Agents | Factory.ai
When an AI agent helps you work through a complex task across hundreds of messages, what happens when it runs out of me…
Building on the recent empirical work of Kwa et al. (2025), I show that within their suite of research-engineering tasks the performance of AI agents on longer-duration tasks can be explained by an extremely simple mathematical model — a constant rate of failing during each minute a human would take
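
To make the constant-failure-rate idea concrete, here is a minimal sketch, not the author's own code. It assumes the failure rate is a constant hazard in continuous time, so the chance of success decays exponentially with the human time a task would take and can be summarized by a single half-life. The function names and the 60-minute half-life in the usage lines are hypothetical, chosen only for illustration; they are not values from Kwa et al. (2025).

```python
import math


def hazard_rate(half_life_minutes: float) -> float:
    """Constant per-minute failure rate implied by a given half-life."""
    if half_life_minutes <= 0:
        raise ValueError("half_life_minutes must be positive")
    return math.log(2) / half_life_minutes


def success_probability(task_minutes: float, half_life_minutes: float) -> float:
    """Probability of finishing a task of the given human-time length.

    Under a constant hazard rate, survival is exp(-rate * t), which is
    the same as 0.5 ** (t / half_life).
    """
    if task_minutes < 0:
        raise ValueError("task_minutes must be non-negative")
    return math.exp(-hazard_rate(half_life_minutes) * task_minutes)


# Illustrative values only: a hypothetical agent with a 60-minute half-life.
for minutes in (15, 60, 120, 240):
    p = success_probability(minutes, half_life_minutes=60)
    print(f"{minutes:>4} min task: {p:.3f}")
```

Under this assumption, doubling the task length squares the success probability, so an agent that succeeds half the time on a task of one half-life would succeed a quarter of the time on a task twice as long.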