Yutori
We’re building AI agents that can reliably do everyday digital tasks for you on the web, towards an AI chief-of-staff for everyone.
Building on the recent empirical work of Kwa et al. (2025), I show that, within their suite of research-engineering tasks, the performance of AI agents on longer-duration tasks can be explained by an extremely simple mathematical model: a constant rate of failing during each minute a human would take
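
As a rough illustration of the constant-failure-rate idea in the abstract above: if an agent has the same chance of failing during every minute of human-equivalent task time, its success probability decays exponentially with task length. The sketch below assumes that reading. The function names and the "half-life" parametrization (the task length at which success drops to 50%) are illustrative choices, not taken from the excerpt, and the 60-minute value is a made-up number rather than a measured result.

```python
import math


def hazard_rate(half_life_minutes: float) -> float:
    """Constant per-minute failure rate implied by a given half-life.

    Hypothetical helper: the half-life is the task length (in human minutes)
    at which the modeled success probability equals 0.5.
    """
    return math.log(2) / half_life_minutes


def success_probability(task_minutes: float, half_life_minutes: float) -> float:
    """Modeled chance of completing a task without any failure.

    A constant hazard rate gives exponential survival:
    P(success) = exp(-rate * task_minutes).
    """
    return math.exp(-hazard_rate(half_life_minutes) * task_minutes)


if __name__ == "__main__":
    # Assumed, illustrative half-life of 60 human-minutes.
    for minutes in (15, 60, 240):
        print(minutes, round(success_probability(minutes, 60.0), 4))
```

Under this model, doubling the task length squares the success probability, so a task twice as long as the half-life succeeds about a quarter of the time.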