
Cognition is putting money behind a claim almost no AI vendor has been willing to make: that its agent actually pays for itself. The company behind Devin just announced an AI Productivity Guarantee for enterprise customers, promising that if Devin delivers less engineering value than a customer pays for over the course of an annual contract, Cognition will fund usage with credits up to $10 million until the numbers balance out.
The pitch, written by CEO Scott Wu, is aimed squarely at the dashboards CTOs have been staring at all year. Tokens consumed, lines of code, PRs opened, none of these answer whether the spend produced anything useful. Cognition is trying to replace that activity-metric culture with a single dollar-denominated number: how many engineer-hours of real work did the agent deliver?
Turning agent sessions into hours
The guarantee is backed by a technical system Cognition built to estimate the productive engineering hours behind every completed Devin session. An estimator agent runs over each session, first classifying whether it produced useful output, then estimating how long a human engineer would have taken to produce the same work. The agent gets the user prompt, the resulting PR (if any), the full execution trace of every action Devin took, and codebase context pulled from DeepWiki.
Filtering for useful work matters a lot. If a session created PRs and none were merged, the hours are thrown out. For sessions without a PR, a classifier removes 1 to 20 percent of runs depending on the customer, while keeping legitimately productive non-code work like security scans, bug triage, dependency cleanup, and analytics queries.
Don't miss what's next in AI
Join 300,000+ engineers and researchers who get the signal, not the noise.
- Full access to in-depth AI research breakdowns
- Be the first to know what's trending before it hits mainstream
- Daily curated papers, repos, and industry moves

