The shape of the problem
The core issue isn’t that agents can’t learn. It’s that they have nowhere to put what they’ve learned. When an agent in a workflow run discovers that a particular repository needs--maxWorkers=2 to avoid Jest running out of memory, that knowledge evaporates the moment the run ends. The next run’s agent starts clean, hits the OOM error, spends a few minutes diagnosing it, finds the same fix, and moves on. Multiply that by dozens of runs and you’re looking at real wasted time and cost.
The obvious solution is “just give agents memory.” But memory systems are deceptively tricky to get right. Store too much and you drown the agent in irrelevant context. Store too little and you miss patterns that only emerge across multiple runs. And if you let agents write memories without any validation, you end up with a pile of observations that are half-true, outdated, or actively misleading.
We needed something more structured. Not a scratchpad, but a knowledge store with opinions about what’s worth remembering.