Comment by austinbaggio
Comment by austinbaggio 16 hours ago
| need a solution for context decay and relevant curation — with benchmarks that prove it is also more valuable than constant rediscovery (for quality and cost).
I agree. We are looking at some metr benchmarks, not expecting a simple answer to this, but do you have any in mind you find compelling?
Not really. But, You can go viral again with a "Coding Agents with memory build better software using less tokens" showcasing how you benchmarked a "twitter rebuild" -
1. Setup Claude Code to build some layers of the stack
2. Setup Codex to build others.
In one instance equip them both with your product. Maybe bake in some tribal knowledge.
In another instance let them work raw.
In both instances, capture:
I imagine this benchmark working well in your favor.