What Dead Code Taught Us About Building Tools for AI Agents
We benchmarked AI agents on dead code detection across 50+ real-world runs and 12 open-source repositories. Graph-enhanced agents achieved 97% recall where baseline agents scored 0%. Here's what we learned about context engineering, honest benchmarking, and why code graphs are the missing primitive for software factories.


