Comment by qwesr123 3 days ago

FYI, the MarginLab Claude Code degradation tracker is showing a statistically significant ~4% drop in SWE-Bench-Pro accuracy over the past month.