Repository-Exploration on MLLog.dev

Repository-Exploration on MLLog.devhttps://mllog.dev/pl/tags/repository-exploration/Recent content in Repository-Exploration on MLLog.devMLLog.devhttps://mllog.dev/images/default_mllog.pnghttps://mllog.dev/images/default_mllog.pngHugo -- 0.147.9plTue, 09 Jun 2026 08:00:00 +0100SWE-Explore: Benchmark oceniający jak agenci kodujący eksplorują repozytoriahttps://mllog.dev/pl/posts/swe-explore-benchmarking-coding-agents-repository-exploration/Tue, 09 Jun 2026 08:00:00 +0100https://mllog.dev/pl/posts/swe-explore-benchmarking-coding-agents-repository-exploration/SWE-Explore izoluje eksplorację repozytorium od generowania patchy - 848 issue'ów, 10 języków, 203 repozytoria. Benchmark ujawnia, że agenci świetnie znajdują właściwe pliki, ale fatalnie celują na poziomie linii kodu, a efektywność kontekstu koreluje z resolve rate na poziomie r = 0.950.