SWE-Atlas
Ai Tool
SWE-Atlas is an open benchmark from Scale AI that assesses large-language-model coding agents on realistic software-engineering tasks such as codebase Q&A, test writing and multi-file refactoring inside containerized repositories. It is intended for researchers and developers who want to measure and compare agent performance on end-to-end maintenance work rather than isolated code snippets.
article
1 story
calendar_today
First: 2026-03-09
update
Last: 2026-03-09
menu_book
Wikipedia