Language Models & Co.
SWE-bench authors reflect on the state of LLM agents at NeurIPS 2024
Jay Alammar
Jan 14
The SWE-bench benchmark measures how well AI agents handle software engineering tasks at the level of a GitHub issue.