Language Models & Co.

Language Models & Co.

Share this post

Language Models & Co.
Language Models & Co.
SWE-Bench authors reflect on the state of LLM agents at Neurips 2024
Copy link
Facebook
Email
Notes
More

SWE-Bench authors reflect on the state of LLM…

Jay Alammar
Jan 14
5

Share this post

Language Models & Co.
Language Models & Co.
SWE-Bench authors reflect on the state of LLM agents at Neurips 2024
Copy link
Facebook
Email
Notes
More
1

The SWE-bench task measures AI agents on software engineering tasks at the level of a github issue.

Read →
Comments
User's avatar
© 2025 Jay Alammar
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share

Copy link
Facebook
Email
Notes
More