毎週水曜日更新中!
…
continue reading
Indhold leveret af Hajime Morrita , Jun Mukai. Alt podcastindhold inklusive episoder, grafik og podcastbeskrivelser uploades og leveres direkte af Hajime Morrita , Jun Mukai eller deres podcastplatformspartner. Hvis du mener, at nogen bruger dit ophavsretligt beskyttede værk uden din tilladelse, kan du følge processen beskrevet her https://da.player.fm/legal.
Player FM - Podcast-app
Gå offline med appen Player FM !
Gå offline med appen Player FM !
#143 – SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
MP3•Episode hjem
Manage episode 454899224 series 2151064
Indhold leveret af Hajime Morrita , Jun Mukai. Alt podcastindhold inklusive episoder, grafik og podcastbeskrivelser uploades og leveres direkte af Hajime Morrita , Jun Mukai eller deres podcastplatformspartner. Hvis du mener, at nogen bruger dit ophavsretligt beskyttede værk uden din tilladelse, kan du følge processen beskrevet her https://da.player.fm/legal.
GitHub の Issue を読んでバグを直すエーアイについて森田が読みました。ご意見感想などは Reddit やおたより投書箱にお寄せください。iTunes のレビューや星もよろしくね。
- [2310.06770] SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
- [2405.15793] SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
- SWE-bench
- Introducing SWE-bench Verified | OpenAI
- The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic
147 episoder
MP3•Episode hjem
Manage episode 454899224 series 2151064
Indhold leveret af Hajime Morrita , Jun Mukai. Alt podcastindhold inklusive episoder, grafik og podcastbeskrivelser uploades og leveres direkte af Hajime Morrita , Jun Mukai eller deres podcastplatformspartner. Hvis du mener, at nogen bruger dit ophavsretligt beskyttede værk uden din tilladelse, kan du følge processen beskrevet her https://da.player.fm/legal.
GitHub の Issue を読んでバグを直すエーアイについて森田が読みました。ご意見感想などは Reddit やおたより投書箱にお寄せください。iTunes のレビューや星もよろしくね。
- [2310.06770] SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
- [2405.15793] SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
- SWE-bench
- Introducing SWE-bench Verified | OpenAI
- The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic
147 episoder
כל הפרקים
×Velkommen til Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.