254 subscribers
Gå offline med appen Player FM !
Podcasts der er værd at lytte til
SPONSORERET


1 Family Secrets: Chris Pratt & Millie Bobby Brown Share Stories From Set 22:08
Can we build a generalist agent? Dr. Minqi Jiang and Dr. Marc Rigter
Manage episode 407961751 series 2803422
Dr. Minqi Jiang and Dr. Marc Rigter explain an innovative new method to make the intelligence of agents more general-purpose by training them to learn many worlds before their usual goal-directed training, which we call "reinforcement learning". Their new paper is called "Reward-free curricula for training robust world models" https://arxiv.org/pdf/2306.09205.pdf https://twitter.com/MinqiJiang https://twitter.com/MarcRigter Interviewer: Dr. Tim Scarfe Please support us on Patreon, Tim is now doing MLST full-time and taking a massive financial hit. If you love MLST and want this to continue, please show your support! In return you get access to shows very early and private discord and networking. https://patreon.com/mlst We are also looking for show sponsors, please get in touch if interested mlstreettalk at gmail. MLST Discord: https://discord.gg/machine-learning-street-talk-mlst-937356144060530778
214 episoder
Manage episode 407961751 series 2803422
Dr. Minqi Jiang and Dr. Marc Rigter explain an innovative new method to make the intelligence of agents more general-purpose by training them to learn many worlds before their usual goal-directed training, which we call "reinforcement learning". Their new paper is called "Reward-free curricula for training robust world models" https://arxiv.org/pdf/2306.09205.pdf https://twitter.com/MinqiJiang https://twitter.com/MarcRigter Interviewer: Dr. Tim Scarfe Please support us on Patreon, Tim is now doing MLST full-time and taking a massive financial hit. If you love MLST and want this to continue, please show your support! In return you get access to shows very early and private discord and networking. https://patreon.com/mlst We are also looking for show sponsors, please get in touch if interested mlstreettalk at gmail. MLST Discord: https://discord.gg/machine-learning-street-talk-mlst-937356144060530778
214 episoder
All episodes
×

1 The Compendium - Connor Leahy and Gabriel Alfour 1:37:10


1 ARC Prize v2 Launch! (Francois Chollet and Mike Knoop) 54:15


1 Test-Time Adaptation: the key to reasoning with DL (Mohamed Osman) 1:03:36


1 GSMSymbolic paper - Iman Mirzadeh (Apple) 1:11:23


1 Reasoning, Robustness, and Human Feedback in AI - Max Bartolo (Cohere) 1:23:11


1 Tau Language: The Software Synthesis Future (sponsored) 1:41:19


1 John Palazza - Vice President of Global Sales @ CentML ( sponsored) 54:50


1 Transformers Need Glasses! - Federico Barbero 1:00:54


1 Sakana AI - Chris Lu, Robert Tjarko Lange, Cong Lu 1:37:54


1 Clement Bonnet - Can Latent Program Networks Solve Abstract Reasoning? 51:26


1 Prof. Jakob Foerster - ImageNet Moment for Reinforcement Learning? 53:31


1 Daniel Franzen & Jan Disselhoff - ARC Prize 2024 winners 1:09:04


1 Sepp Hochreiter - LSTM: The Comeback Story? 1:07:01


1 Want to Understand Neural Networks? Think Elastic Origami! - Prof. Randall Balestriero 1:18:10


1 Nicholas Carlini (Google DeepMind) 1:21:15
Velkommen til Player FM!
Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.