Give That Model A Treat! : Reinforcement Learning Explained Tic-Tac-Toe The Hard Way podcast

Artwork

Tech Podcasting Education Rebecca Salois People AI Research Machine Learning Human Centered Reinforcement Learning Supervised Learning Tic-tac-toe Games Google

Indhold leveret af Lucas Dixon and People + AI Research. Alt podcastindhold inklusive episoder, grafik og podcastbeskrivelser uploades og leveres direkte af Lucas Dixon and People + AI Research eller deres podcastplatformspartner. Hvis du mener, at nogen bruger dit ophavsretligt beskyttede værk uden din tilladelse, kan du følge processen beskrevet her https://da.player.fm/legal.

Tic-Tac-Toe the Hard Way « »
Give that model a treat! : Reinforcement learning explained

4+ y ago 26:04

Del

MP3•Episode hjem

Indhold leveret af Lucas Dixon and People + AI Research. Alt podcastindhold inklusive episoder, grafik og podcastbeskrivelser uploades og leveres direkte af Lucas Dixon and People + AI Research eller deres podcastplatformspartner. Hvis du mener, at nogen bruger dit ophavsretligt beskyttede værk uden din tilladelse, kan du følge processen beskrevet her https://da.player.fm/legal.

Switching gears, we focus on how Yannick’s been training his model using reinforcement learning. He explains the differences from David’s supervised learning approach. We find out how his system performs against a player that makes random tic-tac-toe moves.

Resources:

Deep Learning for JavaScript book

Playing Atari with Deep Reinforcement Learning

Two Minute Papers episode on Atari DQN

For more information about the show, check out pair.withgoogle.com/thehardway/.

You can reach out to the hosts on Twitter: @dweinberger and @tafsiri.

… continue reading

10 episoder

#Tech #Podcasting Education #Rebecca Salois #People AI Research #Machine Learning #Human Centered #Reinforcement Learning #Supervised Learning #Tic-tac-toe #Games #Google

Artwork

Give that model a treat! : Reinforcement learning explained

Tic-Tac-Toe the Hard Way

published 4+ y ago

Del

MP3•Episode hjem

Indhold leveret af Lucas Dixon and People + AI Research. Alt podcastindhold inklusive episoder, grafik og podcastbeskrivelser uploades og leveres direkte af Lucas Dixon and People + AI Research eller deres podcastplatformspartner. Hvis du mener, at nogen bruger dit ophavsretligt beskyttede værk uden din tilladelse, kan du følge processen beskrevet her https://da.player.fm/legal.

Switching gears, we focus on how Yannick’s been training his model using reinforcement learning. He explains the differences from David’s supervised learning approach. We find out how his system performs against a player that makes random tic-tac-toe moves.

Resources:

Deep Learning for JavaScript book

Playing Atari with Deep Reinforcement Learning

Two Minute Papers episode on Atari DQN

For more information about the show, check out pair.withgoogle.com/thehardway/.

You can reach out to the hosts on Twitter: @dweinberger and @tafsiri.

… continue reading

10 episoder

#Tech #Podcasting Education #Rebecca Salois #People AI Research #Machine Learning #Human Centered #Reinforcement Learning #Supervised Learning #Tic-tac-toe #Games #Google

Alle Folgen

×

Velkommen til Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

Lyt til 500+ emner