Give That Model A Treat! : Reinforcement Learning Explained Tic-Tac-Toe The Hard Way podcast


Tech, Podcasting Education, Rebecca Salois, People AI Research, Machine Learning, Human Centered, Reinforcement Learning, Supervised Learning, Tic-tac-toe, Games, Google

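Content provided by Lucas Dixon and People + AI Research. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Lucas Dixon and People + AI Research or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here: https://zh.player.fm/legal.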

Tic-Tac-Toe the Hard Way
Give that model a treat! : Reinforcement learning explained

5y ago 26:04


Switching gears, we focus on how Yannick’s been training his model using reinforcement learning. He explains the differences from David’s supervised learning approach. We find out how his system performs against a player that makes random tic-tac-toe moves.

Resources:

Deep Learning for JavaScript book

Playing Atari with Deep Reinforcement Learning

Two Minute Papers episode on Atari DQN

For more information about the show, check out pair.withgoogle.com/thehardway/.

You can reach out to the hosts on Twitter: @dweinberger and @tafsiri.


10 episodes


