使用Player FM应用程序离线!
Democratizing ML for speech (Practical AI #164)
Manage episode 317978943 series 1423445
You might know about MLPerf, a benchmark from MLCommons that measures how fast systems can train models to a target quality metric. However, MLCommons is working on so much more! David Kanter joins us in this episode to discuss two new speech datasets that are democratizing machine learning for speech via data scale and language/speaker diversity.
Changelog++ members save 3 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
- Changelog++ – You love our content and you want to take it to the next level by showing your support. We’ll take you closer to the metal with no ads, extended episodes, outtakes, bonus content, a deep discount in our merch store (soon), and more to come. Let’s do this!
- The Brave Browser – Browse the web up to 8x faster than Chrome and Safari, block ads and trackers by default, and reward your favorite creators with the built-in Basic Attention Token. Download Brave for free and give tipping a try right here on changelog.com.
- Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
Featuring:
- David Kanter – Twitter, GitHub
- Chris Benson – Twitter, GitHub, LinkedIn, Website
- Daniel Whitenack – Twitter, GitHub, Website
Show Notes:
- Press Release about MLCommons datasets: MLCommons™ Association Unveils Open Datasets and Tools to Drive Democratization of Machine Learning
- NeurIPS Papers:
- Gradient article: New Datasets to Democratize Speech Recognition Technology
- Blog posts for more insight:
- Downloads:
Something missing or broken? PRs welcome!
1974集单集
Manage episode 317978943 series 1423445
You might know about MLPerf, a benchmark from MLCommons that measures how fast systems can train models to a target quality metric. However, MLCommons is working on so much more! David Kanter joins us in this episode to discuss two new speech datasets that are democratizing machine learning for speech via data scale and language/speaker diversity.
Changelog++ members save 3 minutes on this episode because they made the ads disappear. Join today!
Sponsors:
- Changelog++ – You love our content and you want to take it to the next level by showing your support. We’ll take you closer to the metal with no ads, extended episodes, outtakes, bonus content, a deep discount in our merch store (soon), and more to come. Let’s do this!
- The Brave Browser – Browse the web up to 8x faster than Chrome and Safari, block ads and trackers by default, and reward your favorite creators with the built-in Basic Attention Token. Download Brave for free and give tipping a try right here on changelog.com.
- Fastly – Our bandwidth partner. Fastly powers fast, secure, and scalable digital experiences. Move beyond your content delivery network to their powerful edge cloud platform. Learn more at fastly.com
Featuring:
- David Kanter – Twitter, GitHub
- Chris Benson – Twitter, GitHub, LinkedIn, Website
- Daniel Whitenack – Twitter, GitHub, Website
Show Notes:
- Press Release about MLCommons datasets: MLCommons™ Association Unveils Open Datasets and Tools to Drive Democratization of Machine Learning
- NeurIPS Papers:
- Gradient article: New Datasets to Democratize Speech Recognition Technology
- Blog posts for more insight:
- Downloads:
Something missing or broken? PRs welcome!
1974集单集
所有剧集
×欢迎使用Player FM
Player FM正在网上搜索高质量的播客,以便您现在享受。它是最好的播客应用程序,适用于安卓、iPhone和网络。注册以跨设备同步订阅。