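Content provided by Dr. Andrew Clark & Sid Mangalik, Dr. Andrew Clark, and Sid Mangalik. All podcast content (including episodes, graphics, and podcast descriptions) is uploaded and provided directly by Dr. Andrew Clark & Sid Mangalik, Dr. Andrew Clark, and Sid Mangalik or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here: https://zh.player.fm/legal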

Baseline modeling and its critical role in AI and business performance

36:23
 

Baseline modeling is a necessary part of model validation. In our expert opinion, it should be required before model deployment. There are many baseline modeling types and in this episode, we're discussing their use cases, strengths, and weaknesses. We're sure you'll appreciate a fresh take on how to improve your modeling practices.
Show notes
Introductions and news: why reporting and visibility are good for AI 0:03

  • Spoiler alert: Providing visibility to AI bias audits does NOT mean exposing trade secrets. Some reports claim otherwise.
  • Discussion about AI regulation in the context of current events and how regulation is playing out between Boeing and the FAA (tbc)

Understanding baseline modeling for machine learning 7:41

  • Establishing baselines allows us to understand how models perform relative to simple rules-based models, aka heuristics.
  • Reporting results without baselines to compare against is like giving a movie a rating of 5 without telling the listener that you were using a 10-point scale.
  • Baseline modeling comparisons are part of rigorous model validations and should always be conducted during early model development and final production deployment.
  • Pairs with analyses of theoretical upper bounds for modeling performance to show how your technique scores between acceptable worst and best case performance.
  • We often find complex models deployed in the real world that haven't proven their value over simpler, explainable baseline models.

Classification baselines and model performance comparison 19:40

  • Uniform Random Selection - simulate how your model does against a baseline model that guesses classes at random, like rolling a die.
  • Most Frequent Class (MFC) - always predict the majority class; often the most telling test in the case of highly skewed data with inappropriate metrics.
  • Single-feature modeling - Validates how much the complex signal from your data and model improves over a bare minimum explainable model.
  • And more…
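
The three classification baselines named above could be sketched roughly as follows. This is a minimal illustration and not code from the episode: the function names, the toy data, and the choice of a one-threshold rule as the "single-feature" model are all assumptions made here, and it uses only the Python standard library.

```python
import random
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def uniform_random_baseline(y_train, n, seed=0):
    """Guess one of the observed classes uniformly at random, n times."""
    rng = random.Random(seed)
    classes = sorted(set(y_train))
    return [rng.choice(classes) for _ in range(n)]


def most_frequent_class_baseline(y_train, n):
    """Always predict the majority class seen in training."""
    majority = Counter(y_train).most_common(1)[0][0]
    return [majority] * n


def single_feature_baseline(x_train, y_train, x_test):
    """One-threshold rule on a single numeric feature (an assumed stand-in
    for a 'bare minimum explainable model')."""
    classes = sorted(set(y_train))
    if len(classes) < 2:
        return [classes[0]] * len(x_test)
    best = None
    for threshold in sorted(set(x_train)):
        for low in classes:
            for high in classes:
                if low == high:
                    continue
                preds = [low if x <= threshold else high for x in x_train]
                acc = accuracy(y_train, preds)
                if best is None or acc > best[0]:
                    best = (acc, threshold, low, high)
    _, threshold, low, high = best
    return [low if x <= threshold else high for x in x_test]


# Made-up, highly skewed toy data: 95 negatives, 5 positives.
y = [0] * 95 + [1] * 5
x = list(range(100))

mfc_accuracy = accuracy(y, most_frequent_class_baseline(y, len(y)))
random_accuracy = accuracy(y, uniform_random_baseline(y, len(y)))
stump_accuracy = accuracy(y, single_feature_baseline(x, y, x))
print(mfc_accuracy, random_accuracy, stump_accuracy)
```

On this toy data the MFC baseline reaches an accuracy of 0.95 without learning anything, which is why a reported accuracy on skewed data means little until it is set beside that number.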

Exploring regression and more advanced baselines for modeling 24:11

  • Regression baselines: mean, median, mode, single-variable linear regression, Lag 1, and Least 5% re-interpretation
  • Advanced baselines in language and vision
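
Several of the regression baselines listed above (mean, median, Lag 1, and single-variable linear regression) can be sketched in a few lines. This is an illustrative sketch under assumptions: the function names, the use of mean absolute error as the metric, and the toy numbers are invented here and are not from the episode. The "Least 5% re-interpretation" baseline is left out because the notes do not define it.

```python
from statistics import mean, median


def mae(y_true, y_pred):
    """Mean absolute error between targets and predictions."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))


def constant_baseline(y_train, n, stat=mean):
    """Predict one summary statistic (mean, median, ...) for every case."""
    value = stat(y_train)
    return [value] * n


def lag1_baseline(series):
    """Predict each value as the previous one.
    Returns (targets, predictions) aligned from the second point on."""
    return series[1:], series[:-1]


def single_variable_linear_baseline(x_train, y_train, x_test):
    """Ordinary least squares fit on a single feature."""
    x_bar, y_bar = mean(x_train), mean(y_train)
    sxx = sum((x - x_bar) ** 2 for x in x_train)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(x_train, y_train))
    slope = sxy / sxx  # assumes x_train is not constant
    intercept = y_bar - slope * x_bar
    return [intercept + slope * x for x in x_test]


# Made-up toy series for illustration only.
series = [10, 12, 13, 15, 18, 21]
targets, lag_preds = lag1_baseline(series)
lag1_mae = mae(targets, lag_preds)
mean_mae = mae(series, constant_baseline(series, len(series)))
median_mae = mae(series, constant_baseline(series, len(series), stat=median))
line_pred = single_variable_linear_baseline([1, 2, 3, 4], [3, 5, 7, 9], [5])
print(lag1_mae, mean_mae, median_mae, line_pred)
```

A candidate regression model would be scored with the same metric on the same data, so its error can be read directly against each of these numbers.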

Conclusions 35:39

  • Baseline modeling is a necessary part of model validation
  • There are differing flavors of baselines that are appropriate for all types of modeling
  • Baselines are needed to establish fair and realistic lower bounds for performance
  • If your model can't perform significantly better than a baseline, consider scrapping the model and trying a new approach.
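
One way to make "significantly better than a baseline" concrete is a paired bootstrap confidence interval on the accuracy difference. The notes do not say which test the hosts have in mind, so this is a swapped-in technique, and the function name, parameters, and toy data below are assumptions for illustration.

```python
import random


def bootstrap_diff_ci(model_correct, baseline_correct,
                      n_boot=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for (model accuracy - baseline accuracy).

    Both inputs are per-example 0/1 correctness lists for the SAME
    test examples, so each resample keeps the pairing intact.
    """
    rng = random.Random(seed)
    n = len(model_correct)
    diffs = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        diff = sum(model_correct[i] - baseline_correct[i] for i in idx) / n
        diffs.append(diff)
    diffs.sort()
    lower = diffs[int((alpha / 2) * n_boot)]
    upper = diffs[int((1 - alpha / 2) * n_boot) - 1]
    return lower, upper


# Made-up correctness flags: model right on 90/100, baseline on 60/100.
model_correct = [1] * 90 + [0] * 10
baseline_correct = [1] * 60 + [0] * 40
lower, upper = bootstrap_diff_ci(model_correct, baseline_correct)
print(lower, upper)
```

If the interval excludes zero, the improvement over the baseline is unlikely to be resampling noise; if it straddles zero, the model has not yet shown its value over the baseline.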

Do you have a question or a discussion topic for the AI Fundamentalists? Connect with them to comment on your favorite topics:

  • LinkedIn - Episode summaries, shares of cited articles, and more.
  • YouTube - Was it something that we said? Good. Share your favorite quotes.
  • Visit our page - see past episodes and submit your feedback! It continues to inspire future episodes.