Best of ML Engineered in 2020 (Most Popular Episodes and Most Clicked Links)

For the last newsletter of 2020, I wanted to highlight the “best of” content I produced and featured since starting ML Engineered. Below you’ll find the top 5 most-downloaded podcast episodes and most-clicked newsletter links.

But before we get to that, a follow-up from last week:

More on Auto-Encoders for Out-of-Distribution Detection

In last week’s newsletter, I linked to Alejandro Saucedo’s excellent article on production ML monitoring, noting his use of auto-encoders for out-of-distribution detection on complex data.

Since then I’ve done additional reading on the topic and learned, among other things, that it can be viewed as a particular type of anomaly detection, for which there is a substantial literature. I’m not quite sure how production-ready this type of research is, but I would love to explore its possible use at work this year if I get the chance.

Regardless, here are the papers I found most useful on the topic:

(Chalapathy & Chawla, 2019) Deep Learning for Anomaly Detection: A Survey
(Ren et al., 2019) Likelihood Ratios for Out-of-Distribution Detection
(Pang et al., 2020) Deep Learning for Anomaly Detection: A Review
(Bulusu et al., 2020) Out-of-Distribution Detection in Deep Learning: A Survey
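
To make the basic idea concrete, here is a minimal sketch of reconstruction-error-based out-of-distribution detection. It is not taken from Saucedo's article or from any of the papers above. To stay dependency-light it uses a linear auto-encoder fitted with an SVD (which is equivalent to PCA) as a stand-in for a deep auto-encoder, and the synthetic data, the function names, and the 99th-percentile threshold are all assumptions made for illustration.

```python
import numpy as np


def fit_linear_autoencoder(X, n_latent):
    """Fit the optimal linear auto-encoder (equivalent to PCA) via SVD."""
    mean = X.mean(axis=0)
    # Rows of vt are principal directions; the top n_latent rows act as
    # tied encoder/decoder weights.
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, vt[:n_latent]


def reconstruction_error(X, mean, components):
    """Per-sample squared error between each input and its reconstruction."""
    centered = X - mean
    latent = centered @ components.T       # encode
    reconstructed = latent @ components    # decode
    return np.sum((centered - reconstructed) ** 2, axis=1)


rng = np.random.default_rng(0)

# Hypothetical in-distribution data: 10-D points lying near a 2-D subspace.
basis = rng.normal(size=(2, 10))
train = rng.normal(size=(1000, 2)) @ basis + 0.05 * rng.normal(size=(1000, 10))
mean, components = fit_linear_autoencoder(train, n_latent=2)

# Assumed decision rule: flag anything whose error exceeds the
# 99th percentile of the training errors.
threshold = np.percentile(reconstruction_error(train, mean, components), 99)

# Fresh in-distribution samples vs. samples that ignore the subspace entirely.
in_dist = rng.normal(size=(200, 2)) @ basis + 0.05 * rng.normal(size=(200, 10))
out_dist = 3.0 * rng.normal(size=(200, 10))

in_flag_rate = np.mean(reconstruction_error(in_dist, mean, components) > threshold)
out_flag_rate = np.mean(reconstruction_error(out_dist, mean, components) > threshold)
print(f"flagged in-distribution: {in_flag_rate:.2%}")
print(f"flagged out-of-distribution: {out_flag_rate:.2%}")
```

The model only learns to reconstruct data that looks like its training set, so inputs from elsewhere tend to reconstruct poorly and stand out. In practice the threshold would need tuning against a tolerable false-alarm rate, and a non-linear auto-encoder would replace the SVD for complex data such as images or text.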

5 Most Popular ML Engineered Episodes

5 Most Clicked Newsletter Links

“High Performance Natural Language Processing”: An ACL 2020 tutorial containing an overview of all the recent advances in deep learning for NLP
“Challenges in Deploying Machine Learning”: An awesome survey of case studies by University of Cambridge researchers on production ML
“Louis Dorard’s Machine Learning Canvas”: A super useful tool to get each team involved in an ML project on the same page (literally)
“Made With ML’s Applied ML”: Goku Mohandas’s pivot to online courses teaching practical machine learning
“Underspecification Presents Challenges for Credibility in Modern Machine Learning”: A new Google ML engineering paper highlighting an incredibly pernicious issue

