Podevcast

When data leakage turns into a flood of trouble

Practical AI: Machine Learning, Data Science

Episode | Podcast

Date: Tue, 20 Oct 2020 14:10:00 +0000

Rajiv Shah teaches Daniel and Chris about data leakage, and its major impact upon machine learning models. It’s the kind of topic that we don’t often think about, but which can ruin our results. Raj discusses how to use activation maps and image embedding to find leakage, so that leaking information in our test set does not find its way into our training set.