Neural Network Quantization and Compression with Tijmen Blankevoort - TWIML Talk #292

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Date: Mon, 19 Aug 2019 18:07:03 -0000

Today we’re joined by Tijmen Blankevoort, a staff engineer at Qualcomm who leads the company’s compression and quantization research teams. In our conversation with Tijmen, we discuss:
• The ins and outs of compression and quantization of ML models, specifically neural networks
• How much models can actually be compressed, and the best way to achieve compression
• A few recent papers, including “The Lottery Ticket Hypothesis”