Networking Optimizations for Multi-Node Deep Learning on Kubernetes with Erez Cohen - #345

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Episode | Podcast

Date: Wed, 05 Feb 2020 17:33:00 -0000

Today we conclude our KubeCon ’19 series, joined by Erez Cohen, VP of CloudX & AI at Mellanox, who we caught up with before his talk, “Networking Optimizations for Multi-Node Deep Learning on Kubernetes.” In our conversation, we discuss NVIDIA’s recent acquisition of Mellanox, the evolution of technologies like RDMA and GPUDirect, how Mellanox is enabling Kubernetes and other platforms to take advantage of recent advancements in networking tech, and why we should care about networking in Deep Lea