Scaling Model Training with Kubernetes at Stripe with Kelley Rivoire - TWIML Talk #272

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Episode | Podcast

Date: Thu, 06 Jun 2019 16:34:42 -0000

Today we’re joined by Kelley Rivoire, engineering manager working on machine learning infrastructure at Stripe. Kelley and I caught up at a recent Strata Data conference to discuss: • Her talk "Scaling model training: From flexible training APIs to resource management with Kubernetes." • Stripe’s machine learning infrastructure journey, including their start from a production focus. • Internal tools used at Stripe, including Railyard, an API built to manage model training at scale & more!