Accelerate The Development And Delivery Of Your Machine Learning Applications Using Ray And Deploy It At Anyscale

The Python Podcast.__init__

Episode | Podcast

Date: Sun, 06 Mar 2022 16:00:00 -0500

<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Building a machine learning application is inherently complex. Once it becomes necessary to scale the operation or training of the model, or introduce online re-training the process becomes even more challenging. In order to reduce the operational burden of AI developers Robert Nishihara helped to create the Ray framework that handles the distributed computing aspects of machine learning operations. To support the ongoing development and simplify adoption of Ray he co-founded Anyscale. In this episode he re-joins the show to share how the project, its community, and the ecosystem around it have grown and evolved over the intervening two years. He also explains how the techniques and adoption of machine learning have influenced the direction of the project.</p> <h2>Announcements</h2> <ul> <li>Hello and welcome to Podcast.__init__, the podcast about Python&#8217;s role in data and science.</li> <li>When you&#8217;re ready to launch your next app or want to try a project you hear about on the show, you&#8217;ll need somewhere to deploy it, so take a look at our friends over at Linode. With the launch of their managed Kubernetes platform it&#8217;s easy to get started with the next generation of deployment and scaling, powered by the battle tested Linode platform, including simple pricing, node balancers, 40Gbit networking, dedicated CPU and GPU instances, and worldwide data centers. Go to <a href="https://www.pythonpodcast.com/linode?utm_source=rss&amp;utm_medium=rss">pythonpodcast.com/linode</a> and get a $100 credit to try out a Kubernetes cluster of your own. And don&#8217;t forget to thank them for their continued support of this show!</li> <li>Your host as usual is Tobias Macey and today I&#8217;m interviewing Robert Nishihara about his work at Anyscale and the Ray distributed execution framework</li> </ul> <h2>Interview</h2> <ul> <li>Introductions</li> <li>How did you get introduced to Python?</li> <li>Can you describe what Anyscale is and the story behind it?</li> <li>How has the Ray project and ecosystem evolved since we last spoke? (2 years ago) <ul> <li>How has the landscape of AI/ML technologies and techniques shifted in that time?</li> </ul> </li> <li>What are the main areas where organizations are trying to apply ML/AI?</li> <li>What are some of the issues that teams encounter when trying to move from prototype to production with ML/AI applications? <ul> <li>What are the features of Ray that help to mitigate those challenges?</li> </ul> </li> <li>With the introduction of more widely available streaming/real-time technologies the viability of reinforcement learning has increased. What new challenges does that approach introduce?</li> <li>What are some of the operational complexities associated with managing a deployment of Ray? <ul> <li>What are some of the specialized utilities that you have had to develop to maintain a large and multi-tenant platform for your customers?</li> </ul> </li> <li>What is the governance model around the Ray project and how does the work at Anyscale influence the roadmap?</li> <li>What are the most interesting, innovative, or unexpected ways that you have seen Anyscale/Ray used?</li> <li>What are the most interesting, unexpected, or challenging lessons that you have learned while working on Ray and Anyscale?</li> <li>When is Anyscale/Ray the wrong choice?</li> <li>What do you have planned for the future of Anyscale/Ray?</li> </ul> <h2>Keep In Touch</h2> <ul> <li><a href="https://github.com/robertnishihara?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">robertnishihara</a> on GitHub</li> <li><a href="https://twitter.com/robertnishihara?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">@robertnishihara</a> on Twitter</li> <li><a href="http://www.robertnishihara.com/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Website</a></li> <li><a href="https://www.linkedin.com/in/robert-nishihara-b6465444/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">LinkedIn</a></li> </ul> <h2>Picks</h2> <ul> <li>Tobias <ul> <li><a href="https://edgechronicles.co.uk/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">The Edge Chronicles</a>: <a href="http://www.edgechronicles.co.uk/discover-the-books/the-twig-saga/beyond-the-deepwoods/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Beyond The Deepwoods</a></li> </ul> </li> <li>Robert <ul> <li><a href="https://www.anyscale.com/production-rl-summit?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Production RL Summit</a></li> <li><a href="https://www.andyweirauthor.com/books/project-hail-mary-hc/project-hail-mary-el?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Project Hail Mary</a> by Andy Weir</li> </ul> </li> </ul> <h2>Closing Announcements</h2> <ul> <li>Thank you for listening! Don&#8217;t forget to check out our other show, the <a href="https://www.dataengineeringpodcast.com?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Data Engineering Podcast</a> for the latest on modern data management.</li> <li>Visit the <a href="https://www.pythonpodcast.com?utm_source=rss&amp;utm_medium=rss">site</a> to subscribe to the show, sign up for the mailing list, and read the show notes.</li> <li>If you&#8217;ve learned something or tried out a project from the show then tell us about it! Email <a href="mailto:hosts@podcastinit.com">hosts@podcastinit.com</a>) with your story.</li> <li>To help other people find the show please leave a review on <a href="https://itunes.apple.com/us/podcast/podcast.-init/id981834425?mt=2&amp;uo=6&amp;at=&amp;ct=&amp;utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">iTunes</a> and tell your friends and co-workers</li> </ul> <h2>Links</h2> <ul> <li><a href="https://docs.ray.io/en/latest/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Ray</a> <ul> <li><a href="https://www.pythonpodcast.com/ray-distributed-computing-episode-258/?utm_source=rss&amp;utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://www.anyscale.com/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Anyscale</a></li> <li><a href="https://www.berkeley.edu/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">UC Berkeley</a></li> <li><a href="https://www.mathworks.com/products/matlab.html?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Matlab</a></li> <li><a href="https://en.wikipedia.org/wiki/Deep_learning?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Deep Learning</a></li> <li><a href="https://pandas.pydata.org/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Pandas</a></li> <li><a href="https://numpy.org/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">NumPy</a></li> <li><a href="https://horovod.ai/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Horovod</a> <ul> <li><a href="https://www.pythonpodcast.com/ludwig-horovod-distributed-declarative-deep-learning-episode-341/?utm_source=rss&amp;utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://xgboost.ai/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">XGBoost</a></li> <li><a href="https://modin.readthedocs.io/en/latest/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Modin</a> <ul> <li><a href="https://www.pythonpodcast.com/modin-parallel-dataframe-episode-324/?utm_source=rss&amp;utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://dask.org/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Dask</a></li> <li><a href="https://docs.ray.io/en/latest/data/dataset.html?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Ray Datasets</a></li> <li><a href="https://en.wikipedia.org/wiki/Reinforcement_learning?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Reinforcement Learning</a></li> <li><a href="https://www.anyscale.com/production-rl-summit?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Production Reinforcement Learning Summit</a></li> <li><a href="https://en.wikipedia.org/wiki/AlphaGo?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">AlphaGo</a></li> <li><a href="https://databricks.com/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Databricks</a></li> <li><a href="https://www.snowflake.com/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Snowflake</a> <ul> <li><a href="https://www.dataengineeringpodcast.com/snowflakedb-cloud-data-warehouse-episode-110/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Data Engineering Podcast Episode</a></li> </ul> </li> <li><a href="https://en.wikipedia.org/wiki/Tensor_Processing_Unit?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">TPU == Tensor Processing Unit</a></li> <li><a href="https://www.wandb.com/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Weights and Biases</a></li> <li><a href="https://mlflow.org/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">MLFlow</a></li> <li><a href="https://docs.ray.io/en/latest/rllib/index.html?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">RLLib</a></li> <li><a href="https://docs.ray.io/en/latest/serve/index.html?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">Ray Serve</a></li> </ul> <p>The intro and outro music is from Requiem for a Fish <a href="http://freemusicarchive.org/music/The_Freak_Fandango_Orchestra/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">The Freak Fandango Orchestra</a> / <a href="http://creativecommons.org/licenses/by-sa/3.0/?utm_source=rss&amp;utm_medium=rss" rel="noopener" target="_blank">CC BY-SA</a></p> </div> <img alt="" height="0" src="https://analytics.boundlessnotions.com/piwik.php?idsite=1&amp;rec=1&amp;url=https%3A%2F%2Fwww.pythonpodcast.com%2Fanyscale-machine-learning-applications-episode-355%2F&amp;action_name=Accelerate+The+Development+And+Delivery+Of+Your+Machine+Learning+Applications+Using+Ray+And+Deploy+It+At+Anyscale+-+Episode+355&amp;urlref=https%3A%2F%2Fwww.pythonpodcast.com%2Ffeed%2F&amp;utm_source=rss&amp;utm_medium=rss" style="border: 0; width: 0; height: 0;" width="0" />