Podevcast

Experimenting With Reinforcement Learning Using MushroomRL

The Python Podcast.__init__

Date: Sun, 19 Sep 2021 16:00:00 -0400

<div class="wp-block-jetpack-markdown"><h2>Summary</h2> <p>Reinforcement learning is a branch of machine learning and AI that has a lot of promise for applications that need to evolve with changes to their inputs. To support the research happening in the field, including applications for robotics, Carlo D’Eramo and Davide Tateo created MushroomRL. In this episode they share how they have designed the project to be easy to work with, so that students can use it in their study, as well as extensible so that it can be used by businesses and industry professionals. They also discuss the strengths of reinforcement learning, how to design problems that can leverage its capabilities, and how to get started with MushroomRL for your own work.</p> <h2>Announcements</h2> <ul> <li>Hello and welcome to Podcast.__init__, the podcast about Python’s role in data and science.</li> <li>When you’re ready to launch your next app or want to try a project you hear about on the show, you’ll need somewhere to deploy it, so take a look at our friends over at Linode. With the launch of their managed Kubernetes platform it’s easy to get started with the next generation of deployment and scaling, powered by the battle-tested Linode platform, including simple pricing, node balancers, 40Gbit networking, dedicated CPU and GPU instances, and worldwide data centers. Go to <a href="https://www.pythonpodcast.com/linode?utm_source=rss&utm_medium=rss">pythonpodcast.com/linode</a> and get a $100 credit to try out a Kubernetes cluster of your own. 
And don’t forget to thank them for their continued support of this show!</li> <li>Your host as usual is Tobias Macey and today I’m interviewing Davide Tateo and Carlo D’Eramo about MushroomRL, a library for building reinforcement learning experiments</li> </ul> <h2>Interview</h2> <ul> <li>Introductions</li> <li>How did you get introduced to Python?</li> <li>Can you start by describing what reinforcement learning is and how it differs from other approaches for machine learning?</li> <li>What are some example use cases where reinforcement learning might be necessary?</li> <li>Can you describe what MushroomRL is and the story behind it? <ul> <li>Who are the target users of the project?</li> <li>What are its main goals?</li> </ul> </li> <li>What are your suggestions to other developers for implementing a successful library?</li> <li>What are some of the core concepts that researchers and/or engineers need to understand to be able to effectively use reinforcement learning techniques?</li> <li>Can you describe how MushroomRL is architected? <ul> <li>How have the goals and design of the project changed or evolved since you began working on it?</li> </ul> </li> <li>What is the workflow for building and executing an experiment with MushroomRL? 
<ul> <li>How do you track the states and outcomes of experiments?</li> </ul> </li> <li>What are some of the considerations involved in designing an environment and reward functions for an agent to interact with?</li> <li>What are some of the open questions that are being explored in reinforcement learning?</li> <li>How are you using MushroomRL in your own research?</li> <li>What are the most interesting, innovative, or unexpected ways that you have seen MushroomRL used?</li> <li>What are the most interesting, unexpected, or challenging lessons that you have learned while working on MushroomRL?</li> <li>When is MushroomRL the wrong choice?</li> <li>What do you have planned for the future of MushroomRL?</li> <li>How can the open-source community contribute to MushroomRL?</li> <li>What kind of support are you willing to provide to users?</li> </ul> <h2>Keep In Touch</h2> <ul> <li>Davide <ul> <li><a href="https://github.com/boris-il-forte?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">boris-il-forte</a> on GitHub</li> <li><a href="https://www.ias.informatik.tu-darmstadt.de/Team/DavideTateo?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Website</a></li> </ul> </li> <li>Carlo <ul> <li><a href="https://github.com/carloderamo?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">carloderamo</a> on GitHub</li> <li><a href="https://www.ias.informatik.tu-darmstadt.de/Team/CarloDEramo?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Website</a></li> </ul> </li> </ul> <h2>Picks</h2> <ul> <li>Tobias <ul> <li><a href="https://www.imdb.com/title/tt5932548/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Britannia TV Series</a></li> </ul> </li> <li>Davide <ul> <li><a href="https://en.wikipedia.org/wiki/Nineteen_Eighty-Four?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">1984</a> by George Orwell</li> </ul> </li> <li>Carlo <ul> <li><a 
href="https://en.wikipedia.org/wiki/Twin_Peaks?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Twin Peaks</a> TV Series</li> </ul> </li> </ul> <h2>Closing Announcements</h2> <ul> <li>Thank you for listening! Don’t forget to check out our other show, the <a href="https://www.dataengineeringpodcast.com?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Data Engineering Podcast</a> for the latest on modern data management.</li> <li>Visit the <a href="https://www.pythonpodcast.com?utm_source=rss&utm_medium=rss">site</a> to subscribe to the show, sign up for the mailing list, and read the show notes.</li> <li>If you’ve learned something or tried out a project from the show then tell us about it! Email <a href="mailto:hosts@podcastinit.com">hosts@podcastinit.com</a> with your story.</li> <li>To help other people find the show please leave a review on <a href="https://itunes.apple.com/us/podcast/podcast.-init/id981834425?mt=2&uo=6&at=&ct=&utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">iTunes</a> and tell your friends and co-workers</li> <li>Join the community in the new Zulip chat workspace at <a href="https://www.pythonpodcast.com/chat?utm_source=rss&utm_medium=rss">pythonpodcast.com/chat</a></li> </ul> <h2>Links</h2> <ul> <li><a href="https://mushroomrl.readthedocs.io/en/latest/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">MushroomRL</a></li> <li><a href="https://www.tu-darmstadt.de/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">TU Darmstadt</a></li> <li><a href="http://www.mujoco.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">MuJoCo</a></li> <li><a href="https://pybullet.org/wordpress/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">PyBullet</a></li> <li><a href="http://svl.stanford.edu/igibson/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">iGibson</a></li> <li><a href="https://aihabitat.org/?utm_source=rss&utm_medium=rss" rel="noopener" 
target="_blank">Habitat</a></li> <li><a href="https://gym.openai.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">OpenAI Gym</a></li> <li><a href="https://pytorch.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">PyTorch</a> <ul> <li><a href="https://www.pythonpodcast.com/pytorch-deep-learning-epsiode-202/?utm_source=rss&utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://ray.readthedocs.io/en/latest/rllib.html?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">RLLib</a></li> <li><a href="https://docs.ray.io/en/latest/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Ray</a> <ul> <li><a href="https://www.pythonpodcast.com/ray-distributed-computing-episode-258/?utm_source=rss&utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://github.com/openai/baselines?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">OpenAI Baselines</a></li> <li><a href="https://github.com/DLR-RM/stable-baselines3?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Stable Baselines</a></li> <li><a href="https://www.ros.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">ROS</a></li> </ul> <p>The intro and outro music is from Requiem for a Fish <a href="http://freemusicarchive.org/music/The_Freak_Fandango_Orchestra/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">The Freak Fandango Orchestra</a> / <a href="http://creativecommons.org/licenses/by-sa/3.0/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">CC BY-SA</a></p> </div>
