Date: Sun, 21 Oct 2018 22:00:00 -0400
<h3>Summary</h3> <p>As data science becomes more widespread and has a bigger impact on people's lives, it is important that data projects and products are built with conscious consideration of ethics. Keeping ethical principles in mind throughout the lifecycle of a data project helps to reduce the overall effort of preventing negative outcomes from the use of the final product. Emily Miller and Peter Bull of Driven Data have created Deon to improve the communication and conversation around ethics among and between data teams. It is a Python project that generates a checklist of common concerns for data-oriented projects at the various stages of the lifecycle where they should be considered. In this episode they discuss their motivation for creating the project, the challenges and benefits of maintaining such a checklist, and how you can start using it today.</p> <h3>Preface</h3> <ul> <li>Hello and welcome to Podcast.__init__, the podcast about Python and the people who make it great.</li> <li>When you’re ready to launch your next app you’ll need somewhere to deploy it, so check out Linode. With private networking, shared block storage, node balancers, and a 40Gbit network, all controlled by a brand new API, you’ve got everything you need to scale up. Go to <a href="https://www.pythonpodcast.com/linode?utm_source=rss&utm_medium=rss">podcastinit.com/linode</a> to get a $20 credit and launch a new server in under a minute.</li> <li>Visit the <a href="https://www.pythonpodcast.com?utm_source=rss&utm_medium=rss">site</a> to subscribe to the show, sign up for the newsletter, and read the show notes. And if you have any questions, comments, or suggestions I would love to hear them. 
You can reach me on Twitter at <a href="https://twitter.com/podcastinit?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">@Podcast__init__</a> or email <a href="mailto:hosts@podcastinit.com">hosts@podcastinit.com</a>.</li> <li>To help other people find the show please leave a review on <a href="https://itunes.apple.com/us/podcast/podcast.-init/id981834425?mt=2&uo=6&at=&ct=&utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">iTunes</a>, or <a href="https://play.google.com/music/m/I7ogju4xv6adasgqz6545jndgsy?t=Podcastinit_-_Python_and_the_people_who_make_it_great&utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Google Play Music</a>, tell your friends and co-workers, and share it on social media.</li> <li>Join the community in the new Zulip chat workspace at <a href="https://www.pythonpodcast.com/chat?utm_source=rss&utm_medium=rss">podcastinit.com/chat</a></li> <li>Your host as usual is Tobias Macey and today I’m interviewing Emily Miller and Peter Bull about Deon, an ethics checklist for data projects.</li> </ul> <h3>Interview</h3> <ul> <li>Introductions</li> <li>How did you get introduced to Python?</li> <li>Can you start by describing what Deon is and your motivation for creating it?</li> <li>Why a checklist, specifically? What’s the advantage of this over an oath, for example?</li> <li>What is unique to data science in terms of the ethical concerns, as compared to traditional software engineering?</li> <li>What is the typical workflow for a team that is using Deon in their projects?</li> <li>Deon ships with a default checklist but allows for customization. What are some common addendums that you have seen? 
<ul> <li>Have you received pushback on any of the default items?</li> </ul> </li> <li>How does Deon simplify communication around ethics across team boundaries?</li> <li>What are some of the most often overlooked items?</li> <li>What are some of the most difficult ethical concerns to comply with for a typical data science project?</li> <li>How has Deon helped you at Driven Data?</li> <li>What are the customer facing impacts of embedding a discussion of ethics in the product development process?</li> <li>Some of the items on the default checklist coincide with regulatory requirements. Are there any cases where regulation is in conflict with an ethical concern that you would like to see practiced?</li> <li>What are your hopes for the future of the Deon project?</li> </ul> <h3>Keep In Touch</h3> <ul> <li>Emily <ul> <li><a href="https://www.linkedin.com/in/emily-miller/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">LinkedIn</a></li> <li><a href="https://github.com/ejm714?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">ejm714</a> on GitHub</li> </ul> </li> <li>Peter <ul> <li><a href="https://www.linkedin.com/in/pjbull/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">LinkedIn</a></li> <li><a href="https://twitter.com/pjbull?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">@pjbull</a> on Twitter</li> <li><a href="https://github.com/pjbull?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">pjbull</a> on GitHub</li> </ul> </li> <li>Driven Data <ul> <li><a href="https://twitter.com/drivendataorg?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">@drivendataorg</a> on Twitter</li> <li><a href="https://github.com/drivendataorg?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">drivendataorg</a> on GitHub</li> <li><a href="http://drivendata.co/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Website</a></li> </ul> </li> </ul> <h3>Picks</h3> <ul> <li>Tobias <ul> <li><a 
href="http://richardbondartist.yolasite.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Richard Bond Glass Art</a></li> </ul> </li> <li>Emily <ul> <li><a href="https://www.tandemcoffee.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Tandem Coffee</a> in <a href="https://www.visitportland.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Portland, Maine</a></li> </ul> </li> <li>Peter <ul> <li><a href="https://www.themodelbakery.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">The Model Bakery</a> in Saint Helena and Napa, California</li> </ul> </li> </ul> <h3>Links</h3> <ul> <li><a href="https://deon.drivendata.org?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Deon</a></li> <li><a href="http://drivendata.co?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Driven Data</a></li> <li><a href="https://en.wikipedia.org/wiki/International_development?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">International Development</a></li> <li><a href="https://www.brookings.edu?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Brookings Institution</a></li> <li><a href="https://www.stata.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Stata</a></li> <li><a href="https://en.wikipedia.org/wiki/Econometrics?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Econometrics</a></li> <li><a href="https://thisismetis.com?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Metis Bootcamp</a></li> <li><a href="https://pandas.pydata.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Pandas</a> <ul> <li><a href="https://www.pythonpodcast.com/episode-98-pandas-with-jeff-reback/?utm_source=rss&utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://en.wikipedia.org/wiki/C_Sharp_(programming_language)?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">C#</a></li> <li><a 
href="https://feeds.fireside.fm/pythonpodcast/rss">.NET</a></li> <li><a href="https://www.pythonpodcast.com/episode-17-glyph-on-ethics-in-software/?utm_source=rss&utm_medium=rss">Podcast.__init__ Episode On Software Ethics</a></li> <li><a href="https://jupyter.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Jupyter Notebook</a> <ul> <li><a href="https://www.pythonpodcast.com/episode-10-brian-granger-and-fernando-perez-of-the-ipython-project/?utm_source=rss&utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://en.wikipedia.org/wiki/Word2vec?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Word2Vec</a></li> <li><a href="https://drivendata.github.io/cookiecutter-data-science?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">cookiecutter data science</a></li> <li><a href="https://en.wikipedia.org/wiki/Logistic_regression?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Logistic Regression</a></li> </ul> <p>The intro and outro music is from Requiem for a Fish by <a href="http://freemusicarchive.org/music/The_Freak_Fandango_Orchestra/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">The Freak Fandango Orchestra</a> / <a href="http://creativecommons.org/licenses/by-sa/3.0/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">CC BY-SA</a></p>