Date: Mon, 08 Mar 2021 19:00:00 -0500
<div class="wp-block-jetpack-markdown"><h3>Summary</h3> <p>There is a large and growing number of businesses built by and for data science and machine learning teams that rely on Python. Tony Liu is a venture investor who is following that market closely and betting on its continued success. In this episode he shares his own journey into the role of an investor and discusses what he is most excited about in the industry. He also explains what he looks at when investing in a business and gives advice on what potential founders and early employees of startups should be thinking about when starting on that journey.</p> <h3>Announcements</h3> <ul> <li>Hello and welcome to Podcast.__init__, the podcast about Python and the people who make it great.</li> <li>When you’re ready to launch your next app or want to try a project you hear about on the show, you’ll need somewhere to deploy it, so take a look at our friends over at Linode. With the launch of their managed Kubernetes platform it’s easy to get started with the next generation of deployment and scaling, powered by the battle-tested Linode platform, including simple pricing, node balancers, 40Gbit networking, dedicated CPU and GPU instances, and worldwide data centers. Go to <a href="https://www.pythonpodcast.com/linode?utm_source=rss&utm_medium=rss">pythonpodcast.com/linode</a> and get a $100 credit to try out a Kubernetes cluster of your own. And don’t forget to thank them for their continued support of this show!</li> <li>Your host as usual is Tobias Macey, and today I’m interviewing Tony Liu about his perspectives on the landscape of Python in the data ecosystem from his role as an investor.</li> </ul> <h3>Interview</h3> <ul> <li>Introductions</li> <li>How did you get introduced to Python?</li> <li>Can you start by sharing your background in the data ecosystem?</li> <li>What led you to your current role as a venture investor? 
<ul> <li>What is your current area of focus in your investments?</li> </ul> </li> <li>What do you see as the major strengths of Python in the current landscape for data and analytics? <ul> <li>What are the areas where the ecosystem is still lacking?</li> <li>Where are you seeing growth in the space and what do you see as the motivating factors?</li> </ul> </li> <li>As an investor, what are the qualities that you look for in a startup that is trying to compete in the data ecosystem? <ul> <li>What is your process for learning about and identifying companies that demonstrate the potential to succeed?</li> <li>Do you focus on a particular problem domain and research a grouping of companies that are focused on that problem, or do you start from a given company to determine where to place your bets?</li> <li>How has COVID changed the competitive landscape?</li> </ul> </li> <li>Can you share some of the companies that you have invested in? <ul> <li>What was notable about their respective businesses that provided you with the confidence that they were worth investing in?</li> </ul> </li> <li>What are some of the most interesting, unexpected, or challenging lessons that you have learned from your experience as a venture investor?</li> <li>What are some of the companies that you are keeping a close eye on, whether as potential investments or as competitors to your existing portfolio?</li> <li>What are some of the problem spaces that you would like to see companies try to tackle?</li> <li>What advice do you have for engineers who might be considering building a new business? 
<ul> <li>Do you have any advice for engineers who are working at a startup as to how best to compete in the current market?</li> </ul> </li> </ul> <h3>Keep In Touch</h3> <ul> <li><a href="https://www.linkedin.com/in/tonydl/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">LinkedIn</a></li> </ul> <h3>Picks</h3> <ul> <li>Tobias <ul> <li><a href="https://www.imdb.com/title/tt10888708/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">The Sleepover</a> movie</li> <li><a href="https://www.youtube.com/watch?v=kRRmQ1Tz-Ao&utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">What do ya do with a Bernie Sanders?</a> music video</li> </ul> </li> <li>Tony <ul> <li><a href="https://www.imdb.com/title/tt5727208/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Uncut Gems</a></li> </ul> </li> </ul> <h3>Closing Announcements</h3> <ul> <li>Thank you for listening! Don’t forget to check out our other show, the <a href="https://feeds.fireside.fm/pythonpodcast/rss">Data Engineering Podcast</a> for the latest on modern data management.</li> <li>Visit the <a href="https://www.pythonpodcast.com?utm_source=rss&utm_medium=rss">site</a> to subscribe to the show, sign up for the mailing list, and read the show notes.</li> <li>If you’ve learned something or tried out a project from the show then tell us about it! 
Email <a href="mailto:hosts@podcastinit.com">hosts@podcastinit.com</a> with your story.</li> <li>To help other people find the show please leave a review on <a href="https://itunes.apple.com/us/podcast/podcast.-init/id981834425?mt=2&uo=6&at=&ct=&utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">iTunes</a> and tell your friends and co-workers.</li> <li>Join the community in the new Zulip chat workspace at <a href="https://www.pythonpodcast.com/chat?utm_source=rss&utm_medium=rss">pythonpodcast.com/chat</a></li> </ul> <h3>Links</h3> <ul> <li><a href="https://www.costanoavc.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Costanoa Ventures</a></li> <li><a href="https://en.wikipedia.org/wiki/Sports_analytics?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Sports Analytics</a></li> <li><a href="https://turo.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Turo</a></li> <li><a href="https://databricks.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Databricks</a></li> <li><a href="https://koalas.readthedocs.io/en/latest/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Koalas</a></li> <li><a href="https://www.datarobot.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">DataRobot</a></li> <li><a href="https://faust.readthedocs.io/en/latest/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Faust</a> <ul> <li><a href="https://www.pythonpodcast.com/fast-stream-processing-in-python-using-faust-with-ask-solem-episode-176/?utm_source=rss&utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://oozie.apache.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Oozie</a></li> <li><a href="https://azkaban.github.io/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Azkaban</a></li> <li><a href="https://airflow.apache.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Airflow</a> <ul> <li><a 
href="https://www.pythonpodcast.com/episode-44-airflow-with-maxime-beauchemin/?utm_source=rss&utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://www.prefect.io/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Prefect</a> <ul> <li><a href="https://www.dataengineeringpodcast.com/prefect-workflow-engine-episode-86/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Data Engineering Podcast Episode</a></li> </ul> </li> <li><a href="https://dagster.io/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Dagster</a> <ul> <li><a href="https://www.pythonpodcast.com/dagster-data-orchestration-episode-279/?utm_source=rss&utm_medium=rss">Podcast Episode</a></li> <li><a href="https://www.dataengineeringpodcast.com/dagster-data-applications-episode-104/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Data Engineering Podcast Episode</a></li> </ul> </li> <li><a href="https://www.kubeflow.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Kubeflow</a></li> <li><a href="https://mlflow.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">MLFlow</a></li> <li><a href="https://metaflow.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Metaflow</a> <ul> <li><a href="https://www.pythonpodcast.com/metaflow-machine-learning-operations-episode-274/?utm_source=rss&utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://pandas.pydata.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Pandas</a> <ul> <li><a href="https://www.pythonpodcast.com/episode-98-pandas-with-jeff-reback/?utm_source=rss&utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://spark.apache.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Spark</a> <ul> <li><a href="https://www.dataengineeringpodcast.com/putting-apache-spark-into-action-with-jean-georges-perrin-episode-60/?utm_source=rss&utm_medium=rss" rel="noopener" 
target="_blank">Data Engineering Podcast Episode</a></li> </ul> </li> <li><a href="https://www.getdbt.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">DBT</a> <ul> <li><a href="https://www.dataengineeringpodcast.com/dbt-data-analytics-episode-81/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Data Engineering Podcast Episode</a></li> </ul> </li> <li><a href="https://www.snowflake.com/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">SnowflakeDB</a> <ul> <li><a href="https://www.dataengineeringpodcast.com/snowflakedb-cloud-data-warehouse-episode-110/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Data Engineering Podcast Episode</a></li> </ul> </li> <li><a href="https://coiled.io/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Coiled</a> <ul> <li><a href="https://www.pythonpodcast.com/coiled-dask-python-data-science-episode-275/?utm_source=rss&utm_medium=rss">Podcast Episode</a></li> </ul> </li> <li><a href="https://noteable.io/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Noteable</a></li> <li><a href="https://dask.org/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Dask</a> <ul> <li><a href="https://www.dataengineeringpodcast.com/episode-2-dask-with-matthew-rocklin/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Data Engineering Podcast Episode</a></li> </ul> </li> <li><a href="https://www.dataengineeringpodcast.com/using-notebooks-as-the-unifying-layer-for-data-roles-at-netflix-with-matthew-seal-episode-54/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">Data Engineering Podcast Episode About Notebooks at Netflix</a></li> </ul> <p>The intro and outro music is from Requiem for a Fish <a href="http://freemusicarchive.org/music/The_Freak_Fandango_Orchestra/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">The Freak Fandango Orchestra</a> / <a 
href="http://creativecommons.org/licenses/by-sa/3.0/?utm_source=rss&utm_medium=rss" rel="noopener" target="_blank">CC BY-SA</a></p> </div>