Dataprep with Eric Anderson

Google Cloud Platform Podcast

Episode | Podcast

Date: Wed, 22 Nov 2017 00:00:00 +0000

<p>On this week’s podcast, <a href="https://twitter.com/ericmander">Eric Anderson</a> shares how <a href="https://cloud.google.com/dataprep/">Dataprep</a> helps summarize, transform, visualize and cleanup data on the Google Cloud Platform. When doing data analysis, typically data munging can take up most of the time and this serverless tool helps optimize the process.</p> <h5 id="about-eric-anderson">About Eric Anderson</h5> <p>Eric is a Product Manager at Google working on Cloud Dataprep and recently Cloud Dataflow. Previously he was at Amazon Web Services, Harvard Business School, General Electric and University of Utah. He’s from Salt Lake City, Utah and lives in Mountain View, California with and wife and three kids.</p> <h5 id="cool-things-of-the-week">Cool things of the week</h5> <ul> <li>Intel Performance Libraries and Python Distribution enhance performance and scaling of Intel Xeon Scalable (‘Skylake’) processors on GCP <a href="https://cloudplatform.googleblog.com/2017/11/Intel-performance-libraries-and-python-distribution-enhance-performance-and-scaling-of-Intel-Xeon-Scalable-processors-on-GCP.html"> blog</a></li> <li>The hidden costs of cloud <a href="https://medium.com/google-cloud/the-hidden-costs-of-cloud-ddb702495e93"> blog</a> and <a href="https://www.gcppodcast.com/post/episode-69-server-density/">Server Density podcast</a></li> <li>Monitor and manage your costs with Cloud Platform billing export to BigQuery <a href="https://cloudplatform.googleblog.com/2017/11/monitor-and-manage-your-costs-with.html"> blog</a> and <a href="https://www.gcppodcast.com/post/episode-83-public-datasets-with-mike-hamberg-and-will-curran/"> Public Datasets podcast</a></li> <li>Kaggle TensorFlow Speech Recognition Challenge <a href="https://www.kaggle.com/c/tensorflow-speech-recognition-challenge">site</a></li> </ul> <h5 id="interview">Interview</h5> <ul> <li>Cloud Dataprep <a href="https://cloud.google.com/dataprep/">site</a> <a href="https://cloud.google.com/dataprep/docs">docs</a></li> <li>Cloud Dataflow <a href="https://cloud.google.com/dataflow/">site</a> <a href="https://cloud.google.com/dataflow/docs">docs</a></li> <li>7 Steps to Mastering Data Preparation with Python <a href="https://www.kdnuggets.com/2017/06/7-steps-mastering-data-preparation-python.html"> blog</a></li> <li>Design Your Pipeline <a href="https://beam.apache.org/documentation/pipelines/design-your-pipeline/"> blog</a></li> <li>Apache Beam <a href="https://beam.apache.org/">site</a></li> </ul> <h5 id="question-of-the-week">Question of the week</h5> <p>What is feature engineering?</p> <ul> <li>Intro to Feature Engineering with TensorFlow <a href="https://www.youtube.com/watch?v=d12ra3b_M-0">video</a></li> </ul> <h5 id="where-can-you-find-us-next">Where can you find us next?</h5> <p>Mark will be Montreal in December to speak at <a href="http://www.migs17.com/en/home/">Montreal International Games Summit</a>.<br /> Melanie will be at <a href="https://nips.cc/">NIPS (Neural Information Processing Systems)</a> in Long Beach in December</p>