Big Data with Felipe Hoffa

Google Cloud Platform Podcast

Episode | Podcast

Date: Wed, 16 Dec 2015 16:36:15 +0000

<p>In the eighth episode of this podcast, and the last of 2015, your hosts <a href="http://twitter.com/francesc">Francesc</a> and <a href="http://twitter.com/neurotic">Mark</a> interview <a href="https://twitter.com/felipehoffa">Felipe Hoffa</a>. Felipe is a developer advocate for Google Cloud Platform who specializes in Big Data.</p> <h5 id="about-felipe">About Felipe</h5> <p>In 2011 Felipe Hoffa moved from Chile to San Francisco to join Google as a Software Engineer. Since 2013 he has been a Developer Advocate for Big Data, inspiring developers around the world to leverage the Google Cloud Platform tools to analyze and understand their data in ways they never could before. You can find him in several YouTube videos and blog posts, and at conferences around the world.</p> <h5 id="cool-thing-of-the-week">Cool thing of the week</h5> <ul> <li>Cloud SQL second generation <a href="https://cloud.google.com/sql/docs/introduction#v2">docs</a> and announcement <a href="http://googlecloudplatform.blogspot.com/2015/12/the-next-generation-of-managed-MySQL-offerings-on-Cloud-SQL.html">blog post</a></li> </ul> <p style="text-align: center;"><img alt="Cloud SQL Version 2" src="https://googlecloudpodcast.libsyn.com/images/post/cloudsqlv2.png" /></p> <h5 id="interview">Interview</h5> <ul> <li>BigQuery <a href="https://cloud.google.com/bigquery/">docs</a></li> <li>MapReduce: Simplified Data Processing on Large Clusters <a href="http://research.google.com/archive/mapreduce.html">research paper</a></li> <li>Dremel: Interactive Analysis of Web-Scale Datasets <a href="http://research.google.com/pubs/pub36632.html">research paper</a></li> <li>Cloud Dataflow <a href="https://cloud.google.com/dataflow/">docs</a></li> <li>FlumeJava: Easy, Efficient Data-Parallel Pipelines <a href="http://pages.cs.wisc.edu/~akella/CS838/F12/838-CloudPapers/FlumeJava.pdf">research paper</a></li> <li>MillWheel: Fault-Tolerant Stream Processing at Internet Scale <a 
href="http://research.google.com/pubs/pub41378.html">research paper</a></li> <li>Cloud Datalab <a href="https://cloud.google.com/datalab/">docs</a></li> <li>Jupyter project <a href="http://jupyter.org/">homepage</a></li> <li>Cloud Bigtable <a href="https://cloud.google.com/bigtable/docs/">docs</a></li> <li>Bigtable: A Distributed Storage System for Structured Data <a href="http://research.google.com/archive/bigtable.html">research paper</a></li> <li>Google Cloud Genomics <a href="https://cloud.google.com/genomics/">docs</a> and 23andMe <a href="https://www.23andme.com/">homepage</a></li> <li>Hey I just met you .<a href="https://www.youtube.com/watch?v=fWNaR-rxAic">.</a>. <a href="https://twitter.com/vambenepe/status/601545199056068608">tweet</a></li> <li>BigQuery <a href="https://www.reddit.com/r/bigquery/">subreddit</a></li> </ul> <h5 id="question-of-the-week">Question of the week</h5> <ul> <li>App Engine environment variables <a href="https://cloud.google.com/appengine/docs/java/config/appconfig#Java_appengine_web_xml_System_properties_and_environment_variables">docs</a></li> <li>Kubernetes secrets <a href="http://kubernetes.io/v1.0/docs/user-guide/secrets.html">docs</a></li> <li>Google Compute Engine metadata <a href="https://cloud.google.com/compute/docs/metadata">docs</a></li> <li>App Engine <a href="https://github.com/thesandlord/samples/tree/master/app-engine-metadata">example code</a> to access project metadata</li> <li>Google Cloud Storage security and privacy considerations <a href="https://cloud.google.com/storage/docs/gsutil/addlhelp/SecurityandPrivacyConsiderations">docs</a></li> </ul>