BigLake with Gaurav Saxena and Justin Levandoski

Google Cloud Platform Podcast

Episode | Podcast

Date: Wed, 27 Apr 2022 16:15:00 +0000

<p><a href="https://twitter.com/stephr_wong"><span style="font-weight: 400;">Stephanie Wong</span></a> <span style="font-weight: 400;">and Debi Cabrera are learning all about BigLake from guests</span> <a href="https://twitter.com/gavsa82"><span style="font-weight: 400;">Gaurav Saxena</span></a> <span style="font-weight: 400;">and</span> <a href="https://twitter.com/jstnlvndski"><span style="font-weight: 400;">Justin Levandoski</span></a> <span style="font-weight: 400;">of the BigQuery team.</span></p> <p><span style="font-weight: 400;">BigLake offers unified data management across data warehouses and data lakes. What exactly is the difference between a data warehouse and a data lake? Justin explains what a data lake is, how data lakes came to be, and what benefits they offer. Each option has its drawbacks too, such as the limitations of data lakes for enterprise use. Enter BigLake, built on BigQuery, which helps enterprise clients manage and analyze their data from both data warehouses and data lakes. The best features of BigQuery are now available for Google Cloud Storage and across multi-cloud solutions.</span></p> <p><span style="font-weight: 400;">Gaurav describes how BigLake works behind the scenes and how the principles of BigQuery’s data management can now be used for open file formats in BigLake. It’s BigQuery for more data formats, Justin explains. BigLake solves many data problems quickly, with a special emphasis on improving security. Our guests talk specifically about clients who gain the most from using BigLake, especially those looking to analyze distributed data and those who need easy and fast security and compliance solutions. With tightened security, BigLake offers access delegation and secure APIs that work over object storage. 
We hear about the user experience and how easy it is to get started, especially for customers who already use other GCP products.</span></p> <p><span style="font-weight: 400;">Google’s advocacy of open source projects means many clients are coming in with workloads built with open source software. BigLake supports multi-cloud projects so that tables can be built on top of any data system. No matter the format of your data, you can run analytics with BigLake. We talk more about the security features of BigLake and how easy it is to unify data warehouses and data lakes with optimal data security.</span></p> <p><span style="font-weight: 400;">Customers have helped shape BigLake, and Gaurav describes how they are using it. We hear about integration with BigQuery Omni and Dataplex and how BigLake is different. In the future, Google will continue to make simple, effective solutions for data management and analytics, building further on BigQuery.</span></p> <h5><strong>Gaurav Saxena</strong></h5> <p><a href="https://twitter.com/gavsa82"><span style="font-weight: 400;">Gaurav Saxena</span></a> <span style="font-weight: 400;">is a product management lead at Google BigQuery. He has 12+ years of experience building products at the intersection of cloud, data and AI. Before Google, Gaurav led product management at Microsoft Azure and Amazon Web Services for some of the most widely used cloud offerings in storage and data.</span></p> <h5><strong>Justin Levandoski</strong></h5> <p><a href="https://twitter.com/jstnlvndski"><span style="font-weight: 400;">Justin</span></a> <span style="font-weight: 400;">is a tech lead/manager in BigQuery leading BigLake and other projects pushing the frontier of BigQuery. 
Prior to Google, Justin worked on Amazon Aurora and was part of the Database research group at Microsoft Research.</span></p> <h5><strong>Cool things of the week</strong></h5> <ul> <li style="font-weight: 400;"><span style="font-weight: 400;">Your ultimate guide to Speech on Google Cloud</span> <a href="https://cloud.google.com/blog/products/ai-machine-learning/your-ultimate-guide-to-speech-on-google-cloud"> <span style="font-weight: 400;">blog</span></a></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Announcing the Climate Innovation Challenge—grants to support cutting-edge earth research</span> <a href="https://cloud.google.com/blog/topics/sustainability/climate-innovation-challenge-provides-google-cloud-credits"> <span style="font-weight: 400;">blog</span></a></li> </ul> <h5><strong>Interview</strong></h5> <ul> <li style="font-weight: 400;"><span style="font-weight: 400;">BigLake</span> <a href="https://cloud.google.com/biglake"><span style="font-weight: 400;">site</span></a></li> <li style="font-weight: 400;"><span style="font-weight: 400;">BigQuery</span> <a href="https://cloud.google.com/bigquery"><span style="font-weight: 400;">site</span></a></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Cloud Storage</span> <a href="https://cloud.google.com/storage"><span style="font-weight: 400;">site</span></a></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Spark</span> <a href="https://spark.apache.org/"><span style="font-weight: 400;">site</span></a></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Apache Ranger</span> <a href="https://ranger.apache.org/"><span style="font-weight: 400;">site</span></a></li> <li style="font-weight: 400;"><span style="font-weight: 400;">BigQuery Omni</span> <a href="https://cloud.google.com/bigquery-omni/docs/introduction"><span style="font-weight: 400;"> docs</span></a></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Apache Iceberg</span> <a 
href="https://iceberg.apache.org/"><span style="font-weight: 400;">site</span></a></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Delta Lake</span> <a href="https://delta.io/"><span style="font-weight: 400;">site</span></a></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Presto</span> <a href="https://prestodb.io/"><span style="font-weight: 400;">site</span></a></li> <li style="font-weight: 400;"><span style="font-weight: 400;">TensorFlow</span> <a href="https://www.tensorflow.org/"><span style="font-weight: 400;">site</span></a></li> <li style="font-weight: 400;"><span style="font-weight: 400;">Dataplex</span> <a href="https://cloud.google.com/dataplex"><span style="font-weight: 400;">site</span></a></li> </ul> <h5><strong>What’s something cool you’re working on?</strong></h5> <p><span style="font-weight: 400;">Debi is working on a series about automatic DLP. Cloud Data Loss Prevention is now automatic and allows you to scan data across your whole org with the click of one button!</span></p> <h5><strong>Hosts</strong></h5> <p><span style="font-weight: 400;">Stephanie Wong and Debi Cabrera</span></p>