Introduction to BigQuery

Introduction to BigQuery

Matt Forrest

Field CTO

What is BigQuery?

  • Uses SQL
  • Scalable to analyze massive datasets
  • Enterprise data warehouse
  • Launched in 2012 using the same tools as Google uses

BigQuery Logo Official launch post

Introduction to BigQuery

What makes BigQuery unique?

  • Online analytical processing (OLAP)
  • Separate compute and storage
  • Serverless
Introduction to BigQuery

Compute and storage

BigQuery architecture showing distributed compute and storage

1 https://cloud.google.com/bigquery/docs/storage_overview
Introduction to BigQuery

Snowflake and BigQuery

Snowflake

  • Popular amongst developers
  • Runs on any cloud
  • Specific tiers of compute resources (small, medium, etc.)

 

Snowflake Logo

BigQuery

  • Popular for analytical queries (reports)
  • Only run on Google Cloud
  • Completely serverless, no tiers

 

BigQuery

Introduction to BigQuery

Redshift and BigQuery

Redshift

  • Constant computation or serverless
  • Suited for live dashboarding

 

Amazon Redshift Logo

BigQuery

  • Only serverless
  • Point in time analysis (once a day or hour)

 

BigQuery

Introduction to BigQuery

Traditional SQL databases and BigQuery

SQL databases

  • Online transactional processing (OTAP)
  • Compute and storage are linked
  • Non-distributed

 

SQL icon

BigQuery

  • Online analytical processing (OLAP)
  • Compute and storage are separate
  • Distributed query engine

 

BigQuery logo

Introduction to BigQuery

How is BigQuery used?

  1. Reports that run at a specific time period or day (daily ecommerce reports, quarterly reports)
  2. Ad-hoc discovery (marketing campaign analysis)

Scheduled queries interface within Google Cloud BigQuery

Introduction to BigQuery

Let's practice!

Introduction to BigQuery

Preparing Video For Download...