SlideShare a Scribd company logo
Cut Video
Course 1: Exploring and Preparing your Data
with BigQuery
Module 1: Introduction
Lesson Title: Introduction
Format: Talking head with slides
Video Name: xxx
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google
and the Google logo are trademarks of Google Inc.
All other company and product names may be
trademarks of the respective companies with
which they are associated.
From Data to Insights with
Google Cloud Platform
2
v1.0
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Facilities
3
Facilities Food
Parking
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Course etiquette
4
Recording
this class
is prohibited
Ask questions
interactively or
via chat (online)
Please silence
your phone and
take calls outside
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Agenda
5
1- Introduction to Data on the Google Cloud Platform
2 - Big Data Tools
3 - Exploring your Data with SQL in BigQuery
4 - Pricing
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Agenda
6
5 - Cleaning and Transforming Data
6 - Storing and Exporting Data
7 - Ingesting New Datasets
8 - Visualization Basics with Data Studio
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Agenda
7
9 - Joining and Merging Datasets
10 - Advanced Clauses and Functions
11 - Schema Design and Nested Data Structures
12 - Advanced Visualization with Google Data Studio
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Agenda
8
13 - Optimizing for Performance
14 - Advanced Insights with Cloud Datalab
15 - Data Access
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Audience and Prerequisites
9
Target Audiences
1. Data Analysts, Business Analysts,
Business Intelligence professionals
2. Data Engineers who will be
partnering with Data Analysts to
build scalable data solutions on
Google Cloud Platform
Prerequisites
1. Basic Knowledge of SQL
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo
are trademarks of Google Inc. All other company and product names may
be trademarks of the respective companies with which they are associated.
Introductions
Your instructor
• Organization
• Background
• Course goals
You
• Name
• Organization
• Job role
• Course goals
10
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Pay special attention to slides with key messages or pitfalls
11
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
12
Module 1
Introduction to Data on
the Google Cloud
Platform
In this module we will:
• Highlight Analytics Challenges Faced by Data
Analysts
• Compare Big Data On-Premise vs on the Cloud
• Learn from Real-World Use Cases of
Companies Transformed through Analytics on
the Cloud
• Navigate Google Cloud Platform Project Basics
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Data analysts face query, infrastructure, and storage challenges
“We can only afford
to store a subset of
the data our business
generates”
“My queries are taking
way too long to run
and is stalling my
analysis.”
“We’re a data department, not
an infrastructure department.
Maintaining and upgrading
our own servers is
unsustainable.”
“My on premise
clusters aren’t scaling
with my analysis”
“We don’t have a
central data analytics
warehouse or set of
tools”
“I have no easy way to
combine and query all
the data I’ve collected”
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
14
Module 1
Introduction to Data on
the Google Cloud
Platform
In this module we will:
• Highlight Analytics Challenges Faced by Data
Analysts
• Compare Big Data On-Premise vs
on the Cloud
• Learn from Real-World Use Cases of
Companies Transformed through Analytics on
the Cloud
• Navigate Google Cloud Platform Project Basics
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Reasons why Google Cloud Platform
is used for Data Analysis
● Storage is Cheap
● Focus on Queries,
not Infrastructure
● Massive Scalability
15
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
The cost of 1GB of storage has dropped dramatically
16
Cost of 1 GB from 1980 to 2017 drops exponentially
Cost
per
1
GB
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
17
1 4
3
2
Traditional big data platforms require an investment in
infrastructure
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Time to Understanding
Typical Big Data
Processing
Insights
Resource
provisioning
Performance
tuning
Monitoring
Reliability
Deployment &
configuration
Handling
growing scale
Utilization
improvements
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Time to Understanding
Big Data with Google:
Focus on insights,
not infrastructure.
Writing code
Insights
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Source: Lorem ipsum dolor sit amet, consectetur adipiscing elit. Duis non erat sem
Proprietary + Confidential
Training and Certification 20
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
21
“[Google's] ability to build, organize, and
operate a huge network of servers and
fiber-optic cables with an efficiency and
speed that rocks physics on its heels.
This is what makes Google Google: its
physical network, its thousands of fiber
miles, and those many thousands of
servers that, in aggregate, add up to the
mother of all clouds.”
- Wired
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
22
3
4
2
1
Edge locations
in 30+
countries
Software-
defined
networking
(why this matters)
Global data
centers
Global
network
Google Cloud Platform opens Google-scale big data analysis
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
You Manage
Hardware
On Premise
Your kit, someone
else’s building.
Yours to manage.
Assembly required True On-Demand Cloud
After
Storage Processing Memory Network
Ad Hoc Querying
and Scalable Storage
In the Cloud
An actual, global
elastic cloud
Invest your time in query
writing, not infrastructure
23
Before
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Google Cloud Platform enables on-demand scalability
24
Query Processing Time
Query Processing Time
(10s, 120 Machines)
= 1 Cloud Virtual Machine
On-Premise
Underprovisioned
(demand > capacity)
Query Processing Time
(120s, 1 Machine)
Overprovisioned
(demand < capacity)
Google Cloud Platform
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Separation of storage and computing power enables efficient
resource allocation
25
Pay for ability to use processing power
even when no queries running
On-Premise Google Cloud Platform
Pay for only the resources you
are using and no more
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
BigQuery scales automatically and you only pay for what you use
26
Query Processing Time
Fully-Managed
Infrastructure Scales
to Process Faster
.. and you only pay for
bytes processed +
storage
$
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
27
Module 1
Introduction to Data on
the Google Cloud
Platform
In this module we will:
• Highlight Analytics Challenges Faced by Data
Analysts
• Compare Big Data On-Premise vs on the Cloud
• Learn from Real-World Use Cases of
Companies Transformed through Analytics on
the Cloud
• Navigate Google Cloud Platform Project Basics
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
“[Our mission is] to make our data so
intelligent it has the answer before the
question is even asked. It was a stretch
goal but essentially one that means we
have to capture all the data we produce
- both now and in the future.”
Dan Nelson - Head of Data
Ocado
28
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Store Petabytes of Data
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
“The less time that we can spend
solving problems that are already
solved, like scaling,... the more time and
energy we can spend on turning our
data into value”
Nicholas Harteau - VP Infrastructure
Spotify
29
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Focus on your Business, not Hardware
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
30
Module 1
Introduction to Data on
the Google Cloud
Platform
In this module we will:
• Highlight Analytics Challenges Faced by Data
Analysts
• Compare Big Data On-Premise vs on the Cloud
• Learn from Real-World Use Cases of
Companies Transformed through Analytics on
the Cloud
• Navigate Google Cloud Platform
Project Basics
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Navigate the Google Cloud Platform using the dashboard
1. Projects
2. Resources
3. Billing
31
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
32
1. Projects organize and govern your activities in the cloud
● Navigate and launch cloud tools for
your project by exploring the Products
and Services menu
● Work collaboratively by adding project
users through IAM (Identity and
Access Management)
● Authorize Tools and Apps through the
API manager
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
33
2. Resources are what you are using in the cloud
Commonly used by data analysts:
● Storage in Google Cloud Storage
○ Example: You use a Bucket for
uploading large CSV files to ingest
later for analysis
● Datasets in Google BigQuery
○ Example: You perform analysis on
raw data and create a brand new
dataset
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
34
2. The Cloud Storage Bucket is your goto for scalable storage
● Buckets are scalable containers that
hold your data.
● You can create and upload files to your
buckets within your Cloud Console
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
35
3. You are billed for the resources you use
After this course, try exporting BigQuery logs using this
tutorial to recreate the above Data Studio billing dashboard
Commonly used by data analysts:
● Storage in Google Cloud Storage
○ Billed for Bucket Storage
● Datasets in Google BigQuery
○ Billed for Query processing
○ Billed for Table Storage
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
36
Manage and
monitor your
project resources
in one place
Efficiently scale
your compute
and storage
needs
Overcome query
speed,
infrastructure,
and cost
challenges
Summary: GCP offers you the ability to:
Module Summary: Scale with the Google Cloud Platform
Evangelize data
analysis in your
organization
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Lab 0
Getting Started with
Google Cloud Platform and Qwiklabs
37
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Getting started with Google Cloud Platform and Qwiklabs
● Open an incognito window
● Navigate to:
googlecloud.qwiklabs.com
● Create a new account with the
email address you used when
you registered for this course
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
What you get
39
For each lab, Qwiklabs offers:
• A free set of resources for a fixed
amount of time
• A clean environment with
permissions
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Qwiklabs sign-in process
40
Open an incognito browser
From the incognito browser,
sign in to Qwiklabs
Select the lab and
click Start Lab
Sign in to the GCP
console using the
provided credentials
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Open Qwiklabs
41
Open an incognito window
(or private/anonymous window).
Go to the Qwiklabs URL your
instructor provides.
Username
Password
Sign in and launch the course
(with credentials you used to register
for the course).
1
2
3
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
View your labs
42
Lab completed
Active lab
Not yet available
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Select a lab
43
You cannot pause and restart
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
1. Click
2. Note the following
3. Click and sign in
4. Accept terms and note the project
Run a lab
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
End a lab
45
Some labs may require you to NOT end the lab; the instructor will
inform you.
● When done, click to free your resources.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
© 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other
company and product names may be trademarks of the respective companies with which they are associated.
Course materials: End of class
46
Click Materials on
the top navigation bar.
Select the class from the
Course Materials list.
2
1
From Data to Insights on Google Cloud Platform

More Related Content

Similar to From Data to Insights with Google Cloud Platform (20)

PDF
BI, Hive or Big Data Analytics?
Datameer
 
PDF
"You don't need a bigger boat": serverless MLOps for reasonable companies
Data Science Milan
 
PDF
Return material authorization advance replacement programs apr 27, suite wo...
Bala Ramachandran
 
PDF
Take groovy to places you never thought were possible
Kyle Goodfriend
 
PPTX
Tech Winter Break - GDG OnCampus International Institute of Information Techn...
VarnitMittal1
 
PDF
How Google Does Big Data - DevNexus 2014
James Chittenden
 
PDF
A6 harnessing the power of big data and business analytics to transform bus...
Dr. Wilfred Lin (Ph.D.)
 
PPT
Are you ready for Drupal 8?
Stephanie Peugh
 
PDF
Learn to-use-google-data-studio-jan22
Rahmat Taufiq Sigit
 
PDF
Using Graphs for Data Analysis
オラクルエンジニア通信
 
PPTX
Presentation of Google
MdAlMamun44
 
PPTX
Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...
Matillion
 
PDF
What is Office 365? A Simple Answer
Aptera Inc
 
PPTX
The new dominant companies are running on data
SnapLogic
 
PPTX
IndexConf 2018: AI Strategy Pattern
Sean Kennedy
 
PDF
The Big Picture on Big Data and Cognos
Senturus
 
PPTX
Optimize Content for an Impactful Customer Journey
Kirill Kronrod
 
PDF
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Matt Stubbs
 
PDF
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Matt Stubbs
 
PDF
How Google apps Can Increase Innovation and Streamline It
Redpath Consulting Group
 
BI, Hive or Big Data Analytics?
Datameer
 
"You don't need a bigger boat": serverless MLOps for reasonable companies
Data Science Milan
 
Return material authorization advance replacement programs apr 27, suite wo...
Bala Ramachandran
 
Take groovy to places you never thought were possible
Kyle Goodfriend
 
Tech Winter Break - GDG OnCampus International Institute of Information Techn...
VarnitMittal1
 
How Google Does Big Data - DevNexus 2014
James Chittenden
 
A6 harnessing the power of big data and business analytics to transform bus...
Dr. Wilfred Lin (Ph.D.)
 
Are you ready for Drupal 8?
Stephanie Peugh
 
Learn to-use-google-data-studio-jan22
Rahmat Taufiq Sigit
 
Using Graphs for Data Analysis
オラクルエンジニア通信
 
Presentation of Google
MdAlMamun44
 
Using Google Cloud for Marketing Analytics: How the7stars, the UK’s largest i...
Matillion
 
What is Office 365? A Simple Answer
Aptera Inc
 
The new dominant companies are running on data
SnapLogic
 
IndexConf 2018: AI Strategy Pattern
Sean Kennedy
 
The Big Picture on Big Data and Cognos
Senturus
 
Optimize Content for an Impactful Customer Journey
Kirill Kronrod
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Matt Stubbs
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Matt Stubbs
 
How Google apps Can Increase Innovation and Streamline It
Redpath Consulting Group
 

Recently uploaded (20)

PPTX
Climate Action.pptx action plan for climate
justfortalabat
 
PPTX
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
PDF
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
PPTX
Introduction to Artificial Intelligence.pptx
StarToon1
 
PDF
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
PDF
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
PDF
Choosing the Right Database for Indexing.pdf
Tamanna
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
PPTX
recruitment Presentation.pptxhdhshhshshhehh
devraj40467
 
PPTX
Slide studies GC- CRC - PC - HNC baru.pptx
LLen8
 
PDF
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
PPTX
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
PPTX
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
PPTX
Rocket-Launched-PowerPoint-Template.pptx
Arden31
 
PPTX
fashion industry boom.pptx an economics project
TGMPandeyji
 
PPT
01 presentation finyyyal معهد معايره.ppt
eltohamym057
 
PPTX
加拿大尼亚加拉学院毕业证书{Niagara在读证明信Niagara成绩单修改}复刻
Taqyea
 
PPT
1 DATALINK CONTROL and it's applications
karunanidhilithesh
 
PDF
T2_01 Apuntes La Materia.pdfxxxxxxxxxxxxxxxxxxxxxxxxxxxxxskksk
mathiasdasilvabarcia
 
PPTX
GenAI-Introduction-to-Copilot-for-Bing-March-2025-FOR-HUB.pptx
cleydsonborges1
 
Climate Action.pptx action plan for climate
justfortalabat
 
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
Introduction to Artificial Intelligence.pptx
StarToon1
 
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
Choosing the Right Database for Indexing.pdf
Tamanna
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
recruitment Presentation.pptxhdhshhshshhehh
devraj40467
 
Slide studies GC- CRC - PC - HNC baru.pptx
LLen8
 
apidays Helsinki & North 2025 - Monetizing AI APIs: The New API Economy, Alla...
apidays
 
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
Rocket-Launched-PowerPoint-Template.pptx
Arden31
 
fashion industry boom.pptx an economics project
TGMPandeyji
 
01 presentation finyyyal معهد معايره.ppt
eltohamym057
 
加拿大尼亚加拉学院毕业证书{Niagara在读证明信Niagara成绩单修改}复刻
Taqyea
 
1 DATALINK CONTROL and it's applications
karunanidhilithesh
 
T2_01 Apuntes La Materia.pdfxxxxxxxxxxxxxxxxxxxxxxxxxxxxxskksk
mathiasdasilvabarcia
 
GenAI-Introduction-to-Copilot-for-Bing-March-2025-FOR-HUB.pptx
cleydsonborges1
 
Ad

From Data to Insights with Google Cloud Platform

  • 1. Cut Video Course 1: Exploring and Preparing your Data with BigQuery Module 1: Introduction Lesson Title: Introduction Format: Talking head with slides Video Name: xxx
  • 2. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. From Data to Insights with Google Cloud Platform 2 v1.0
  • 3. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Facilities 3 Facilities Food Parking
  • 4. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Course etiquette 4 Recording this class is prohibited Ask questions interactively or via chat (online) Please silence your phone and take calls outside
  • 5. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Agenda 5 1- Introduction to Data on the Google Cloud Platform 2 - Big Data Tools 3 - Exploring your Data with SQL in BigQuery 4 - Pricing
  • 6. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Agenda 6 5 - Cleaning and Transforming Data 6 - Storing and Exporting Data 7 - Ingesting New Datasets 8 - Visualization Basics with Data Studio
  • 7. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Agenda 7 9 - Joining and Merging Datasets 10 - Advanced Clauses and Functions 11 - Schema Design and Nested Data Structures 12 - Advanced Visualization with Google Data Studio
  • 8. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Agenda 8 13 - Optimizing for Performance 14 - Advanced Insights with Cloud Datalab 15 - Data Access
  • 9. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Audience and Prerequisites 9 Target Audiences 1. Data Analysts, Business Analysts, Business Intelligence professionals 2. Data Engineers who will be partnering with Data Analysts to build scalable data solutions on Google Cloud Platform Prerequisites 1. Basic Knowledge of SQL
  • 10. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Introductions Your instructor • Organization • Background • Course goals You • Name • Organization • Job role • Course goals 10
  • 11. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Pay special attention to slides with key messages or pitfalls 11
  • 12. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 12 Module 1 Introduction to Data on the Google Cloud Platform In this module we will: • Highlight Analytics Challenges Faced by Data Analysts • Compare Big Data On-Premise vs on the Cloud • Learn from Real-World Use Cases of Companies Transformed through Analytics on the Cloud • Navigate Google Cloud Platform Project Basics
  • 13. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Data analysts face query, infrastructure, and storage challenges “We can only afford to store a subset of the data our business generates” “My queries are taking way too long to run and is stalling my analysis.” “We’re a data department, not an infrastructure department. Maintaining and upgrading our own servers is unsustainable.” “My on premise clusters aren’t scaling with my analysis” “We don’t have a central data analytics warehouse or set of tools” “I have no easy way to combine and query all the data I’ve collected”
  • 14. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 14 Module 1 Introduction to Data on the Google Cloud Platform In this module we will: • Highlight Analytics Challenges Faced by Data Analysts • Compare Big Data On-Premise vs on the Cloud • Learn from Real-World Use Cases of Companies Transformed through Analytics on the Cloud • Navigate Google Cloud Platform Project Basics
  • 15. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Reasons why Google Cloud Platform is used for Data Analysis ● Storage is Cheap ● Focus on Queries, not Infrastructure ● Massive Scalability 15 © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated.
  • 16. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. The cost of 1GB of storage has dropped dramatically 16 Cost of 1 GB from 1980 to 2017 drops exponentially Cost per 1 GB
  • 17. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 17 1 4 3 2 Traditional big data platforms require an investment in infrastructure
  • 18. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Time to Understanding Typical Big Data Processing Insights Resource provisioning Performance tuning Monitoring Reliability Deployment & configuration Handling growing scale Utilization improvements
  • 19. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Time to Understanding Big Data with Google: Focus on insights, not infrastructure. Writing code Insights
  • 20. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Source: Lorem ipsum dolor sit amet, consectetur adipiscing elit. Duis non erat sem Proprietary + Confidential Training and Certification 20
  • 21. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 21 “[Google's] ability to build, organize, and operate a huge network of servers and fiber-optic cables with an efficiency and speed that rocks physics on its heels. This is what makes Google Google: its physical network, its thousands of fiber miles, and those many thousands of servers that, in aggregate, add up to the mother of all clouds.” - Wired
  • 22. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 22 3 4 2 1 Edge locations in 30+ countries Software- defined networking (why this matters) Global data centers Global network Google Cloud Platform opens Google-scale big data analysis
  • 23. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. You Manage Hardware On Premise Your kit, someone else’s building. Yours to manage. Assembly required True On-Demand Cloud After Storage Processing Memory Network Ad Hoc Querying and Scalable Storage In the Cloud An actual, global elastic cloud Invest your time in query writing, not infrastructure 23 Before
  • 24. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Google Cloud Platform enables on-demand scalability 24 Query Processing Time Query Processing Time (10s, 120 Machines) = 1 Cloud Virtual Machine On-Premise Underprovisioned (demand > capacity) Query Processing Time (120s, 1 Machine) Overprovisioned (demand < capacity) Google Cloud Platform
  • 25. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Separation of storage and computing power enables efficient resource allocation 25 Pay for ability to use processing power even when no queries running On-Premise Google Cloud Platform Pay for only the resources you are using and no more
  • 26. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. BigQuery scales automatically and you only pay for what you use 26 Query Processing Time Fully-Managed Infrastructure Scales to Process Faster .. and you only pay for bytes processed + storage $
  • 27. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 27 Module 1 Introduction to Data on the Google Cloud Platform In this module we will: • Highlight Analytics Challenges Faced by Data Analysts • Compare Big Data On-Premise vs on the Cloud • Learn from Real-World Use Cases of Companies Transformed through Analytics on the Cloud • Navigate Google Cloud Platform Project Basics
  • 28. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. “[Our mission is] to make our data so intelligent it has the answer before the question is even asked. It was a stretch goal but essentially one that means we have to capture all the data we produce - both now and in the future.” Dan Nelson - Head of Data Ocado 28 © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Store Petabytes of Data
  • 29. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. “The less time that we can spend solving problems that are already solved, like scaling,... the more time and energy we can spend on turning our data into value” Nicholas Harteau - VP Infrastructure Spotify 29 © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Focus on your Business, not Hardware
  • 30. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 30 Module 1 Introduction to Data on the Google Cloud Platform In this module we will: • Highlight Analytics Challenges Faced by Data Analysts • Compare Big Data On-Premise vs on the Cloud • Learn from Real-World Use Cases of Companies Transformed through Analytics on the Cloud • Navigate Google Cloud Platform Project Basics
  • 31. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Navigate the Google Cloud Platform using the dashboard 1. Projects 2. Resources 3. Billing 31
  • 32. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 32 1. Projects organize and govern your activities in the cloud ● Navigate and launch cloud tools for your project by exploring the Products and Services menu ● Work collaboratively by adding project users through IAM (Identity and Access Management) ● Authorize Tools and Apps through the API manager
  • 33. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 33 2. Resources are what you are using in the cloud Commonly used by data analysts: ● Storage in Google Cloud Storage ○ Example: You use a Bucket for uploading large CSV files to ingest later for analysis ● Datasets in Google BigQuery ○ Example: You perform analysis on raw data and create a brand new dataset
  • 34. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 34 2. The Cloud Storage Bucket is your goto for scalable storage ● Buckets are scalable containers that hold your data. ● You can create and upload files to your buckets within your Cloud Console
  • 35. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 35 3. You are billed for the resources you use After this course, try exporting BigQuery logs using this tutorial to recreate the above Data Studio billing dashboard Commonly used by data analysts: ● Storage in Google Cloud Storage ○ Billed for Bucket Storage ● Datasets in Google BigQuery ○ Billed for Query processing ○ Billed for Table Storage
  • 36. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 36 Manage and monitor your project resources in one place Efficiently scale your compute and storage needs Overcome query speed, infrastructure, and cost challenges Summary: GCP offers you the ability to: Module Summary: Scale with the Google Cloud Platform Evangelize data analysis in your organization
  • 37. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Lab 0 Getting Started with Google Cloud Platform and Qwiklabs 37
  • 38. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Getting started with Google Cloud Platform and Qwiklabs ● Open an incognito window ● Navigate to: googlecloud.qwiklabs.com ● Create a new account with the email address you used when you registered for this course
  • 39. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. What you get 39 For each lab, Qwiklabs offers: • A free set of resources for a fixed amount of time • A clean environment with permissions
  • 40. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Qwiklabs sign-in process 40 Open an incognito browser From the incognito browser, sign in to Qwiklabs Select the lab and click Start Lab Sign in to the GCP console using the provided credentials
  • 41. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Open Qwiklabs 41 Open an incognito window (or private/anonymous window). Go to the Qwiklabs URL your instructor provides. Username Password Sign in and launch the course (with credentials you used to register for the course). 1 2 3
  • 42. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. View your labs 42 Lab completed Active lab Not yet available
  • 43. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Select a lab 43 You cannot pause and restart
  • 44. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. 1. Click 2. Note the following 3. Click and sign in 4. Accept terms and note the project Run a lab
  • 45. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. End a lab 45 Some labs may require you to NOT end the lab; the instructor will inform you. ● When done, click to free your resources.
  • 46. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. © 2017 Google Inc. All rights reserved. Google and the Google logo are trademarks of Google Inc. All other company and product names may be trademarks of the respective companies with which they are associated. Course materials: End of class 46 Click Materials on the top navigation bar. Select the class from the Course Materials list. 2 1 From Data to Insights on Google Cloud Platform