SlideShare a Scribd company logo
© 2018 Snowflake Computing Inc. All Rights
Reserved.
August 24, 2018© Snowflake Computing Inc. All Rights Reserved
ZERO TO
SNOWFLAKE IN
90 MINUTES
in partnership with
© 2018 Snowflake Computing Inc. All Rights
Reserved.
• Download Materials @ https://tinyurl.com/yy9frfuw (Hotmail / outlook / o365)
• Download Worksheets and unzip
• Partner Introduction
• Snowflake Introduction
• Data Warehousing Today
• What Users do with Snowflake
• Hands on Snowflake
• Data loading- sample data set is shared
• Integrations
• Multi-clustering
• And more!
• Scaling/Workload Isolation
• Snowflake in the Real World- Data Sharing and Customer Example
• Conclusion
AGENDA
© 2018DMICONFIDENTIAL&PROPRIETARY
3
DATA & ANALYTICS
Snowflake
Solution Integration
Partner
Creating an intelligent existence by linking physical
& digital worlds to unleash the power of connectivity
〉 DATA PLATFORM SOLUTIONS
〉 MODERN DATA SOLUTIONS
〉 VISUAL SOLUTIONS
〉 ADVANCED ANALYTICS
〉 ENTERPRISE DATA STRATEGY
〉 EXECUTIVE ADVISORY
〉 ANALYTICS AS A SERVICE
〉 AGILE ANALYTICS
© 2018DMICONFIDENTIAL&PROPRIETARY
4
DATA & ANALYTICS
PARTNERSHIPS
***8 MOST IMPORTANT AI AND ANALYTIC TRENDS OF 2019 EBOOK AND MORE INFO ABOUT DMI @ WWW.DMINC.COM/EBOOK M
Today’s Moderators
〉 Brett VanderPlaats
Sr. Data Architect, Data Platform & Analytics
〉 David Mellinger
Practice Lead, Data Platform & Analytics
1,500+ CUSTOMERS
Building new analytic applications
Delivered new analytic application to
pharmacies using Snowflake
Moving to the cloud
Using Snowflake to move data analytics
to the cloud
Modernizing data platforms
Replaced data warehouse appliance +
Hadoop with Snowflake
Accelerating enterprise BI and analytics
Moved from legacy data warehouse
systems (appliance & cloud) to
Snowflake
© 2018 Snowflake Computing Inc. All Rights Reserved. 5
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are Users Doing Today?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
What would you like in your Data Warehouse?
8
Complete
SQL Database
Zero
Management
All of
Your
Data
All of
Your
Users
Pay Only for
What You
Use
Live Data
Sharing
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are Users Doing with Snowflake?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are User Doing with Snowflake?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
What Are User Doing with Snowflake?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Snowflake’s Differentiating Architecture
© 2018 Snowflake Computing Inc. All Rights
Reserved.
SNOWFLAKE’S MULTI-CLUSTER, SHARED DATA ARCHITECTURE
Centralized storage
Instant, automatic scalability & elasticity
Service
Compute
Storage
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Why Scaling Compute Saves $$$$$$
© 2018 Snowflake Computing Inc. All Rights
Reserved.
How Does Snowflake Fit?
ADVANCED
ANALYTICS
INTEGRATION
TOOLS
ELT
Stream
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Scale compute and concurrency
ADF
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Scale Up (Response Time)
Queries that are:
--Large, Complex, many Calculations
Scale Out
(Throuhput)
Many users
or processes
concurrently
querying
S M L XL 2XXS 3X 4X
XS
XS
XS
XS
XS
XS
XS
XS
XS
L L
2X
2X2X2X
4X
4X
4X
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Differences from other EDW vendors
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Business Implications for Organizations
• Native ANSI-SQL database for leveraging existing skills
• Reducing expensive retraining
• Interoperability with existing tools (Power BI, Tableau, and others have SF
connectors)
• Simplified Migration
• Support/Documentation on Snowflake – Take a look at
© 2018 Snowflake Computing Inc. All Rights
Reserved.
But Wait….
…There’s
© 2018 Snowflake Computing Inc. All Rights
Reserved.
DEV OPS?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Data Protection w/o Restore?
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Data Sharing (Monetization)
Disaster Recovery
***Platform (AZURE AWS) in 2019
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Native Support for
structured and semi-
structured data
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Cross Region Data Replication
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Comprehensive data protection
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
BIO-BREAK
THEN
LET’S DIVE INTO
SNOWFLAKE!
https://tinyurl.com/yy9frfuw
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 1
Snowflake DB/WH/Object
Configuration
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 2
Queries and Performance
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Azure Storage
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Performance through Caching
Azure Storage
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Query
Profiler•Processing — time spent on data processing by the CPU.
•Local Disk IO — time when the processing was blocked by local disk access.
•Remote Disk IO — time when the processing was blocked by remote disk access.
•Network Communication — time when the processing was waiting for the network data
transfer.
•Synchronization — various synchronization activities between participating processes.
•Initialization — time spent setting up the query processing.
•IO — information about the input-output operations performed during the query:
• Scan progress — the percentage of data scanned for a given table so far.
• Bytes scanned — the number of bytes scanned so far.
• Percentage scanned from cache — the percentage of data scanned from the local disk
cache.
• Bytes written — bytes written (e.g. when loading into a table).
• Bytes written to result — bytes written to a result object.
• Bytes read from result — bytes read from a result object.
• External bytes scanned — bytes read from an external object, e.g. a stage.
•DML — statistics for Data Manipulation Language (DML) queries:
• Number of rows inserted — number of rows inserted into a table (or tables).
• Number of rows updated — number of rows updated in a table.
• Number of rows deleted — number of rows deleted from a table.
• Number of rows unloaded — number of rows unloaded during data export.
• Number of bytes deleted — number of bytes deleted from a table.
•Pruning — information on the effects of table pruning:
• Partitions scanned — number of partitions scanned so far.
• Partitions total — total number of partitions in a given table.
•Spilling — information about disk usage for operations where intermediate results do not fit in memory:
• Bytes spilled to local storage — volume of data spilled to local disk.
• Bytes spilled to remote storage — volume of data spilled to remote disk.
•Network — network communication:
• Bytes sent over the network — amount of data sent over the network.
•EXECUTION TIME
•STATISTICS
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Multi-cluster Warehouse
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Micropartion
s
&
Pruning
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Enterprise Grade Security
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 3
Unstructured Data
© 2018 Snowflake Computing Inc. All Rights
Reserved.
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 4
Loading Data
© 2018 Snowflake Computing Inc. All Rights
Reserved.
• 4 objects for data loading
1) Source
2) Warehouse
3) Database
4) File Format (default CSV)
• 100 mb file is optimum size
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
Module 5
Cloning / Time Travel
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Multi-cluster Warehouse
August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved
THANK YOU
© 2018 Snowflake Computing Inc. All Rights
Reserved.
© 2018 Snowflake Computing Inc. All Rights
Reserved.
USE CASES FOR SNOWFLAKE DATA SHARING
• Nielsen is a global information, data, and
measurement company
• Nielsen knows “What People Watch,
Listen To, and Buy”
• Nielsen Marketing Cloud includes eXelate
DMP which provides unified consumer
profiles
• Nielsen sells selective slices of their DMP
data available to advertisers for particular
marketing campaigns
• Nielsen plans to use data sharing for
making detailed datasets available to
subscribers
• Lower friction, lower cost solution
• Scalable operations
61© 2016 Snowflake Computing Inc. All Rights
Reserved.
CUSTOMER EXAMPLE: BLACKBOARD
Jay White
Director, Software Engineering
Scenario
Provide and perfect over 14 different data
products that help universities facilitate
learning online
Pain Points
Disparate data
Challenges integrating data
Semi-structured data
Solution
Replace existing Hadoop and RDBMS
system with Snowflake
Everything that we did left our jaw on the
table. ‘Wait – we’ve never done anything like
that.’ Or, ‘How did that just run so fast.’ We
are getting 16x performance from
Snowflake.
62© 2016 Snowflake Computing Inc. All Rights
Reserved.
A NEW DATA PIPELINE FOR BLACKBOARD
Snowflake
S
3
Student data
Mobile data
Collaborative data
Intelsuite data
Kafka
Airflow for data orchestration Looker for internal dashboards
Learning Management System
Blackboard Predict
R Prediction Engine
Unified data
Simplified data transformation
Existing tools integrate seamlessly
• 16x performance improvement over SQL
• 1 PB by the end of 2017
© 2018 Snowflake Computing Inc. All Rights
Reserved.
Built-in disaster recover and high availabity

More Related Content

What's hot (20)

PPTX
Snowflake Data Loading.pptx
Parag860410
 
PPTX
Snowflake: The Good, the Bad, and the Ugly
Tyler Wishnoff
 
PDF
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
Denodo
 
PPTX
A 30 day plan to start ending your data struggle with Snowflake
Snowflake Computing
 
PDF
Snowflake Company Presentation
AndrewJiang18
 
PDF
Snowflake Data Science and AI/ML at Scale
Adam Doyle
 
PPTX
Databricks Platform.pptx
Alex Ivy
 
PPT
An overview of snowflake
Sivakumar Ramar
 
PDF
Snowflake Architecture
mymailforspamfr
 
PPTX
Demystifying Data Warehouse as a Service
Snowflake Computing
 
PPTX
Delivering Data Democratization in the Cloud with Snowflake
Kent Graziano
 
PPTX
Data Sharing with Snowflake
Snowflake Computing
 
PDF
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
PDF
Demystifying Data Warehousing as a Service - DFW
Kent Graziano
 
PDF
Data Warehouse - Incremental Migration to the Cloud
Michael Rainey
 
PPTX
Building a modern data warehouse
James Serra
 
PPTX
Snowflake Datawarehouse Architecturing
Ishan Bhawantha Hewanayake
 
PDF
Actionable Insights with AI - Snowflake for Data Science
Harald Erb
 
PPTX
Master the Multi-Clustered Data Warehouse - Snowflake
Matillion
 
PDF
Modern Data architecture Design
Kujambu Murugesan
 
Snowflake Data Loading.pptx
Parag860410
 
Snowflake: The Good, the Bad, and the Ugly
Tyler Wishnoff
 
How to Take Advantage of an Enterprise Data Warehouse in the Cloud
Denodo
 
A 30 day plan to start ending your data struggle with Snowflake
Snowflake Computing
 
Snowflake Company Presentation
AndrewJiang18
 
Snowflake Data Science and AI/ML at Scale
Adam Doyle
 
Databricks Platform.pptx
Alex Ivy
 
An overview of snowflake
Sivakumar Ramar
 
Snowflake Architecture
mymailforspamfr
 
Demystifying Data Warehouse as a Service
Snowflake Computing
 
Delivering Data Democratization in the Cloud with Snowflake
Kent Graziano
 
Data Sharing with Snowflake
Snowflake Computing
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Cathrine Wilhelmsen
 
Demystifying Data Warehousing as a Service - DFW
Kent Graziano
 
Data Warehouse - Incremental Migration to the Cloud
Michael Rainey
 
Building a modern data warehouse
James Serra
 
Snowflake Datawarehouse Architecturing
Ishan Bhawantha Hewanayake
 
Actionable Insights with AI - Snowflake for Data Science
Harald Erb
 
Master the Multi-Clustered Data Warehouse - Snowflake
Matillion
 
Modern Data architecture Design
Kujambu Murugesan
 

Similar to Zero to Snowflake Presentation (20)

PDF
Does it only have to be ML + AI?
Harald Erb
 
PDF
Laboratorio práctico: Data warehouse en la nube
Software Guru
 
PDF
Demystifying Data Warehousing as a Service (GLOC 2019)
Kent Graziano
 
PDF
Snowflake Data Cloud Differentiators !!!
waydebiz
 
PDF
Delivering rapid-fire Analytics with Snowflake and Tableau
Harald Erb
 
PDF
Dataiku & Snowflake Meetup Berlin 2020
Harald Erb
 
PDF
Idera live 2021: Keynote Presentation The Future of Data is The Data Cloud b...
IDERA Software
 
PDF
All course slides.pdf
ssuser98bffa1
 
PDF
Best-Practices-for-Using-Tableau-With-Snowflake.pdf
ssuserf8f9b2
 
PDF
Demystifying Data Warehouse as a Service (DWaaS)
Kent Graziano
 
PDF
week1slides1704202828322.pdf
TusharAgarwal49094
 
PDF
From Raw Data to an Interactive Data App in an Hour: Powered by Snowpark Python
HostedbyConfluent
 
PDF
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Certus Solutions
 
PDF
SnowPro Core Study Guide for certification.pdf
ssuser070aca
 
PPTX
Optimizing Snowflake Credits and Performance
johan86326
 
PDF
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
Adaryl "Bob" Wakefield, MBA
 
PPTX
10 Reasons Snowflake Is Great for Analytics
Senturus
 
PDF
Modernize your Infrastructure and Mobilize Your Data
Precisely
 
PPTX
Elastic Data Warehousing
Snowflake Computing
 
PDF
Changing the game with cloud dw
elephantscale
 
Does it only have to be ML + AI?
Harald Erb
 
Laboratorio práctico: Data warehouse en la nube
Software Guru
 
Demystifying Data Warehousing as a Service (GLOC 2019)
Kent Graziano
 
Snowflake Data Cloud Differentiators !!!
waydebiz
 
Delivering rapid-fire Analytics with Snowflake and Tableau
Harald Erb
 
Dataiku & Snowflake Meetup Berlin 2020
Harald Erb
 
Idera live 2021: Keynote Presentation The Future of Data is The Data Cloud b...
IDERA Software
 
All course slides.pdf
ssuser98bffa1
 
Best-Practices-for-Using-Tableau-With-Snowflake.pdf
ssuserf8f9b2
 
Demystifying Data Warehouse as a Service (DWaaS)
Kent Graziano
 
week1slides1704202828322.pdf
TusharAgarwal49094
 
From Raw Data to an Interactive Data App in an Hour: Powered by Snowpark Python
HostedbyConfluent
 
Melbourne: Certus Data 2.0 Vault Meetup with Snowflake - Data Vault In The Cl...
Certus Solutions
 
SnowPro Core Study Guide for certification.pdf
ssuser070aca
 
Optimizing Snowflake Credits and Performance
johan86326
 
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
Adaryl "Bob" Wakefield, MBA
 
10 Reasons Snowflake Is Great for Analytics
Senturus
 
Modernize your Infrastructure and Mobilize Your Data
Precisely
 
Elastic Data Warehousing
Snowflake Computing
 
Changing the game with cloud dw
elephantscale
 
Ad

Recently uploaded (20)

PDF
Persuasive AI: risks and opportunities in the age of digital debate
Speck&Tech
 
PPTX
Building a Production-Ready Barts Health Secure Data Environment Tooling, Acc...
Barts Health
 
PDF
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
PDF
Impact of IEEE Computer Society in Advancing Emerging Technologies including ...
Hironori Washizaki
 
PPTX
MSP360 Backup Scheduling and Retention Best Practices.pptx
MSP360
 
PDF
TrustArc Webinar - Data Privacy Trends 2025: Mid-Year Insights & Program Stra...
TrustArc
 
PDF
Windsurf Meetup Ottawa 2025-07-12 - Planning Mode at Reliza.pdf
Pavel Shukhman
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PDF
The Builder’s Playbook - 2025 State of AI Report.pdf
jeroen339954
 
PDF
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
PDF
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
PDF
SFWelly Summer 25 Release Highlights July 2025
Anna Loughnan Colquhoun
 
PDF
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
PDF
Building Resilience with Digital Twins : Lessons from Korea
SANGHEE SHIN
 
PPTX
Top iOS App Development Company in the USA for Innovative Apps
SynapseIndia
 
PDF
NewMind AI Journal - Weekly Chronicles - July'25 Week II
NewMind AI
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
Persuasive AI: risks and opportunities in the age of digital debate
Speck&Tech
 
Building a Production-Ready Barts Health Secure Data Environment Tooling, Acc...
Barts Health
 
Jak MŚP w Europie Środkowo-Wschodniej odnajdują się w świecie AI
dominikamizerska1
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
Impact of IEEE Computer Society in Advancing Emerging Technologies including ...
Hironori Washizaki
 
MSP360 Backup Scheduling and Retention Best Practices.pptx
MSP360
 
TrustArc Webinar - Data Privacy Trends 2025: Mid-Year Insights & Program Stra...
TrustArc
 
Windsurf Meetup Ottawa 2025-07-12 - Planning Mode at Reliza.pdf
Pavel Shukhman
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
The Builder’s Playbook - 2025 State of AI Report.pdf
jeroen339954
 
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
SFWelly Summer 25 Release Highlights July 2025
Anna Loughnan Colquhoun
 
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
Building Resilience with Digital Twins : Lessons from Korea
SANGHEE SHIN
 
Top iOS App Development Company in the USA for Innovative Apps
SynapseIndia
 
NewMind AI Journal - Weekly Chronicles - July'25 Week II
NewMind AI
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
Ad

Zero to Snowflake Presentation

  • 1. © 2018 Snowflake Computing Inc. All Rights Reserved. August 24, 2018© Snowflake Computing Inc. All Rights Reserved ZERO TO SNOWFLAKE IN 90 MINUTES in partnership with
  • 2. © 2018 Snowflake Computing Inc. All Rights Reserved. • Download Materials @ https://tinyurl.com/yy9frfuw (Hotmail / outlook / o365) • Download Worksheets and unzip • Partner Introduction • Snowflake Introduction • Data Warehousing Today • What Users do with Snowflake • Hands on Snowflake • Data loading- sample data set is shared • Integrations • Multi-clustering • And more! • Scaling/Workload Isolation • Snowflake in the Real World- Data Sharing and Customer Example • Conclusion AGENDA
  • 3. © 2018DMICONFIDENTIAL&PROPRIETARY 3 DATA & ANALYTICS Snowflake Solution Integration Partner Creating an intelligent existence by linking physical & digital worlds to unleash the power of connectivity 〉 DATA PLATFORM SOLUTIONS 〉 MODERN DATA SOLUTIONS 〉 VISUAL SOLUTIONS 〉 ADVANCED ANALYTICS 〉 ENTERPRISE DATA STRATEGY 〉 EXECUTIVE ADVISORY 〉 ANALYTICS AS A SERVICE 〉 AGILE ANALYTICS
  • 4. © 2018DMICONFIDENTIAL&PROPRIETARY 4 DATA & ANALYTICS PARTNERSHIPS ***8 MOST IMPORTANT AI AND ANALYTIC TRENDS OF 2019 EBOOK AND MORE INFO ABOUT DMI @ WWW.DMINC.COM/EBOOK M Today’s Moderators 〉 Brett VanderPlaats Sr. Data Architect, Data Platform & Analytics 〉 David Mellinger Practice Lead, Data Platform & Analytics
  • 5. 1,500+ CUSTOMERS Building new analytic applications Delivered new analytic application to pharmacies using Snowflake Moving to the cloud Using Snowflake to move data analytics to the cloud Modernizing data platforms Replaced data warehouse appliance + Hadoop with Snowflake Accelerating enterprise BI and analytics Moved from legacy data warehouse systems (appliance & cloud) to Snowflake © 2018 Snowflake Computing Inc. All Rights Reserved. 5
  • 6. © 2018 Snowflake Computing Inc. All Rights Reserved. What Are Users Doing Today?
  • 7. © 2018 Snowflake Computing Inc. All Rights Reserved.
  • 8. © 2018 Snowflake Computing Inc. All Rights Reserved. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved What would you like in your Data Warehouse? 8 Complete SQL Database Zero Management All of Your Data All of Your Users Pay Only for What You Use Live Data Sharing
  • 9. © 2018 Snowflake Computing Inc. All Rights Reserved. What Are Users Doing with Snowflake?
  • 10. © 2018 Snowflake Computing Inc. All Rights Reserved. What Are User Doing with Snowflake?
  • 11. © 2018 Snowflake Computing Inc. All Rights Reserved. What Are User Doing with Snowflake?
  • 12. © 2018 Snowflake Computing Inc. All Rights Reserved.
  • 13. © 2018 Snowflake Computing Inc. All Rights Reserved. Snowflake’s Differentiating Architecture
  • 14. © 2018 Snowflake Computing Inc. All Rights Reserved. SNOWFLAKE’S MULTI-CLUSTER, SHARED DATA ARCHITECTURE Centralized storage Instant, automatic scalability & elasticity Service Compute Storage
  • 15. © 2018 Snowflake Computing Inc. All Rights Reserved. Why Scaling Compute Saves $$$$$$
  • 16. © 2018 Snowflake Computing Inc. All Rights Reserved. How Does Snowflake Fit? ADVANCED ANALYTICS INTEGRATION TOOLS ELT Stream
  • 17. © 2018 Snowflake Computing Inc. All Rights Reserved. Scale compute and concurrency ADF
  • 18. © 2018 Snowflake Computing Inc. All Rights Reserved. Scale Up (Response Time) Queries that are: --Large, Complex, many Calculations Scale Out (Throuhput) Many users or processes concurrently querying S M L XL 2XXS 3X 4X XS XS XS XS XS XS XS XS XS L L 2X 2X2X2X 4X 4X 4X
  • 19. © 2018 Snowflake Computing Inc. All Rights Reserved. Differences from other EDW vendors
  • 20. © 2018 Snowflake Computing Inc. All Rights Reserved. Business Implications for Organizations • Native ANSI-SQL database for leveraging existing skills • Reducing expensive retraining • Interoperability with existing tools (Power BI, Tableau, and others have SF connectors) • Simplified Migration • Support/Documentation on Snowflake – Take a look at
  • 21. © 2018 Snowflake Computing Inc. All Rights Reserved. But Wait…. …There’s
  • 22. © 2018 Snowflake Computing Inc. All Rights Reserved. DEV OPS?
  • 23. © 2018 Snowflake Computing Inc. All Rights Reserved. Data Protection w/o Restore?
  • 24. © 2018 Snowflake Computing Inc. All Rights Reserved. Data Sharing (Monetization) Disaster Recovery ***Platform (AZURE AWS) in 2019
  • 25. © 2018 Snowflake Computing Inc. All Rights Reserved. Native Support for structured and semi- structured data
  • 26. © 2018 Snowflake Computing Inc. All Rights Reserved. Cross Region Data Replication
  • 27. © 2018 Snowflake Computing Inc. All Rights Reserved. Comprehensive data protection
  • 28. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved BIO-BREAK THEN LET’S DIVE INTO SNOWFLAKE! https://tinyurl.com/yy9frfuw
  • 29. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 1 Snowflake DB/WH/Object Configuration
  • 30. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 2 Queries and Performance
  • 31. © 2018 Snowflake Computing Inc. All Rights Reserved. Azure Storage
  • 32. © 2018 Snowflake Computing Inc. All Rights Reserved. Performance through Caching Azure Storage
  • 33. © 2018 Snowflake Computing Inc. All Rights Reserved. Query Profiler•Processing — time spent on data processing by the CPU. •Local Disk IO — time when the processing was blocked by local disk access. •Remote Disk IO — time when the processing was blocked by remote disk access. •Network Communication — time when the processing was waiting for the network data transfer. •Synchronization — various synchronization activities between participating processes. •Initialization — time spent setting up the query processing. •IO — information about the input-output operations performed during the query: • Scan progress — the percentage of data scanned for a given table so far. • Bytes scanned — the number of bytes scanned so far. • Percentage scanned from cache — the percentage of data scanned from the local disk cache. • Bytes written — bytes written (e.g. when loading into a table). • Bytes written to result — bytes written to a result object. • Bytes read from result — bytes read from a result object. • External bytes scanned — bytes read from an external object, e.g. a stage. •DML — statistics for Data Manipulation Language (DML) queries: • Number of rows inserted — number of rows inserted into a table (or tables). • Number of rows updated — number of rows updated in a table. • Number of rows deleted — number of rows deleted from a table. • Number of rows unloaded — number of rows unloaded during data export. • Number of bytes deleted — number of bytes deleted from a table. •Pruning — information on the effects of table pruning: • Partitions scanned — number of partitions scanned so far. • Partitions total — total number of partitions in a given table. •Spilling — information about disk usage for operations where intermediate results do not fit in memory: • Bytes spilled to local storage — volume of data spilled to local disk. • Bytes spilled to remote storage — volume of data spilled to remote disk. •Network — network communication: • Bytes sent over the network — amount of data sent over the network. •EXECUTION TIME •STATISTICS
  • 34. © 2018 Snowflake Computing Inc. All Rights Reserved. Multi-cluster Warehouse
  • 35. © 2018 Snowflake Computing Inc. All Rights Reserved.
  • 36. © 2018 Snowflake Computing Inc. All Rights Reserved. Micropartion s & Pruning
  • 37. © 2018 Snowflake Computing Inc. All Rights Reserved. Enterprise Grade Security
  • 38. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 3 Unstructured Data
  • 39. © 2018 Snowflake Computing Inc. All Rights Reserved.
  • 40. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 4 Loading Data
  • 41. © 2018 Snowflake Computing Inc. All Rights Reserved. • 4 objects for data loading 1) Source 2) Warehouse 3) Database 4) File Format (default CSV) • 100 mb file is optimum size
  • 42. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved Module 5 Cloning / Time Travel
  • 43. © 2018 Snowflake Computing Inc. All Rights Reserved. Multi-cluster Warehouse
  • 44. August 24, 2018© Snowflake Computing Inc. All Rights ReservedAugust 24, 2018© Snowflake Computing Inc. All Rights Reserved THANK YOU
  • 45. © 2018 Snowflake Computing Inc. All Rights Reserved.
  • 46. © 2018 Snowflake Computing Inc. All Rights Reserved. USE CASES FOR SNOWFLAKE DATA SHARING • Nielsen is a global information, data, and measurement company • Nielsen knows “What People Watch, Listen To, and Buy” • Nielsen Marketing Cloud includes eXelate DMP which provides unified consumer profiles • Nielsen sells selective slices of their DMP data available to advertisers for particular marketing campaigns • Nielsen plans to use data sharing for making detailed datasets available to subscribers • Lower friction, lower cost solution • Scalable operations
  • 47. 61© 2016 Snowflake Computing Inc. All Rights Reserved. CUSTOMER EXAMPLE: BLACKBOARD Jay White Director, Software Engineering Scenario Provide and perfect over 14 different data products that help universities facilitate learning online Pain Points Disparate data Challenges integrating data Semi-structured data Solution Replace existing Hadoop and RDBMS system with Snowflake Everything that we did left our jaw on the table. ‘Wait – we’ve never done anything like that.’ Or, ‘How did that just run so fast.’ We are getting 16x performance from Snowflake.
  • 48. 62© 2016 Snowflake Computing Inc. All Rights Reserved. A NEW DATA PIPELINE FOR BLACKBOARD Snowflake S 3 Student data Mobile data Collaborative data Intelsuite data Kafka Airflow for data orchestration Looker for internal dashboards Learning Management System Blackboard Predict R Prediction Engine Unified data Simplified data transformation Existing tools integrate seamlessly • 16x performance improvement over SQL • 1 PB by the end of 2017
  • 49. © 2018 Snowflake Computing Inc. All Rights Reserved. Built-in disaster recover and high availabity