© 2020 Snowflake Inc. All Rights Reserved
Introduction to Snowflake
Dataiku Berlin Meetup
25 February 2020
Harald Erb | Sr. Solutions Engineer
Quick Intro to Snowflake
SNOWFLAKE TIMELINE
Founded in 2012 by industry veterans with over 120 database patents
~$1.5BN in venture capital funding from leading investors
~$12.4BN valuation
First customers 2014, general availability 2015
1,800+ employees
3,500+ customers today
Gartner and Forrester “Leader”
FUN FACTS
Queries processed in Snowflake per day: > 300 million
Largest single table: > 68 trillion rows
Largest number of tables in a single DB: > 200,000
Single customer with the most data: > 55PB
Single customer with the most users: > 10,000
JOURNEY TO A CLOUD DATA PLATFORM
Chart: Value of Data over Time, with stages labeled On Premises EDW, 1st Gen Cloud EDW, Data Lake / Hadoop, and Cloud Data Platform; further labels: All Data, All Users, Fast Answers, SQL Database
You can’t use yesterday’s technology to solve today’s data problems, and definitely not tomorrow’s.
A REAL-WORLD PROBLEM
Data Warehouse Appliance Resource Usage
Heatmap of the weekly usage profile showing critical peak usage times, but also a low average CPU usage of 33% per week
Typical 24h usage profile showing the two main workload groups “competing for Data Warehouse Appliance resources”
Workload groups over time: Data Loading, ETL, Aggregation; Reporting, BI
And what about analytics workloads?
How can we support new data initiatives?
SNOWFLAKE ARCHITECTURE
Scale Out Services
Multi-Cluster Compute
Centralized Storage
Cloud Agnostic Layer
Functional Architecture
Diagram labels: Data Load, Structured & Semi-Structured, Data Transformation, Data Science, Marketing, Finance, Analytics / Reporting / BI, App; compute sizes XS, S, M, L, L
Enabling one or multiple data teams/projects to drive innovation…
Functional Architecture
Diagram labels: Data Load, Structured & Semi-Structured, Data Transformation, Data Science, Marketing, Finance, Analytics / Reporting / BI, App; compute sizes XS, S, M, L, L, L, XL
…they are even allowed to scale up compute resources when needed…
Functional Architecture
Diagram labels: Data Load, Structured & Semi-Structured, Data Transformation, Data Science, Marketing, Finance, Analytics / Reporting / BI, App; compute sizes XS, S, M, M, L, L, L, XL
…without slowing down other active users of the Cloud Data Platform.
Functional Architecture
Diagram labels: Data Load, Structured & Semi-Structured, Data Transformation, Marketing, Finance, Analytics / Reporting / BI, App; compute sizes XS, S, M, M, M, L, L
Secure Sharing & Collaboration: Your Private Data Exchange, Public Data Exchange, Your Business Ecosystem, Your Employees
Functional Architecture
Diagram labels: Data Load, Structured & Semi-Structured, Data Transformation, Marketing, Finance, Analytics / Reporting / BI, App; compute sizes XS, S, M, M, M, M, L, L
Secure Sharing & Collaboration: Your Private Data Exchange, Public Data Exchange, Your Business Ecosystem, Your Employees
Clone: Test/Dev
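The Clone and Test/Dev labels presumably refer to Snowflake's zero-copy cloning, where a `CREATE ... CLONE` statement creates a test/dev copy of a database without first duplicating the stored data. A minimal sketch that only builds the statement text; the database names `prod_db` and `dev_db` are hypothetical and not taken from the slides, and running the statement would need a Snowflake session.

```python
# Sketch only: builds SQL text, does not connect to Snowflake.
# The database names used below are hypothetical placeholders.

def clone_database_sql(source_db: str, target_db: str) -> str:
    """Return a CREATE DATABASE ... CLONE statement for a test/dev copy."""
    return f"CREATE DATABASE {target_db} CLONE {source_db}"


print(clone_database_sql("prod_db", "dev_db"))
```

The resulting string could be passed to a cursor's `execute` method, for example with the Snowflake Python Connector mentioned in the demo flow.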
ONE PLATFORM, SHARED DATA, MANY WORKLOADS
Data Sources: OLTP Databases, Enterprise Applications, Third-Party, Web/Log Data, IoT
ETL, Streaming
Workloads: Data Warehouse, Data Lake, Data Engineering, Data Exchange, Data Applications, Data Science
Data Consumers: Data Monetization, Operational Reporting, Ad Hoc Analysis, Real-time Analytics
Live-Demo!
Demo
TYPICAL SNOWFLAKE SETUP & DEMO FLOW
Diagram labels: Customer On-premise environment; Snowflake Driver/Clients; Customer Cloud; AWS PrivateLink; AWS Direct Connect; Snowflake Customer Account; Frankfurt; CDN; S3 Endpoint; Snowflake S3 bucket; OCSP cache; External Stage (COPY data); Internal Stage (GET data, Large Results)
Data Exploration / Data Science (Python/JupyterLab)
4. Use the Snowflake Python Connector to access and prepare time series data
5. Train a time series (TS) model, predict future values, and plot the forecast along with historical data
6. Write the predictions back into a new Snowflake table
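Steps 4 to 6 can be sketched roughly as follows. The slides do not say which time series model the demo trains, so a seasonal-naive baseline (repeat the last observed season) stands in for it here. The table name `forecast_demo`, the column names, and the sample values are hypothetical. `write_predictions` expects a DB-API style cursor; with the Snowflake Python Connector this would come from `snowflake.connector.connect(...).cursor()`, which is assumed to use the connector's default `%s` parameter style.

```python
from typing import List, Sequence, Tuple


def seasonal_naive_forecast(
    history: Sequence[float], season_length: int, horizon: int
) -> List[float]:
    """Predict `horizon` future values by repeating the last observed season."""
    if season_length <= 0 or len(history) < season_length:
        raise ValueError("need at least one full season of history")
    last_season = list(history[-season_length:])
    return [last_season[i % season_length] for i in range(horizon)]


def write_predictions(cursor, table: str, rows: Sequence[Tuple[int, float]]) -> None:
    """Create the target table if needed and insert (step, predicted) rows.

    `cursor` is any DB-API style cursor that accepts %s placeholders.
    `table` is interpolated as-is, so it must be a trusted identifier.
    """
    cursor.execute(
        f"CREATE TABLE IF NOT EXISTS {table} (step INTEGER, predicted FLOAT)"
    )
    cursor.executemany(
        f"INSERT INTO {table} (step, predicted) VALUES (%s, %s)", list(rows)
    )


# Hypothetical history with a season of length 3.
history = [10.0, 12.0, 15.0, 11.0, 13.0, 16.0]
forecast = seasonal_naive_forecast(history, season_length=3, horizon=4)

# Number the predicted steps so they continue after the history.
rows = [(len(history) + i, value) for i, value in enumerate(forecast)]
print(rows)
```

Plotting the forecast next to the historical data (step 5) is left out; in a JupyterLab notebook this would typically be done with a plotting library.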
Customer “Data Lake”
Snowflake Web UI
1. Provision a compute cluster via SQL command
2. Resize the compute cluster and load data from an external S3 bucket
3. Analyze the data using SQL and prepare a secure database view for other users
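Steps 1 to 3 map onto a handful of Snowflake SQL statements. The sketch below only assembles the statement text, so that it can be read or passed to a cursor later. All object names, the chosen warehouse sizes, the auto-suspend setting, and the CSV file format are assumptions for illustration. It also assumes that an external stage pointing at the S3 bucket already exists.

```python
from typing import List

# Sketch only: builds SQL text, does not connect to Snowflake.


def demo_flow_sql(warehouse: str, table: str, stage: str, view: str) -> List[str]:
    """Return the SQL statements for demo steps 1 to 3, in order."""
    return [
        # Step 1: provision a compute cluster (virtual warehouse) via SQL.
        f"CREATE WAREHOUSE IF NOT EXISTS {warehouse} "
        "WITH WAREHOUSE_SIZE = 'XSMALL' AUTO_SUSPEND = 60 AUTO_RESUME = TRUE",
        # Step 2a: resize the warehouse before the bulk load.
        f"ALTER WAREHOUSE {warehouse} SET WAREHOUSE_SIZE = 'LARGE'",
        # Step 2b: load files from the external stage (assumed to point at S3).
        f"COPY INTO {table} FROM @{stage} "
        "FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1)",
        # Step 3: expose the loaded data to other users through a secure view.
        f"CREATE OR REPLACE SECURE VIEW {view} AS SELECT * FROM {table}",
    ]


# Hypothetical object names.
for statement in demo_flow_sql("demo_wh", "trips", "s3_stage", "trips_secure_v"):
    print(statement + ";")
```

A real view for step 3 would usually select and filter specific columns rather than `SELECT *`; the analysis queries themselves depend on the demo data and are not shown in the slides.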
Connecting the dots
SNOWFLAKE REFERENCE ARCHITECTURE FOR ANALYTICS
SNOWFLAKE: A SCALABLE + POWERFUL DATA PROCESSING BACKEND FOR DATAIKU!
THANK YOU
