Copyright Global Data Strategy, Ltd. 2022
Business Intelligence & Data Analytics:
An Architected Approach
Donna Burbank
Global Data Strategy, Ltd.
June 23rd, 2022
KATANA GRAPH |
TM
Katana Graph
June 2022
Company Overview
High Performance Scale-out Graph Processing & Analytics
• Founded in March 2020; offices in Austin, Bay Area, NYC, Denver
• Co-founders: Keshav Pingali and Chris Rossbach
• Investors: Intel Capital, Dell Venture Capital, Redline Ventures, Walden International
• Katana team: Leaders in graph algorithms, programming languages, runtimes, virtualization and storage
• Commercial engagements with several Fortune 100 companies
• Website: www.katanagraph.com
Leadership Team
Gurbinder Gill
PhD UT Austin
VMware, Facebook,
MSR, IBM Research
Roshan Dathathri
PhD UT Austin
NI, MSR, HP Labs
Emmett Witchel
Prof UT Austin
InCert, Veritas,
Symantec
Bo Wu
Prof Colorado
School of Mines
Graph mining expert
Donald Nguyen
PhD UT Austin
Google, Synthace,
Determined AI
Tyler Hunt
PhD UT Austin
MSR, Visa Research,
Bell Labs
Jon Currey
University of Cambridge
Distributed Systems,
Machine Learning
MSR, Apple (iTunes), Oracle
Yige Hu
PhD UT Austin
File System,
Fault Tolerance
Amy Chang
Board Advisor
BOD P&G, Cisco, Disney
UCSF Hospital Exec Committee
Dean's Advisory Council
Stanford University
Ying Ding
Data Science Advisor
Professor UT Austin
Medical/ Pharma Knowledge Graph,
Machine Learning
Co-founder Data2Discovery
Keshav Pingali
CEO, Co-founder
Prof UT Austin
Fellow ACM, IEEE, AAAS
Chris Rossbach
CTO, Co-founder
Prof UT Austin
MSR, VMware, Canesta
Farshid Sabet
CBO, Co-founder
Intel, Movidius,
Aptina, SanDisk
Graph Technology
Application Areas
Platforms
Finance
Healthcare
Retail
Energy Industrial
Telecom
Genomics
Anti Money Laundering
Drug Discovery
Identity Graph
Precision Medicine
Electronic Circuit Design Tools
Knowledge Graph
Predictive Monitoring
Intrusion Detection
Supply Chain Optimization
Fraud Detection
Real Time Analytics
Customer 360
Recommendation
Social Networks
Why Katana Graph
Architected to handle massive graphs
• Tested with largest publicly available
web-crawl: WDC12 (3.5B vertices, 128B edges)
Unmatched performance
• 10x - 100x faster than competing solutions
Massive scalability
• Proven on Open Cloud HPC Clusters
(AWS , Azure, Google Cloud)
• Scales up to 256 machines on Stampede Xeon
(Skylake) Cluster
Native AI/ML with Graphs
• Health and Life Sciences (HLS), Financial, Identity
Management, Intrusion detection, EDA (Electronic
Design Automation), HPC (High Performance
Computing) application: 3D mesh generation
Graph Compute Domains
Graph Database (Query)
Graph AI & Machine Learning
Graph Analytics & Mining
Probability
Global Data Strategy, Ltd. 2022 www.globaldatastrategy.com
Donna Burbank
Donna is a recognised industry expert in
information management with over 20 years
of experience in data strategy, information
management, data modeling, metadata
management, and enterprise architecture.
Her background is multi-faceted across
consulting, product development, product
management, brand strategy, marketing, and
business leadership.
She is currently the Managing Director at
Global Data Strategy, Ltd., an international
information management consulting company
that specializes in the alignment of business
drivers with data-centric technology.
In past roles, she has served in key brand
strategy and product management roles at CA
Technologies and Embarcadero Technologies
for several of the leading data management
products in the market.
As an active contributor to the data
management community, she is a long-time
DAMA International member, Past President
and Advisor to the DAMA Rocky Mountain
chapter, and was awarded the Excellence in
Data Management Award from DAMA
International.
She has worked with dozens of Fortune 500
companies worldwide in the Americas,
Europe, Asia, and Africa and speaks regularly
at industry conferences. She has co-authored
several books and is a regular contributor to
industry publications. She can be reached at
donna.burbank@globaldatastrategy.com
Donna is based in Boulder, Colorado, USA.
Follow on Twitter @donnaburbank
@GlobalDataStrat
Twitter Event hashtag: #DAStrategies
DATAVERSITY Data Architecture Strategies
• January Emerging Trends in Data Architecture – What’s the Next Big Thing?
• February Building a Data Strategy - Practical Steps for Aligning with Business Goals
• March Master Data Management – Aligning Data, Process, and Governance
• April Data Governance & Data Architecture: Alignment & Synergies
• May Improving Data Literacy Around Data Architecture
• June Business Intelligence & Data Analytics: An Architected Approach
• July Best Practices in Metadata Management
• August Data Quality Best Practices
• September Business-centric Data Modeling
• October Graph Databases: Benefits & Risks
• December Enterprise Architecture vs. Data Architecture
This Year’s Lineup
What We’ll Cover Today
• Business intelligence (BI) and data analytics are
increasing in popularity as more organizations are
looking to become more data-driven.
• Many tools have powerful visualization
techniques that can create dynamic displays of
critical information.
• To ensure that the data displayed on these
visualizations is accurate and timely, a strong
Data Architecture is needed.
• This webinar will discuss how to create a robust
Data Architecture for BI and data analytics that
takes both business and technology needs into
consideration.
Data-Driven Business
70% of organizations feel that their
organization sees data as a strategic asset*.
70% of respondents indicated that reporting and
analytics were key drivers for data
management.**
>50% identified improved collaboration
through using a defined data architecture. **
* based on research from a 2019 DATAVERSITY survey on “Trends in Data Management” by Donna Burbank and Michelle Knight
** based on research from a 2021 DATAVERSITY survey on “Trends in Data Management” by Donna Burbank and Michelle Knight
Main Business Goals & Drivers for Data Management
[Bar chart: survey responses (select all that apply), horizontal axis from 0.00% to 80.00%. Goals listed on the chart:]
• Gaining Competitive Advantage
• Improving Outcomes (e.g. health, education, etc.)
• Improving Product Quality
• Increasing Revenue and Growth
• Improving Customer Satisfaction
• Complying with Regulations
• Saving Cost and Increasing Efficiency
• Reducing Risk
• Supporting Digital Transformation
• Gaining Insights through Reporting and Analytics
Supporting Reporting & Analytics
[Figure: ACME Inc. Sales Dashboard, filtered to Product: Widget 1 and Region: NA, with a time axis from 2018 to 2022 and the labels Super Widget Pack, Widget 1, and Widget 2. Callout: “What about the data?”]
Successful reporting & analytics includes:
• Data-driven culture
  • Do we use dashboards in our sales meetings?
  • Or go by “gut feel”?
  • How can we integrate analytics into our sales cycle (e.g. predictive next best offer)?
• Data Governance
  • How do we define “Total Revenue”?
  • What countries are included in South America?
• Data Quality
  • Are these revenue numbers accurate?
  • What’s the source of the product data?
• Data Architecture
  • How are we storing the data so that we can accurately & efficiently slice and dice it for these reports?
What is the Correct Architecture to Power Reporting & Analytics?
… There is a Cacophony of Options …
Data Warehouse
Data Lake
Data Lakehouse
Data Marketplace
Metadata Catalog
Relational, Nonrelational, Star Schema, SQL, NoSQL, Graph, Document Store, Real-time Streaming, Time series…
What are Current Organizational Priorities?
* based on research from a 2021
DATAVERSITY survey on “Trends in Data
Management” by Donna Burbank and
Michelle Knight
Using a Data Lake in Conjunction with a Data Warehouse
Integrating Multiple Paradigms
• The Data Lake has a different architecture & purpose than traditional data sources such as data
warehouses.
• But the two environments can co-exist to share relevant information.
[Diagram: a Data Lake shown alongside enterprise systems of record, with the following labels]
Data Analysis & Discovery – Data Lake
• Sandbox
• Lightly Modeled Data
• Data Exploration
Enterprise Systems of Record
• Master & Reference Data
• Data Warehouse
• Data Marts
• Operational Data
Reporting & Analytics
• Advanced Analytics
• Self-Service BI
• Standard BI Reports
Also shown: Data Governance & Collaboration; Security & Privacy
A Holistic Approach is Needed
Level 1: “Top-Down” alignment with business priorities
Level 2: Managing the people, process, policies & culture around data
Level 3: Leveraging data for strategic advantage
Level 4: Coordinating & integrating disparate data sources
Level 5: “Bottom-Up” management & inventory of data sources
The Design Aspect of
Data Architecture for BI & Analytics
A little data modeling up-front
… prevents headaches down the road
From Data Modeling for the Business by Hoberman, Burbank, Bradley, Technics Publications, 2009
• It’s often tempting to skip data
modeling documentation because it’s
“faster”
• But long-term, skipping it ultimately takes longer, because the
resulting errors and inconsistencies need to be fixed.
“If you don’t have time to do it right, do
you have time to do it again?”
Levels of Data Models
Enterprise Subject Areas
• Purpose: Organization & Scoping of main business domain areas
• Audience: Business Stakeholders, Data Architects
Conceptual (Business Concepts)
• Purpose: Communication & Definition of Business Concepts & Rules
• Audience: Business Stakeholders, Data Architects
Logical (Data Entities)
• Purpose: Clarification & Detail of Business Rules & Data Structures
• Audience: Data Architects, Business Analysts
Physical (Physical Tables)
• Purpose: Technical Implementation on a Physical Database
• Audience: DBAs, Developers
Different Physical Models for Different Use Cases
Relational – Normal Form
• Reduce redundancy for
operational data
• Increase data quality
• Ensure consistency (ACID
transactions)
Dimensional – Star Schema
• Ease of reporting for summarized
and historical data
• Ability to easily “slice and dice” for
self-service reporting
• Performance and flexibility
NoSQL
• Speed of retrieval, low latency
• High data volumes
• Flexibility for change
…And More!
• There are numerous ways to model and store data.
• Hierarchical/XML
• Graph
• COBOL Copybook!
• S3 “buckets”
• Data Vault
• Etc…
No modeling technique is inherently “better” than another. Data use cases & purpose drive what “good” looks like.
…Rant over…
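The trade-off between a normalized relational model and a NoSQL-style document can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the customer and order records, the field names, and the helper `to_document` are all hypothetical and do not come from the slides.

```python
import json

# Hypothetical sample data; names and fields are illustrative only.

# Relational, normalized: each customer is stored once and referenced by key,
# which reduces redundancy in operational data.
customers = {1: {"name": "ACME Inc."}}
orders = [
    {"order_id": 10, "customer_id": 1, "product": "Widget 1", "amount": 100.0},
    {"order_id": 11, "customer_id": 1, "product": "Widget 2", "amount": 75.0},
]


def to_document(customer_id: int) -> dict:
    """Build a NoSQL-style document: one nested record per customer,
    so a single key lookup returns the customer and all of their orders."""
    return {
        "customer_id": customer_id,
        "name": customers[customer_id]["name"],
        "orders": [
            {k: v for k, v in o.items() if k != "customer_id"}
            for o in orders
            if o["customer_id"] == customer_id
        ],
    }


document = to_document(1)
print(json.dumps(document, indent=2))

# Renaming the customer touches one entry in the normalized form...
customers[1]["name"] = "ACME Incorporated"
# ...but the document holds its own copy and has to be rebuilt or updated separately.
stale = document["name"] != customers[1]["name"]
print("document is stale:", stale)
```

Neither shape is better in general: the normalized form favors consistency, while the document favors fast retrieval of everything about one customer.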
Is the Star Schema Dead?
The Star Schema
[Diagram: a central Fact (Measure) table surrounded by five Dimension tables. Example: Sales, reported By Month, By Customer, By Region, By Sales Rep, and By Product.]
Facts/Measures: Contain the actual values to be reported on.
What are we measuring? e.g. Activities (sales transaction,
patient visit, etc.)
• Few attributes (just numbers with links to the dimensions)
• Many values (e.g. all sales transactions)
Dimensions: Contain the details that describe the central fact.
i.e. The things we want to report by. e.g. Date, Region, Quarter
• Many attributes (Individual name, DOB, gender, etc.)
• Few values
Note: Your Master Data domains often feed these dimensions.
The Star Schema is still a user-friendly and performant way to “slice and dice” data for reporting.
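To make the fact/dimension split concrete, here is a minimal sketch using Python's built-in sqlite3 module. The table names, column names, and sample rows (including the "EMEA" region) are assumptions chosen for illustration; they are not taken from the slides.

```python
import sqlite3

# Hypothetical star schema: one fact table (sales) linked to two dimensions.
# Table and column names are illustrative, not taken from the slides.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, product_name TEXT);
    CREATE TABLE dim_region  (region_id  INTEGER PRIMARY KEY, region_name  TEXT);
    -- The fact table holds few attributes: a measure plus keys to the dimensions.
    CREATE TABLE fact_sales (
        product_id INTEGER REFERENCES dim_product(product_id),
        region_id  INTEGER REFERENCES dim_region(region_id),
        revenue    REAL
    );
    INSERT INTO dim_product VALUES (1, 'Widget 1'), (2, 'Widget 2');
    INSERT INTO dim_region  VALUES (1, 'NA'), (2, 'EMEA');
    INSERT INTO fact_sales  VALUES (1, 1, 100.0), (1, 2, 50.0), (2, 1, 75.0), (1, 1, 25.0);
""")


def revenue_by(dimension_table: str, key: str, label: str) -> dict:
    """Sum the revenue measure grouped by one dimension ("slice and dice")."""
    # Identifiers come from the fixed calls below, never from user input.
    sql = (
        f"SELECT d.{label}, SUM(f.revenue) FROM fact_sales f "
        f"JOIN {dimension_table} d ON f.{key} = d.{key} "
        f"GROUP BY d.{label} ORDER BY d.{label}"
    )
    return dict(conn.execute(sql).fetchall())


by_product = revenue_by("dim_product", "product_id", "product_name")
by_region = revenue_by("dim_region", "region_id", "region_name")
print(by_product)  # {'Widget 1': 175.0, 'Widget 2': 75.0}
print(by_region)   # {'EMEA': 50.0, 'NA': 200.0}
```

The same fact table answers both questions; only the dimension joined in changes, which is what makes the pattern convenient for self-service reporting.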
The Bus Matrix
A Bus Matrix is a simple way to keep track of what you want to report “on” (Facts) and what
you want to report “by” (Dimensions).
Rows: report “on” (Facts). Columns: report “by” (Dimensions).
Location Sales Rep Product Customer
Total Sales Revenue X X X X
Wholesale Revenue X X
Number of Returned Items
Etc.
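A bus matrix is small enough to keep as a plain data structure. The sketch below is one hypothetical way to hold it in Python. The "Total Sales Revenue" row follows the slide, but the two dimensions assigned to "Wholesale Revenue" are an assumption, because the extracted slide text does not show which columns its two marks sit under. The helper names `render` and `facts_sharing` are also made up for this example.

```python
# Rows are facts (report "on"); the set for each fact lists its dimensions (report "by").
DIMENSIONS = ["Location", "Sales Rep", "Product", "Customer"]

bus_matrix = {
    "Total Sales Revenue": {"Location", "Sales Rep", "Product", "Customer"},
    "Wholesale Revenue": {"Location", "Product"},  # assumed placement of the two marks
    "Number of Returned Items": set(),             # not yet mapped
}


def render(matrix: dict) -> str:
    """Render the matrix as fixed-width text with an X for each fact/dimension pair."""
    width = max(len(fact) for fact in matrix) + 2
    lines = [" " * width + "  ".join(DIMENSIONS)]
    for fact, dims in matrix.items():
        cells = [("X" if d in dims else "").center(len(d)) for d in DIMENSIONS]
        lines.append(fact.ljust(width) + "  ".join(cells).rstrip())
    return "\n".join(lines)


def facts_sharing(matrix: dict, dimension: str) -> list:
    """List the facts that can be reported by the given dimension."""
    return [fact for fact, dims in matrix.items() if dimension in dims]


print(render(bus_matrix))
shared_by_product = facts_sharing(bus_matrix, "Product")
print(shared_by_product)
```

Keeping the matrix in one place makes it easy to see which dimensions several facts share before any tables are designed.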
Design Patterns
There are a number of design patterns available to fit a variety of use cases
(again – there is no “one size fits all” )
Inmon vs. Kimball
The battle still rages...
Data Vault
Hubs, Links and Satellites
Flatten Everything
Popular with Data Science
Columnar
Columns vs. Rows
And More…
Choices abound…
Graph
Good for discovering connections
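The "Columns vs. Rows" distinction in the Columnar pattern can be illustrated with plain Python lists. This is a conceptual sketch only, with hypothetical sample records; real columnar stores add compression and on-disk layouts that are not modeled here.

```python
# Hypothetical sales records; field names are illustrative only.
rows = [
    {"region": "NA", "product": "Widget 1", "revenue": 100.0},
    {"region": "NA", "product": "Widget 2", "revenue": 75.0},
    {"region": "EMEA", "product": "Widget 1", "revenue": 50.0},
]


def to_columns(records: list) -> dict:
    """Pivot a row-oriented list of records into one list per column."""
    return {name: [r[name] for r in records] for name in records[0]}


columns = to_columns(rows)

# Row layout: an aggregate has to walk every record and pick out one field.
total_from_rows = sum(r["revenue"] for r in rows)
# Column layout: the same aggregate reads a single list and ignores the other fields.
total_from_columns = sum(columns["revenue"])

print(total_from_rows, total_from_columns)
```

Row layouts suit workloads that read or write whole records, while column layouts suit analytic queries that scan a few fields across many records.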
In a Typical Organization,
there are many Use Cases for Data Models
The following is just a subset of options that exist….
Operational Usage
• Web Application: NoSQL Key Value Pair for web session info
• Operational System: Relational Database for Operational Data
Transfer / Exchange
• JSON
• XML
• … Etc.
Storage for Analytics / Reporting
• Relational for Consistency & Standards
• Reporting for Analytic “Slicing & Dicing”
• Data Vault for Flexible Storage
Consumption for Analytics & Reporting
• Cubes for Business Intelligence Reporting
• Flattened tables for Analytics & Data Science
• Graph Database for Connections & Patterns
Also shown: Master Data & Hierarchies for Data Quality & Consistency
Summary
• Analytics and Reporting are key priorities for
today’s data-driven business.
• A strong data architecture is needed to support
successful analytics.
• There are many choices in the marketplace, and
at the same time, core fundamentals still apply.
• Choose your architecture wisely, and have fun
and success with the numerous options available
in today’s market.
Who We Are: Business-Focused Data Strategy
Maximize the Organizational Value of Your Data Investment
In today’s business environment, showing rapid time to value for
any technical investment is critical.
But technology and data can be complex. At Global Data Strategy,
we help demystify technical complexity to help you:
• Demonstrate the ROI and business value of data to your
management
• Build a data strategy at your pace to match your unique culture
and organizational style.
• Create an actionable roadmap for “quick wins”, while building
towards a long-term scalable architecture.
Global Data Strategy shares experience from some of the largest
international organizations, scaled to the pace of your unique team.
www.globaldatastrategy.com
Global Data Strategy has worked with organizations globally in the
following industries:
Finance · Retail · Social Services · Health Care · Education · Manufacturing
· Government · Public Utilities · Construction · Media & Entertainment ·
Insurance …. and more
Questions?
Thoughts? Ideas?

More Related Content

What's hot (20)

PDF
Data Architecture Strategies: Data Architecture for Digital Transformation
DATAVERSITY
 
PDF
Improving Data Literacy Around Data Architecture
DATAVERSITY
 
PDF
Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
PDF
Data Governance
Boris Otto
 
PDF
Data Quality Best Practices
DATAVERSITY
 
PDF
Building a Data Strategy Your C-Suite Will Support
Reid Colson
 
PDF
Data at the Speed of Business with Data Mastering and Governance
DATAVERSITY
 
PPT
Data Architecture for Data Governance
DATAVERSITY
 
PDF
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
Christopher Bradley
 
PDF
Data Governance Takes a Village (So Why is Everyone Hiding?)
DATAVERSITY
 
PPTX
Data Governance Best Practices
Boris Otto
 
PDF
Five Things to Consider About Data Mesh and Data Governance
DATAVERSITY
 
PDF
How to Use a Semantic Layer to Deliver Actionable Insights at Scale
DATAVERSITY
 
PDF
DAS Slides: Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
PDF
Graph Databases – Benefits and Risks
DATAVERSITY
 
PDF
Data Governance Best Practices
DATAVERSITY
 
PDF
Emerging Trends in Data Architecture – What’s the Next Big Thing
DATAVERSITY
 
PDF
Data Governance and Metadata Management
DATAVERSITY
 
PPTX
How to Build & Sustain a Data Governance Operating Model
DATUM LLC
 
PDF
Data Architecture Best Practices for Advanced Analytics
DATAVERSITY
 
Data Architecture Strategies: Data Architecture for Digital Transformation
DATAVERSITY
 
Improving Data Literacy Around Data Architecture
DATAVERSITY
 
Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
Data Governance
Boris Otto
 
Data Quality Best Practices
DATAVERSITY
 
Building a Data Strategy Your C-Suite Will Support
Reid Colson
 
Data at the Speed of Business with Data Mastering and Governance
DATAVERSITY
 
Data Architecture for Data Governance
DATAVERSITY
 
DMBOK 2.0 and other frameworks including TOGAF & COBIT - keynote from DAMA Au...
Christopher Bradley
 
Data Governance Takes a Village (So Why is Everyone Hiding?)
DATAVERSITY
 
Data Governance Best Practices
Boris Otto
 
Five Things to Consider About Data Mesh and Data Governance
DATAVERSITY
 
How to Use a Semantic Layer to Deliver Actionable Insights at Scale
DATAVERSITY
 
DAS Slides: Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
Graph Databases – Benefits and Risks
DATAVERSITY
 
Data Governance Best Practices
DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing
DATAVERSITY
 
Data Governance and Metadata Management
DATAVERSITY
 
How to Build & Sustain a Data Governance Operating Model
DATUM LLC
 
Data Architecture Best Practices for Advanced Analytics
DATAVERSITY
 

Similar to Business Intelligence & Data Analytics– An Architected Approach (20)

PDF
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
PDF
DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...
DATAVERSITY
 
PDF
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DATAVERSITY
 
PDF
Master Data Management – Aligning Data, Process, and Governance
DATAVERSITY
 
PDF
DAS Slides: Master Data Management – Aligning Data, Process, and Governance
DATAVERSITY
 
PDF
Data Architecture Best Practices for Today’s Rapidly Changing Data Landscape
DATAVERSITY
 
PDF
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
DATAVERSITY
 
PDF
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
PDF
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
DATAVERSITY
 
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
PDF
DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...
DATAVERSITY
 
PDF
DAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
DATAVERSITY
 
PDF
Data Modeling Techniques
DATAVERSITY
 
PDF
Data Modeling Best Practices - Business & Technical Approaches
DATAVERSITY
 
PDF
DAS Slides: Data Governance and Data Architecture – Alignment and Synergies
DATAVERSITY
 
PDF
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
DATAVERSITY
 
PDF
Data Modeling, Data Governance, & Data Quality
DATAVERSITY
 
PDF
Modern Metadata Strategies
DATAVERSITY
 
PDF
Data Lake Architecture – Modern Strategies & Approaches
DATAVERSITY
 
PDF
Data Modeling for Big Data
DATAVERSITY
 
DAS Slides: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
DAS Slides: Building a Data Strategy — Practical Steps for Aligning with Busi...
DATAVERSITY
 
DAS Slides: Building a Future-State Data Architecture Plan - Where to Begin?
DATAVERSITY
 
Master Data Management – Aligning Data, Process, and Governance
DATAVERSITY
 
DAS Slides: Master Data Management – Aligning Data, Process, and Governance
DATAVERSITY
 
Data Architecture Best Practices for Today’s Rapidly Changing Data Landscape
DATAVERSITY
 
Data Architecture Strategies: Building an Enterprise Data Strategy – Where to...
DATAVERSITY
 
DAS Webinar: Emerging Trends in Data Architecture – What’s the Next Big Thing?
DATAVERSITY
 
Data Architecture Strategies Webinar: Emerging Trends in Data Architecture – ...
DATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
DATAVERSITY
 
DAS Slides: Building a Data Strategy – Practical Steps for Aligning with Busi...
DATAVERSITY
 
DAS Slides: Self-Service Reporting and Data Prep – Benefits & Risks
DATAVERSITY
 
Data Modeling Techniques
DATAVERSITY
 
Data Modeling Best Practices - Business & Technical Approaches
DATAVERSITY
 
DAS Slides: Data Governance and Data Architecture – Alignment and Synergies
DATAVERSITY
 
Lessons in Data Modeling: Why a Data Model is an Important Part of Your Data ...
DATAVERSITY
 
Data Modeling, Data Governance, & Data Quality
DATAVERSITY
 
Modern Metadata Strategies
DATAVERSITY
 
Data Lake Architecture – Modern Strategies & Approaches
DATAVERSITY
 
Data Modeling for Big Data
DATAVERSITY
 
Ad

More from DATAVERSITY (20)

PDF
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
DATAVERSITY
 
PDF
Exploring Levels of Data Literacy
DATAVERSITY
 
PDF
Make Data Work for You
DATAVERSITY
 
PDF
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 
PDF
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
PDF
Data Modeling Fundamentals
DATAVERSITY
 
PDF
Showing ROI for Your Analytic Project
DATAVERSITY
 
PDF
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
PDF
Is Enterprise Data Literacy Possible?
DATAVERSITY
 
PDF
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
PDF
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
PDF
Data Governance Trends and Best Practices To Implement Today
DATAVERSITY
 
PDF
2023 Trends in Enterprise Analytics
DATAVERSITY
 
PDF
Data Strategy Best Practices
DATAVERSITY
 
PDF
Who Should Own Data Governance – IT or Business?
DATAVERSITY
 
PDF
Data Management Best Practices
DATAVERSITY
 
PDF
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
PDF
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
DATAVERSITY
 
PDF
Empowering the Data Driven Business with Modern Business Intelligence
DATAVERSITY
 
PDF
Data Governance Best Practices, Assessments, and Roadmaps
DATAVERSITY
 
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
DATAVERSITY
 
Exploring Levels of Data Literacy
DATAVERSITY
 
Make Data Work for You
DATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
DATAVERSITY
 
Data Catalogs Are the Answer – What Is the Question?
DATAVERSITY
 
Data Modeling Fundamentals
DATAVERSITY
 
Showing ROI for Your Analytic Project
DATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
DATAVERSITY
 
Is Enterprise Data Literacy Possible?
DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
DATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
DATAVERSITY
 
2023 Trends in Enterprise Analytics
DATAVERSITY
 
Data Strategy Best Practices
DATAVERSITY
 
Who Should Own Data Governance – IT or Business?
DATAVERSITY
 
Data Management Best Practices
DATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
DATAVERSITY
 
Empowering the Data Driven Business with Modern Business Intelligence
DATAVERSITY
 
Data Governance Best Practices, Assessments, and Roadmaps
DATAVERSITY
 
Ad

Recently uploaded (20)

PDF
apidays Munich 2025 - Developer Portals, API Catalogs, and Marketplaces, Miri...
apidays
 
PPTX
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
PDF
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
apidays
 
PPTX
short term internship project on Data visualization
JMJCollegeComputerde
 
PDF
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PPTX
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
PPTX
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
PPTX
Probability systematic sampling methods.pptx
PrakashRajput19
 
PPTX
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
PPTX
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
PDF
apidays Munich 2025 - The Physics of Requirement Sciences Through Application...
apidays
 
PDF
apidays Munich 2025 - Integrate Your APIs into the New AI Marketplace, Senthi...
apidays
 
PPTX
White Blue Simple Modern Enhancing Sales Strategy Presentation_20250724_21093...
RamNeymarjr
 
PPTX
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
PPT
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
PPTX
Introduction to computer chapter one 2017.pptx
mensunmarley
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
Insurance-Analytics-Branch-Dashboard (1).pptx
trivenisapate02
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PDF
Top Civil Engineer Canada Services111111
nengineeringfirms
 
apidays Munich 2025 - Developer Portals, API Catalogs, and Marketplaces, Miri...
apidays
 
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
apidays
 
short term internship project on Data visualization
JMJCollegeComputerde
 
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
Probability systematic sampling methods.pptx
PrakashRajput19
 
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
apidays Munich 2025 - The Physics of Requirement Sciences Through Application...
apidays
 
apidays Munich 2025 - Integrate Your APIs into the New AI Marketplace, Senthi...
apidays
 
White Blue Simple Modern Enhancing Sales Strategy Presentation_20250724_21093...
RamNeymarjr
 
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
Real Life Application of Set theory, Relations and Functions
manavparmar205
 
Introduction to computer chapter one 2017.pptx
mensunmarley
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
Insurance-Analytics-Branch-Dashboard (1).pptx
trivenisapate02
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
Top Civil Engineer Canada Services111111
nengineeringfirms
 

Business Intelligence & Data Analytics– An Architected Approach

  • 1. Copyright Global Data Strategy, Ltd. 2022 Business Intelligence & Data Analytics: An Architected Approach Donna Burbank Global Data Strategy, Ltd. June 23rd, 2022
  • 2. KATANA GRAPH | TM Katana Graph June 2022
  • 3. KATANA GRAPH | TM KATANA GRAPH | TM Confidential 2 High Performance Scale-out Graph Processing & Analytics Founded in March 2020, offices in Austin, Bay Area, NYC, Denver Co-founders: Keshav Pingali and Chris Rossbach Investors: Intel Capital, Dell Venture Capital, Redline Ventures, Walden International Katana team: Leaders in graph algorithms, programming languages, runtimes, virtualization and storage. Commercial engagements with several Fortune 100 companies Website: www.katanagraph.com Company Overview
  • 4. KATANA GRAPH | TM Leadership Team Confidential 3 Gurbinder Gill PhD UT Austin VMWare, Facebook, MSR , IBM Research Roshan Dathathri PhD UT Austin NI, MSR, HP Labs Emmett Witchel Prof UT Austin InCert, Veritas, Symantec Bo Wu Prof Colorado School of Mines Graph mining expert Donald Nguyen PhD UT Austin Google, Synthace, Determined AI Tyler Hunt PhD UT Austin MSR, Visa Research, Bell Labs Jon Currey University of Cambridge Distributed Systems, Machine Learning MSR, Apple (iTune), Oracle Yige Hu PhD UT Austin File System, Fault Tolerance Amy Chang Board Advisor BOD P&G, Cisco, Disney UCSF Hospital Exec Committee Deans Advisory Council Stanford University Ying Ding Data Science Advisor Professor UT Austin Medical/ Pharma Knowledge Graph, Machine Learning Co-founder Data2Discovery Keshav Pingali CEO, Co-founder Prof UT Austin Fellow ACM, IEEE, AAAS Chris Rossbach CTO, Co-founder Prof UT Austin MSR, Vmware, Canesta Farshid Sabet CBO, Co-founder Intel, Modvidius, Aptina, SanDisk
  • 5. KATANA GRAPH | TM KATANA GRAPH | Graph Technology Application Areas 04 Platforms Finance Healthcare Retail Energy Industrial Telecom Genomics Anti Money Laundering Drug Discovery Identity Graph Precision Medicine Electronic Circuit Design Tools Knowledge Graph Predictive Monitoring Intrusion detection Supply Chain Optimization Fraud Detection Real Time Analytics Customer 360 Recommendation Social Networks
  • 6. KATANA GRAPH | TM KATANA GRAPH | TM Why Katana Graph Confidential 5 Architected to handle massive graphs • Tested with largest publicly available web-crawl: WDC12 (3.5B vertices, 128B edges) Unmatched performance • 10x - 100x times faster vs competing solutions Massive scalability • Proven on Open Cloud HPC Clusters (AWS , Azure, Google Cloud) • Scales up to 256 machines on Stampede Xeon (Skylake) Cluster Native AI/ML with Graphs • Health and Life Sciences (HLS), Financial, Identity Management, Intrusion detection, EDA (Electronic Design Automation), HPC (High Performance Computing) application: 3D mesh generation
  • 7. KATANA GRAPH | TM Graph Compute Domains Confidential 06 Graph Database (Query) Graph AI & Machine Learning Graph Analytics & Mining Probability
  • 8. Global Data Strategy, Ltd. 2022 www.globaldatastrategy.com Donna Burbank 2 Donna is a recognised industry expert in information management with over 20 years of experience in data strategy, information management, data modeling, metadata management, and enterprise architecture. Her background is multi-faceted across consulting, product development, product management, brand strategy, marketing, and business leadership. She is currently the Managing Director at Global Data Strategy, Ltd., an international information management consulting company that specializes in the alignment of business drivers with data-centric technology. In past roles, she has served in key brand strategy and product management roles at CA Technologies and Embarcadero Technologies for several of the leading data management products in the market. As an active contributor to the data management community, she is a long time DAMA International member, Past President and Advisor to the DAMA Rocky Mountain chapter, and was awarded the Excellence in Data Management Award from DAMA International. She has worked with dozens of Fortune 500 companies worldwide in the Americas, Europe, Asia, and Africa and speaks regularly at industry conferences. She has co-authored several books and is a regular contributor to industry publications. She can be reached at [email protected] Donna is based in Boulder, Colorado, USA. Follow on Twitter @donnaburbank @GlobalDataStrat Twitter Event hashtag: #DAStrategies
  • 9. DATAVERSITY Data Architecture Strategies: This Year’s Lineup • January Emerging Trends in Data Architecture – What’s the Next Big Thing? • February Building a Data Strategy - Practical Steps for Aligning with Business Goals • March Master Data Management – Aligning Data, Process, and Governance • April Data Governance & Data Architecture: Alignment & Synergies • May Improving Data Literacy Around Data Architecture • June Business Intelligence & Data Analytics: An Architected Approach • July Best Practices in Metadata Management • August Data Quality Best Practices • September Business-centric Data Modeling • October Graph Databases: Benefits & Risks • December Enterprise Architecture vs. Data Architecture
  • 10. What We’ll Cover Today • Business intelligence (BI) and data analytics are increasing in popularity as more organizations look to become data-driven. • Many tools have powerful visualization techniques that can create dynamic displays of critical information. • To ensure that the data displayed in these visualizations is accurate and timely, a strong Data Architecture is needed. • This webinar will discuss how to create a robust Data Architecture for BI and data analytics that takes both business and technology needs into consideration.
  • 11. Data-Driven Business • 70% of organizations report that they see data as a strategic asset.* • 70% indicated that reporting and analytics were key drivers for data management.** • More than 50% identified improved collaboration through using a defined data architecture.** * Based on research from a 2019 DATAVERSITY survey on “Trends in Data Management” by Donna Burbank and Michelle Knight. ** Based on research from a 2021 DATAVERSITY survey on “Trends in Data Management” by Donna Burbank and Michelle Knight.
  • 12. Main Business Goals & Drivers for Data Management (select all that apply) [Bar chart; percentage axis from 0% to 80%; categories: Gaining Competitive Advantage · Improving Outcomes (e.g. health, education, etc.) · Improving Product Quality · Increasing Revenue and Growth · Improving Customer Satisfaction · Complying with Regulations · Saving Cost and Increasing Efficiency · Reducing Risk · Supporting Digital Transformation · Gaining Insights through Reporting and Analytics]
  • 13. Supporting Reporting & Analytics [Figure: ACME Inc. Sales Dashboard, filtered by Product: Widget 1 and Region: NA, for the years 2018 to 2022; products shown: Super Widget Pack, Widget 1, Widget 2] What about the data? Successful reporting & analytics includes: • Data-driven culture • Do we use dashboards in our sales meetings? • Or go by “gut feel”? • How can we integrate analytics into our sales cycle (e.g. predictive next best offer)? • Data Governance • How do we define “Total Revenue”? • What countries are included in South America? • Data Quality • Are these revenue numbers accurate? • What’s the source of the product data? • Data Architecture • How are we storing the data so that we can accurately & efficiently slice and dice it for these reports?
  • 14. What is the Correct Architecture to Power Reporting & Analytics? … There is a Cacophony of Options … Data Warehouse, Data Lake, Data Lake House, Data Marketplace, Metadata Catalog, Relational, Nonrelational, Star Schema, SQL, NoSQL, Graph, Document Store, Real-time Streaming, Time series…
  • 15. What are Current Organizational Priorities? * Based on research from a 2021 DATAVERSITY survey on “Trends in Data Management” by Donna Burbank and Michelle Knight
  • 16. Using a Data Lake in Conjunction with a Data Warehouse
  • 17. Integrating Multiple Paradigms • The Data Lake has a different architecture & purpose than traditional data sources such as data warehouses. • But the two environments can co-exist to share relevant information. [Diagram labels: Data Analysis & Discovery – Data Lake · Enterprise Systems of Record · Data Governance & Collaboration · Master & Reference Data · Data Warehouse · Data Marts · Operational Data · Security & Privacy · Sandbox · Lightly Modeled Data · Data Exploration · Reporting & Analytics · Advanced Analytics · Self-Service BI · Standard BI Reports]
  • 18. A Holistic Approach is Needed • Level 1: “Top-Down” alignment with business priorities • Level 2: Managing the people, process, policies & culture around data • Level 3: Leveraging data for strategic advantage • Level 4: Coordinating & integrating disparate data sources • Level 5: “Bottom-Up” management & inventory of data sources
  • 19. The Design Aspect of Data Architecture for BI & Analytics
  • 20. A little data modeling up-front … prevents headaches down the road • It’s often tempting to skip data modeling documentation because it’s “faster”. • But in the long term it ultimately takes longer, because the resulting errors and inconsistencies need to be fixed. “If you don’t have time to do it right, do you have time to do it again?” From Data Modeling for the Business by Hoberman, Burbank, Bradley, Technics Publications, 2009
  • 21. Levels of Data Models • Enterprise (Subject Areas) Purpose: Organization & Scoping of main business domain areas. Audience: Business Stakeholders, Data Architects • Conceptual (Business Concepts) Purpose: Communication & Definition of Business Concepts & Rules. Audience: Business Stakeholders, Data Architects • Logical (Data Entities) Purpose: Clarification & Detail of Business Rules & Data Structures. Audience: Data Architects, Business Analysts • Physical (Physical Tables) Purpose: Technical Implementation on a Physical Database. Audience: DBAs, Developers
  • 22. Different Physical Models for Different Use Cases Relational – Normal Form • Reduce redundancy for operational data • Increase data quality • Ensure consistency (ACID transactions) Dimensional – Star Schema • Ease of reporting for summarized and historical data • Ability to easily “slice and dice” for self-service reporting • Performance and flexibility NoSQL • Speed of retrieval, low latency • High data volumes • Flexibility for change …And More! • There are numerous ways to model and store data. • Hierarchical/XML • Graph • COBOL Copybook! • S3 “buckets” • Data Vault • Etc… No modeling technique is inherently “better” than another. Data use cases & purpose drive what “good” looks like. …Rant over…
  • 23. Is the Star Schema Dead?
  • 24. The Star Schema [Diagram: a central Fact (Measure) table surrounded by five Dimension tables] Facts/Measures: Contain the actual values to be reported on. What are we measuring? e.g. Activities (sales transaction, patient visit, etc.) • Few attributes (just numbers with links to the dimensions) • Many values (e.g. all sales transactions) Dimensions: Contain the details that describe the central fact, i.e. the things we want to report by. e.g. Date, Region, Quarter • Many attributes (Individual name, DOB, gender, etc.) • Few values Note: Your Master Data domains often feed these dimensions. Example: Sales by Month, by Customer, by Region, by Sales Rep, by Product. The Star Schema is still a user-friendly and performant way to “slice and dice” data for reporting.
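The fact/dimension split described on this slide can be sketched in a few lines. The following is a minimal illustration only, assuming an in-memory SQLite database; the table names (`fact_sales`, `dim_product`, `dim_region`, `dim_date`), the helper `revenue_by`, and the sample rows are all hypothetical and not taken from the presentation.

```python
import sqlite3

# Hypothetical star schema: one narrow fact table with many rows,
# linked to small descriptive dimension tables.
SCHEMA = """
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, product_name TEXT);
CREATE TABLE dim_region  (region_id  INTEGER PRIMARY KEY, region_name  TEXT);
CREATE TABLE dim_date    (date_id    INTEGER PRIMARY KEY, year INTEGER, month INTEGER);
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    region_id  INTEGER REFERENCES dim_region(region_id),
    date_id    INTEGER REFERENCES dim_date(date_id),
    revenue    REAL
);
"""

# Allow-list of dimensions we can report "by": (table, key column, label column).
DIMENSIONS = {
    "product": ("dim_product", "product_id", "product_name"),
    "region": ("dim_region", "region_id", "region_name"),
    "year": ("dim_date", "date_id", "year"),
}


def build_star_schema(conn):
    """Create the tables and load a handful of made-up rows."""
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO dim_product VALUES (?, ?)",
                     [(1, "Widget 1"), (2, "Widget 2")])
    conn.executemany("INSERT INTO dim_region VALUES (?, ?)",
                     [(1, "NA"), (2, "EMEA")])
    conn.executemany("INSERT INTO dim_date VALUES (?, ?, ?)",
                     [(1, 2021, 12), (2, 2022, 1)])
    conn.executemany("INSERT INTO fact_sales VALUES (?, ?, ?, ?)", [
        (1, 1, 1, 100.0),
        (1, 1, 2, 150.0),
        (2, 1, 2, 200.0),
        (1, 2, 2, 50.0),
    ])


def revenue_by(conn, dimension):
    """Sum the revenue fact grouped by one dimension ("slice and dice")."""
    table, key, label = DIMENSIONS[dimension]  # KeyError for unknown names
    sql = (
        f"SELECT d.{label}, SUM(f.revenue) "
        f"FROM fact_sales f JOIN {table} d ON f.{key} = d.{key} "
        f"GROUP BY d.{label} ORDER BY d.{label}"
    )
    return conn.execute(sql).fetchall()


conn = sqlite3.connect(":memory:")
build_star_schema(conn)
print(revenue_by(conn, "region"))
print(revenue_by(conn, "product"))
```

The same fact table answers each "by" question with a single join, which is the property the slide refers to as user-friendly slicing and dicing.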
  • 25. The Bus Matrix A Bus Matrix is a simple way to keep track of what you want to report “on” (Facts) and what you want to report “by” (Dimensions). Rows are Facts (report “on”); columns are Dimensions (report “by”). Location Sales Rep Product Customer Total Sales Revenue X X X X Wholesale Revenue X X Number of Returned Items Etc.
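As a rough sketch, a bus matrix can be held as a mapping from each fact to the set of dimensions it can be reported by. The flattened slide text does not show which two dimensions apply to Wholesale Revenue, so the pair used below is an assumption for illustration; the function names are hypothetical as well.

```python
# Columns of the matrix: the dimensions we report "by".
DIMENSIONS = ["Location", "Sales Rep", "Product", "Customer"]

# Rows of the matrix: the facts we report "on".
BUS_MATRIX = {
    "Total Sales Revenue": {"Location", "Sales Rep", "Product", "Customer"},
    # Assumption: the slide marks two dimensions here but not which ones.
    "Wholesale Revenue": {"Location", "Product"},
    # No dimensions are marked for this fact in the slide text.
    "Number of Returned Items": set(),
}


def facts_reportable_by(matrix, dimension):
    """Return the facts that can be reported by the given dimension."""
    return sorted(fact for fact, dims in matrix.items() if dimension in dims)


def shared_dimensions(matrix, fact_a, fact_b):
    """Return the dimensions two facts have in common."""
    return sorted(matrix[fact_a] & matrix[fact_b])


def render_matrix(matrix, dimensions):
    """Render the matrix as plain text with an X where a fact uses a dimension."""
    width = max(len(fact) for fact in matrix)
    lines = [" " * width + " | " + " | ".join(dimensions)]
    for fact, dims in matrix.items():
        cells = [("X" if d in dims else "").center(len(d)) for d in dimensions]
        lines.append(fact.ljust(width) + " | " + " | ".join(cells))
    return "\n".join(lines)


print(render_matrix(BUS_MATRIX, DIMENSIONS))
print(facts_reportable_by(BUS_MATRIX, "Product"))
```

Keeping the matrix as data makes it easy to check, before building a report, whether a requested fact and dimension combination has actually been modeled.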
  • 26. Design Patterns There are a number of design patterns available to fit a variety of use cases (again, there is no “one size fits all”) • Inmon vs. Kimball: The battle still rages... • Data Vault: Hubs, Links and Satellites • Flatten Everything: Popular with Data Science • Columnar: Columns vs. Rows • Graph: Good for discovering connections • And More…: Choices abound…
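To make the "good for discovering connections" point about the graph pattern concrete, here is a small sketch in plain Python: a breadth-first search that finds the shortest chain of relationships between two entities. The entity names and the `connection_path` helper are hypothetical; a real graph database would express this as a path query rather than hand-written traversal code.

```python
from collections import deque


def connection_path(edges, start, goal):
    """Return the shortest path between two nodes of an undirected graph, or None."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    if start not in graph or goal not in graph:
        return None

    queue = deque([[start]])
    visited = {start}
    while queue:
        path = queue.popleft()
        node = path[-1]
        if node == goal:
            return path
        # Sorted only to make the traversal order deterministic.
        for neighbour in sorted(graph[node]):
            if neighbour not in visited:
                visited.add(neighbour)
                queue.append(path + [neighbour])
    return None


# Made-up relationships: two customers linked through a shared device.
EDGES = [
    ("Customer A", "Account 1"),
    ("Account 1", "Device X"),
    ("Device X", "Account 2"),
    ("Account 2", "Customer B"),
    ("Customer C", "Account 3"),
]

print(connection_path(EDGES, "Customer A", "Customer B"))
print(connection_path(EDGES, "Customer A", "Customer C"))
```

In a relational model the same question would need a chain of self-joins whose depth is not known in advance, which is one reason the graph pattern suits connection discovery.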
  • 27. In a Typical Organization, there are many Use Cases for Data Models. The following is just a subset of options that exist… • Operational Usage: Web Application (NoSQL Key Value Pair for web session info); Operational System (Relational Database for Operational Data) • Transfer / Exchange: JSON, XML, … Etc. • Storage for Analytics / Reporting: Relational for Consistency & Standards; Reporting for Analytic “Slicing & Dicing”; Data Vault for Flexible Storage • Consumption for Analytics & Reporting: Cubes for Business Intelligence Reporting; Flattened tables for Analytics & Data Science; Graph Database for Connections & Patterns • Master Data & Hierarchies for Data Quality & Consistency
  • 28. Summary • Analytics and Reporting are key priorities for today’s data-driven business. • A strong data architecture is needed to support successful analytics. • There are many choices in the marketplace, and at the same time, core fundamentals still apply. • Choose your architecture wisely, and have fun and success with the numerous options available in today’s market.
  • 30. Who We Are: Business-Focused Data Strategy Maximize the Organizational Value of Your Data Investment In today’s business environment, showing rapid time to value for any technical investment is critical. But technology and data can be complex. At Global Data Strategy, we help demystify technical complexity to help you: • Demonstrate the ROI and business value of data to your management. • Build a data strategy at your pace to match your unique culture and organizational style. • Create an actionable roadmap for “quick wins”, while building towards a long-term scalable architecture. Global Data Strategy shares experience from some of the largest international organizations, scaled to the pace of your unique team. Global Data Strategy has worked with organizations globally in the following industries: Finance · Retail · Social Services · Health Care · Education · Manufacturing · Government · Public Utilities · Construction · Media & Entertainment · Insurance …. and more
  • 31. Questions? Thoughts? Ideas?