Realizing the Promise of
Data Lakes
webinars
Data Ninja Webinar Series
Sessions covering data virtualization solutions for driving business value
2
Data Ninja Webinars
Five webinars over the next few months…
Speakers
Head of Product Marketing
Lakshmi Randall
Agenda
1.Big Data Challenges
2.Limitations of Physical Data Lakes
3.The Governed Logical Data Lake
4.Benefits
5.Customer Case Studies
6.Q&A
5
Big Data Challenges
6
SOMEBODY BOUGHT SOMETHING
“BACK IN THE DAY”
• WE HAVE TO DEAL WITH
LEGACY
• HOMOGENEITY ISN’T
REALISTIC
• ALL DATA SOURCES ARE
NO LONGER EQUAL
© 2016 Autodesk | Enterprise Information Services 7
The Data Lake alone is not a panacea
Easy to put data
in
Wait! What about
my old data
warehouse?
Harder to access
and secure the
data
8
Duplicate sets of data in source system and data lake
 Creates data quality challenges because data must be updated in two
places.
 Expensive since the data needs to be stored and maintained twice
Governance of such large amounts of data can be challenging
 Restrictions on data access must be maintained as data is brought
into the data lake as well as on new data created within the data lake.
Data lakes themselves can become silos
 Often built for a specific department, data from the data lake must be
integrated with other enterprise data to create a complete picture
Limitations of Physical Data Lakes
8
9
Overcoming the Limitations of Physical Data Lakes
Integrate your Data Lake with other Enterprise Data Architectures
Provides a way to
access data from
separate systems
through an
abstraction layer
that makes it
appear as if the
data were in a
single data lake
Improves the
enterprise func-
tionality of data
lakes by
combining one or
more physical
data lakes with
other enterprise
data
Improves an
organization’s
ability to govern
and extract more
value from its
data lakes by
extending them
as logical data
lakes
9
Implement a Single Logical Data Lake Using Data Virtualization
Marketing
Data Lakes
Research
Logical Data Lake/Big Data Fabric
Healthcare
Self-Service
Analytics
Operational
Apps
A Single Governed Logical Data Lake
Data Virtualization combines one or more physical data lakes with other enterprise data to create a
“virtual” or “logical” data lake.
Other Data Sources
MDM Cloud Apps
BI/Analytical
Tools
Excel
Reports
DATA VIRTUALIZATION
Semantic
Model
Data
Discovery
Metadata
Catalog
Data
Governance
Denodo Platform Bridges Distinct Data Architectures
10
 Discover
 Prepare
 Curate
 Orchestrate
 Integrate
 Publish
“Rely on Data Integration
Infrastructure to make the
Data Lake Work.”
Philip Russom, Analyst, TDWI
11
Data Governance
 Data Lineage
 Structure and organization to your data lake
 Data Masking
 Data Quality
 Functions for validating, cleansing, enriching
and standardizing data.
 SDK to integrate with external DQ tools and Big
Data systems
Enterprise Access Point
 Enterprise-level access controls – table, row, column
 Authentication/Authroization
 Roles
 Audit all access
 Encryption/Decryption
 Universal Semantic Model
The Governed Data Lake
12
Data Lineage
Understand the “source of truth” and transformations of every piece of data in the
model
Selective Data Masking (Full and Partial)
13
Role-based Granular Security
14
Performance & Scalability
16
Mature Data Virtualization
Operational
Data
EDW
SQL
Integrated
Security
Other
Sources
Cache
In-memory
Fabric
Big
Data
SaaS
REST
OData
Catalog &
Data Exploration
Monitoring
Auditing
17
Performance
Denodo’s unique query optimizer
Denodo’s optimizer borrows many techniques from traditional RDBMs
 Query plans based on statistics and indexes
 Multiple JOIN methods
 Query rewriting to generate more optimal SQL
However, given the distributed execution of a query in a processing
fabric, Denodo has designed unique techniques to maximize
performance in this environment
 Dynamic rewriting focused on maximizing execution at source and reduction of
network traffic
 Cost estimates also factor-in:
 Processing power of the sources (e.g. number of nodes in a Hadoop cluster)
 Network and transfer rates
18
1. A logical data lake prevents the data lake from becoming
a silo and provides access to all the information an
organization needs to power its analytics.
2. Data virtualization improves agility in big data activities.
Users can quickly combine sources of information
without spending time installing and configuring new
databases or clusters to store the consolidated
information.
3. This ease of use encourages exploration of data since the
cost or effort to access the information is lower.
4. Data virtualization eliminates the cost of storing
information twice and the need to update information in
multiple places since information is not duplicated.
Managing Data Lakes
18
Data virtualization is a practical strategy for managing data lakes
19
Why do Enterprises need Denodo’s Big Data Fabric
to succeed?
New actionable insights with minimal effort
Information Self-service for business users Secures big data end-to-end
Real-time integrated data
across the business
Ability to aggregate, transform, cleanse,
and integrate data from multiple big data
sources, which can then be presented in
dashboards, reporting tools, and web
applications.
Allows any application, process,
dashboard, tool, or user to access
any integrated data, regardless of
where the data is physically or
logically located and regardless of
the data format.
Offers consistent, timely, and
trusted data for internal and
external users.
Enables centralized data access
and control, and supports data-at-
rest and data-in-motion security
measures.
Remediates security risks with
masking, auditing, and encryption
across the fabric.
Provides self-service data discovery
and search capabilities.
Virtual Sandboxing for Citizen Users
Benefits
21
-Source: “Predicts 2017: Data Distribution and Complexity Drive Information Infrastructure
Modernization”
By 2018, organizations with data virtualization capabilities will
spend 40% less on building and managing data integration
processes for connecting distributed data assets.”
Data Virtualization bridges the skills gap
 Developers and users don’t need
to learn special languages.
 They can leverage Denodo
graphical user interface to
model, unify and deliver the data
to multiple consumers.
Data Virtualization combines
new and legacy data sources
24
How the data goes in… How it gets back out…
Denodo’s Big data
fabric provides
easy access to big
data without
having to decipher
various data
formats.
Data Virtualization provides ease of use
25
-Source: “Forrester Wave™: Big Data Fabric Q4 2016”
Denodo’s key strength is delivering a unified and centralized
data services fabric with security and real-time integration
across multiple traditional and big data sources, including
Hadoop, NoSQL, cloud, and software-as-a-service (SaaS).”
26
-Source: “Forrester Wave™: Big Data Fabric Q4 2016”
Today, several enterprises are leveraging Denodo to support big data fabric
deployments — such as virtual big data marts, big data analytics, realtime
analytics, and iot data processing — in various vertical industries. Customers
like its easy-to-use, simple yet sophisticated data modeling capabilities,
search, and support for various big data sources.
Customer Case Studies
© 2016 Autodesk | Enterprise Information Services 28
Why Build the Logical Data Lake Data virtualization can be used
throughout your data pipeline!
29
Leading Construction Manufacturer - Telematics &
Predictive Maintenance
Dealer
Maintenance
Parts Inventory
OSI PI Hadoop Cluster
Tableau: Dealer / Customer Dashboard
30
Enrich Machine Data and Combine with Other Data
Ingest, Integrate & Deliver
Persisted
(In-memory, Hadoop)
Streams
(specific time window)
Message Queue
Machine-generated/Event data Alerts
Workflows
Operational
Processes
Analytical
Processes
Consumers
Visualization
Data Virtualization
Enrich and Combine IoT
Data with Other Data
Historians
Streams
ERP/SCM
DW
Analytical
DB
MDM
Apps
Data Marts
Hadoop NoSQL
31
Business Benefits
 Improved asset performance and proactive maintenance.
 Reduced warranty costs due to proactive maintenance of parts
preventing parts failure.
 Optimized pricing for services and parts among global service
providers.
 New Business Model opportunities based on real-time analysis
of detailed sensor data.
Q&A
Next Steps
Forrester Wave™: Big Data Fabric Q4 2016
http://www.denodo.com/en/page/forrester-wave-big-data-fabric-
q4-2016
Get Started!
Download Denodo Express: www.denodoexpress.com
Access Denodo Platform on AWS: www.denodo.com/en/denodo-
platform/denodo-platform-for-aws
Data Ninja Webinar Series
Sessions covering data virtualization solutions for driving business value
Next Series: Packed Lunch Series
January - 2017
Thanks!
www.denodo.com info@denodo.com
© Copyright Denodo Technologies. All rights reserved
Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical,
including photocopying and microfilm, without prior the written authorization from Denodo Technologies.

More Related Content

PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r1)
PDF
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
PPT
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r2)
PPTX
SQL Server Disaster Recovery Implementation
PPT
DW 101
PPTX
From Hadoop to Enterprise Data Warehouse
PPTX
Technical Demonstration - Denodo Platform 7.0
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Fast and Furious: From POC to an Enterprise Big Data Stack in 2014
Data Lakehouse, Data Mesh, and Data Fabric (r2)
SQL Server Disaster Recovery Implementation
DW 101
From Hadoop to Enterprise Data Warehouse
Technical Demonstration - Denodo Platform 7.0

What's hot (20)

PDF
Performance Acceleration: Summaries, Recommendation, MPP and more
PDF
Company report xinglian
PDF
Enabling Cloud Data Integration (EMEA)
PPTX
Applying Big Data Superpowers to Healthcare
PDF
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
PPT
Data Federation
PDF
Accelerate Self-Service Analytics with Data Virtualization and Visualization
PDF
How to select a modern data warehouse and get the most out of it?
PPTX
2022 02 Integration Bootcamp
PDF
Cloud Storage Spring Cleaning: A Treasure Hunt
PDF
Enabling a Data Mesh Architecture with Data Virtualization
PDF
Building a Logical Data Fabric using Data Virtualization (ASEAN)
PDF
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
PDF
Data Virtualization: From Zero to Hero
PDF
Are You Killing the Benefits of Your Data Lake?
PDF
(BI Advanced) Hiram Fleitas - SQL Server Machine Learning Predict Sentiment O...
PDF
GigaOm-sector-roadmap-cloud-analytic-databases-2017
PDF
GDPR Noncompliance: Avoid the Risk with Data Virtualization
PDF
The technology of the business data lake
PDF
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Performance Acceleration: Summaries, Recommendation, MPP and more
Company report xinglian
Enabling Cloud Data Integration (EMEA)
Applying Big Data Superpowers to Healthcare
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Data Federation
Accelerate Self-Service Analytics with Data Virtualization and Visualization
How to select a modern data warehouse and get the most out of it?
2022 02 Integration Bootcamp
Cloud Storage Spring Cleaning: A Treasure Hunt
Enabling a Data Mesh Architecture with Data Virtualization
Building a Logical Data Fabric using Data Virtualization (ASEAN)
DataStax GeekNet Webinar - Apache Cassandra: Enterprise NoSQL
Data Virtualization: From Zero to Hero
Are You Killing the Benefits of Your Data Lake?
(BI Advanced) Hiram Fleitas - SQL Server Machine Learning Predict Sentiment O...
GigaOm-sector-roadmap-cloud-analytic-databases-2017
GDPR Noncompliance: Avoid the Risk with Data Virtualization
The technology of the business data lake
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Ad

Viewers also liked (9)

PDF
Data-driven teacher effectiveness: Where to begin?
PPT
Data Driven Instruction
PDF
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
PDF
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
PPTX
Practical guide to architecting data lakes - Avinash Ramineni - Phoenix Data...
PDF
Data Driven Design Research Personas
PDF
Data Lake: A simple introduction
PPTX
Christopher Penn - Build a Data-Driven Customer Journey
PDF
Utilities Digital Data Driven Innovation
Data-driven teacher effectiveness: Where to begin?
Data Driven Instruction
Dr. Christian Kurze from Denodo, "Data Virtualization: Fulfilling the Promise...
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
Practical guide to architecting data lakes - Avinash Ramineni - Phoenix Data...
Data Driven Design Research Personas
Data Lake: A simple introduction
Christopher Penn - Build a Data-Driven Customer Journey
Utilities Digital Data Driven Innovation
Ad

Similar to Data Ninja Webinar Series: Realizing the Promise of Data Lakes (20)

PDF
Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)
PDF
Big Data Fabric: A Recipe for Big Data Initiatives
PDF
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
PDF
Die Big Data Fabric als Enabler für Machine Learning & AI
PDF
Data Virtualization: An Introduction
PDF
An Introduction to Data Virtualization in 2018
PDF
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
PDF
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
PDF
Data Virtualization: An Introduction
PDF
Data Virtualization: An Introduction
PDF
From Single Purpose to Multi Purpose Data Lakes - Broadening End Users
PDF
Data Virtualization. An Introduction (ASEAN)
PDF
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
PDF
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
PDF
Data virtualization an introduction
PDF
Why Data Virtualization? An Introduction
PDF
How Data Virtualization Puts Machine Learning into Production (APAC)
PDF
Modern Data Management for Federal Modernization
PDF
The Role of the Logical Data Fabric in a Unified Platform for Modern Analytics
PDF
The Role of Logical Data Fabric in a Unified Platform for Modern Analytics (A...
Data Virtualization enabled Data Fabric: Operationalize the Data Lake (APAC)
Big Data Fabric: A Recipe for Big Data Initiatives
Big Data Fabric: A Necessity For Any Successful Big Data Initiative
Die Big Data Fabric als Enabler für Machine Learning & AI
Data Virtualization: An Introduction
An Introduction to Data Virtualization in 2018
Rethink Your 2021 Data Management Strategy with Data Virtualization (ASEAN)
How Data Virtualization Puts Enterprise Machine Learning Programs into Produc...
Data Virtualization: An Introduction
Data Virtualization: An Introduction
From Single Purpose to Multi Purpose Data Lakes - Broadening End Users
Data Virtualization. An Introduction (ASEAN)
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Data virtualization an introduction
Why Data Virtualization? An Introduction
How Data Virtualization Puts Machine Learning into Production (APAC)
Modern Data Management for Federal Modernization
The Role of the Logical Data Fabric in a Unified Platform for Modern Analytics
The Role of Logical Data Fabric in a Unified Platform for Modern Analytics (A...

More from Denodo (20)

PDF
Enterprise Monitoring and Auditing in Denodo
PDF
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
PDF
Achieving Self-Service Analytics with a Governed Data Services Layer
PDF
What you need to know about Generative AI and Data Management?
PDF
Mastering Data Compliance in a Dynamic Business Landscape
PDF
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
PDF
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
PDF
Drive Data Privacy Regulatory Compliance
PDF
Знакомство с виртуализацией данных для профессионалов в области данных
PDF
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
PDF
Denodo Partner Connect - Technical Webinar - Ask Me Anything
PDF
Lunch and Learn ANZ: Key Takeaways for 2023!
PDF
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
PDF
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
PDF
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
PDF
How to Build Your Data Marketplace with Data Virtualization?
PDF
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
PDF
Enabling Data Catalog users with advanced usability
PDF
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
PDF
GenAI y el futuro de la gestión de datos: mitos y realidades
Enterprise Monitoring and Auditing in Denodo
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Achieving Self-Service Analytics with a Governed Data Services Layer
What you need to know about Generative AI and Data Management?
Mastering Data Compliance in a Dynamic Business Landscape
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Drive Data Privacy Regulatory Compliance
Знакомство с виртуализацией данных для профессионалов в области данных
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Lunch and Learn ANZ: Key Takeaways for 2023!
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
How to Build Your Data Marketplace with Data Virtualization?
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Enabling Data Catalog users with advanced usability
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
GenAI y el futuro de la gestión de datos: mitos y realidades

Recently uploaded (20)

PDF
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
PPTX
PPT for Diseases (1)-2, types of diseases.pptx
PPTX
1 hour to get there before the game is done so you don’t need a car seat for ...
PPTX
MBA JAPAN: 2025 the University of Waseda
PPTX
Introduction to Fundamentals of Data Security
PPTX
machinelearningoverview-250809184828-927201d2.pptx
PPT
Classification methods in data analytics.ppt
PPTX
PPT for Diseases.pptx, there are 3 types of diseases
PPTX
recommendation Project PPT with details attached
PDF
REPORT CARD OF GRADE 2 2025-2026 MATATAG
PPTX
Chapter security of computer_8_v8.1.pptx
PDF
©️ 01_Algorithm for Microsoft New Product Launch - handling web site - by Ale...
PPTX
DATA MODELING, data model concepts, types of data concepts
PPTX
AI AND ML PROPOSAL PRESENTATION MUST.pptx
PDF
book-34714 (2).pdfhjkkljgfdssawtjiiiiiujj
PPTX
inbound2857676998455010149.pptxmmmmmmmmm
PDF
Concepts of Database Management, 10th Edition by Lisa Friedrichsen Test Bank.pdf
PDF
2025-08 San Francisco FinOps Meetup: Tiering, Intelligently.
PDF
technical specifications solar ear 2025.
PDF
The Role of Pathology AI in Translational Cancer Research and Education
©️ 02_SKU Automatic SW Robotics for Microsoft PC.pdf
PPT for Diseases (1)-2, types of diseases.pptx
1 hour to get there before the game is done so you don’t need a car seat for ...
MBA JAPAN: 2025 the University of Waseda
Introduction to Fundamentals of Data Security
machinelearningoverview-250809184828-927201d2.pptx
Classification methods in data analytics.ppt
PPT for Diseases.pptx, there are 3 types of diseases
recommendation Project PPT with details attached
REPORT CARD OF GRADE 2 2025-2026 MATATAG
Chapter security of computer_8_v8.1.pptx
©️ 01_Algorithm for Microsoft New Product Launch - handling web site - by Ale...
DATA MODELING, data model concepts, types of data concepts
AI AND ML PROPOSAL PRESENTATION MUST.pptx
book-34714 (2).pdfhjkkljgfdssawtjiiiiiujj
inbound2857676998455010149.pptxmmmmmmmmm
Concepts of Database Management, 10th Edition by Lisa Friedrichsen Test Bank.pdf
2025-08 San Francisco FinOps Meetup: Tiering, Intelligently.
technical specifications solar ear 2025.
The Role of Pathology AI in Translational Cancer Research and Education

Data Ninja Webinar Series: Realizing the Promise of Data Lakes

  • 1. Realizing the Promise of Data Lakes webinars Data Ninja Webinar Series Sessions covering data virtualization solutions for driving business value
  • 2. 2 Data Ninja Webinars Five webinars over the next few months…
  • 3. Speakers Head of Product Marketing Lakshmi Randall
  • 4. Agenda 1.Big Data Challenges 2.Limitations of Physical Data Lakes 3.The Governed Logical Data Lake 4.Benefits 5.Customer Case Studies 6.Q&A
  • 6. 6 SOMEBODY BOUGHT SOMETHING “BACK IN THE DAY” • WE HAVE TO DEAL WITH LEGACY • HOMOGENEITY ISN’T REALISTIC • ALL DATA SOURCES ARE NO LONGER EQUAL
  • 7. © 2016 Autodesk | Enterprise Information Services 7 The Data Lake alone is not a panacea Easy to put data in Wait! What about my old data warehouse? Harder to access and secure the data
  • 8. 8 Duplicate sets of data in source system and data lake  Creates data quality challenges because data must be updated in two places.  Expensive since the data needs to be stored and maintained twice Governance of such large amounts of data can be challenging  Restrictions on data access must be maintained as data is brought into the data lake as well as on new data created within the data lake. Data lakes themselves can become silos  Often built for a specific department, data from the data lake must be integrated with other enterprise data to create a complete picture Limitations of Physical Data Lakes 8
  • 9. 9 Overcoming the Limitations of Physical Data Lakes Integrate your Data Lake with other Enterprise Data Architectures Provides a way to access data from separate systems through an abstraction layer that makes it appear as if the data were in a single data lake Improves the enterprise func- tionality of data lakes by combining one or more physical data lakes with other enterprise data Improves an organization’s ability to govern and extract more value from its data lakes by extending them as logical data lakes 9 Implement a Single Logical Data Lake Using Data Virtualization
  • 10. Marketing Data Lakes Research Logical Data Lake/Big Data Fabric Healthcare Self-Service Analytics Operational Apps A Single Governed Logical Data Lake Data Virtualization combines one or more physical data lakes with other enterprise data to create a “virtual” or “logical” data lake. Other Data Sources MDM Cloud Apps BI/Analytical Tools Excel Reports DATA VIRTUALIZATION Semantic Model Data Discovery Metadata Catalog Data Governance Denodo Platform Bridges Distinct Data Architectures 10  Discover  Prepare  Curate  Orchestrate  Integrate  Publish “Rely on Data Integration Infrastructure to make the Data Lake Work.” Philip Russom, Analyst, TDWI
  • 11. 11 Data Governance  Data Lineage  Structure and organization to your data lake  Data Masking  Data Quality  Functions for validating, cleansing, enriching and standardizing data.  SDK to integrate with external DQ tools and Big Data systems Enterprise Access Point  Enterprise-level access controls – table, row, column  Authentication/Authroization  Roles  Audit all access  Encryption/Decryption  Universal Semantic Model The Governed Data Lake
  • 12. 12 Data Lineage Understand the “source of truth” and transformations of every piece of data in the model
  • 13. Selective Data Masking (Full and Partial) 13
  • 17. 17 Performance Denodo’s unique query optimizer Denodo’s optimizer borrows many techniques from traditional RDBMs  Query plans based on statistics and indexes  Multiple JOIN methods  Query rewriting to generate more optimal SQL However, given the distributed execution of a query in a processing fabric, Denodo has designed unique techniques to maximize performance in this environment  Dynamic rewriting focused on maximizing execution at source and reduction of network traffic  Cost estimates also factor-in:  Processing power of the sources (e.g. number of nodes in a Hadoop cluster)  Network and transfer rates
  • 18. 18 1. A logical data lake prevents the data lake from becoming a silo and provides access to all the information an organization needs to power its analytics. 2. Data virtualization improves agility in big data activities. Users can quickly combine sources of information without spending time installing and configuring new databases or clusters to store the consolidated information. 3. This ease of use encourages exploration of data since the cost or effort to access the information is lower. 4. Data virtualization eliminates the cost of storing information twice and the need to update information in multiple places since information is not duplicated. Managing Data Lakes 18 Data virtualization is a practical strategy for managing data lakes
  • 19. 19 Why do Enterprises need Denodo’s Big Data Fabric to succeed? New actionable insights with minimal effort Information Self-service for business users Secures big data end-to-end Real-time integrated data across the business Ability to aggregate, transform, cleanse, and integrate data from multiple big data sources, which can then be presented in dashboards, reporting tools, and web applications. Allows any application, process, dashboard, tool, or user to access any integrated data, regardless of where the data is physically or logically located and regardless of the data format. Offers consistent, timely, and trusted data for internal and external users. Enables centralized data access and control, and supports data-at- rest and data-in-motion security measures. Remediates security risks with masking, auditing, and encryption across the fabric. Provides self-service data discovery and search capabilities. Virtual Sandboxing for Citizen Users
  • 21. 21 -Source: “Predicts 2017: Data Distribution and Complexity Drive Information Infrastructure Modernization” By 2018, organizations with data virtualization capabilities will spend 40% less on building and managing data integration processes for connecting distributed data assets.”
  • 22. Data Virtualization bridges the skills gap
  • 23.  Developers and users don’t need to learn special languages.  They can leverage Denodo graphical user interface to model, unify and deliver the data to multiple consumers. Data Virtualization combines new and legacy data sources
  • 24. 24 How the data goes in… How it gets back out… Denodo’s Big data fabric provides easy access to big data without having to decipher various data formats. Data Virtualization provides ease of use
  • 25. 25 -Source: “Forrester Wave™: Big Data Fabric Q4 2016” Denodo’s key strength is delivering a unified and centralized data services fabric with security and real-time integration across multiple traditional and big data sources, including Hadoop, NoSQL, cloud, and software-as-a-service (SaaS).”
  • 26. 26 -Source: “Forrester Wave™: Big Data Fabric Q4 2016” Today, several enterprises are leveraging Denodo to support big data fabric deployments — such as virtual big data marts, big data analytics, realtime analytics, and iot data processing — in various vertical industries. Customers like its easy-to-use, simple yet sophisticated data modeling capabilities, search, and support for various big data sources.
  • 28. © 2016 Autodesk | Enterprise Information Services 28 Why Build the Logical Data Lake Data virtualization can be used throughout your data pipeline!
  • 29. 29 Leading Construction Manufacturer - Telematics & Predictive Maintenance Dealer Maintenance Parts Inventory OSI PI Hadoop Cluster Tableau: Dealer / Customer Dashboard
  • 30. 30 Enrich Machine Data and Combine with Other Data Ingest, Integrate & Deliver Persisted (In-memory, Hadoop) Streams (specific time window) Message Queue Machine-generated/Event data Alerts Workflows Operational Processes Analytical Processes Consumers Visualization Data Virtualization Enrich and Combine IoT Data with Other Data Historians Streams ERP/SCM DW Analytical DB MDM Apps Data Marts Hadoop NoSQL
  • 31. 31 Business Benefits  Improved asset performance and proactive maintenance.  Reduced warranty costs due to proactive maintenance of parts preventing parts failure.  Optimized pricing for services and parts among global service providers.  New Business Model opportunities based on real-time analysis of detailed sensor data.
  • 32. Q&A
  • 33. Next Steps Forrester Wave™: Big Data Fabric Q4 2016 http://www.denodo.com/en/page/forrester-wave-big-data-fabric- q4-2016 Get Started! Download Denodo Express: www.denodoexpress.com Access Denodo Platform on AWS: www.denodo.com/en/denodo- platform/denodo-platform-for-aws
  • 34. Data Ninja Webinar Series Sessions covering data virtualization solutions for driving business value Next Series: Packed Lunch Series January - 2017
  • 35. Thanks! www.denodo.com [email protected] © Copyright Denodo Technologies. All rights reserved Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm, without prior the written authorization from Denodo Technologies.