SlideShare a Scribd company logo
W E B I N A R S E R I E S
BI Tools and Data
Virtualization are
Interchangeable
W E B I N A R S E R I E S
BI Tools and Data
Virtualization are
Interchangeable
Paul Moxon
SVP Data Architectures & Chief Evangelist
Denodo
17nd June 2020
Paul Moxon
SVP Data Architectures & Chief
Evangelist, Denodo
Speakers
1. Today’s Myth
2. Origins of the Myth
3. Just the Facts Ma’am
4. The Proof is in the Pudding
5. Conclusions
6. Q&A
7. Next Steps
Agenda
5
Myth #2:
BI Tools and Data Virtualization
are Interchangeable
Origins of the Myth
7
Welcome to my Universe
• BusinessObjects added Universe as semantic layer to
BI tool
• Special tools to design business-oriented data
objects
• Hide technical nature of physical data storage
• Initially use Data Federator to access multiple data
sources
• Multi-source Universe capability subsumed Data
Federator tool
• Made BusinessObjects the leading BI Tool vendor
• Increased usability and appeal to ‘citizen analysts’
8
Follow the Leader
• Other vendors followed this approach
• MicroStrategy, Cognos, etc.
• New entrants initially focused on visualization
and analysis of data
• Tableau, Qlik, Power BI
• Quickly added ‘data blending’ capabilities
• Support multiple data source integration
• With limitations 
9
Data Blending Everywhere
• Most reporting tools now offer capabilities to create reports with data coming from
multiple data sources
• Some in real time, with their own federation engines (e.g. Tableau, MicroStrategy,
Business Objects, etc.)
• Some based on replication in the reporting tool engine (Qlik, SiSense, ThoughtSpot,
etc.)
• Some of them also provide data modeling capabilities (Looker, Business Objects,
MicroStrategy, PowerBI, etc.)
So if I can have multi-source queries and define a logical model in my
reporting tool, why would I need Data Virtualization?
Just the Facts, Ma’am
11
Source: “Gartner Market Guide for Data Virtualization, November 16, 2018”
Data virtualization can be used to create virtualized and
integrated views of data in-memory rather than executing data
movement and physically storing integrated views in a target
data structure. It provides a layer of abstraction above the
physical implementation of data, to simplify query logic.
12
What is Data Virtualization?
Consume
in business applications
Combine
related data into views
Connect
to disparate data sources
2
3
1
DATA CONSUMERS
DISPARATE DATA SOURCES
Enterprise Applications, Reporting, BI, Portals, ESB, Mobile, Web, Users
Databases & Warehouses, Cloud/Saas Applications, Big Data, NoSQL, Web, XML, Excel, PDF, Word...
Analytical Operational
Less StructuredMore Structured
CONNECT COMBINE PUBLISH
Multiple Protocols,
Formats
Query, Search,
Browse
Request/Reply,
Event Driven
Secure
Delivery
SQL,
MDX
Web
Services
Big Data
APIs
Web Automation
and Indexing
CONNECT COMBINE CONSUME
Share, Deliver,
Publish, Govern,
Collaborate
Discover, Transform,
Prepare, Improve
Quality, Integrate
Normalized views of
disparate data
“Data virtualization
integrates disparate
data sources in real
time or near-real
time to meet
demands for
analytics and
transactional data.”
– Create a Road Map For A
Real-time, Agile, Self-
Service Data Platform,
Forrester Research, Dec 16,
2015
13
What is Data Virtualization?
1. Single Access Point to all
Data at any location
2. Semantic Layer – Expose
Data in Business-Friendly
form, adapted to the
needs of each consumer
3. Abstract changes in the
underlying infrastructure
4. Single entry point to apply
security and governance
policies
5. Avoid data replication: Up
to 80% reduction in
integration costs, in terms
of resources and
technology data
14
(Almost) Any-to-Many Connectivity
Relational Databases
• MS SQL*Server (JDBC, ODBC): 2000, 2005, 2008,
2008R2, 2012, 2014, 2016
• Oracle (JDBC): 8i, 9i, 10g, 11g, 12c, 18
• Oracle E-Business Suite (JDBC): 12
• IBM DB2 (JDBC): 8, 9, 10, 11, 12 for LUW; 9,10 for z/OS
• Informix (JDBC): 7, 12
• Sybase Adaptive Server Enterprise (JDBC): 12, 15
• MySQL (JDBC): 4, 5
• PostgreSQL (JDBC): 8, 9
• Denodo Platform (JDBC): 5.5, 6.0, 7.0
- For multi-location architecture deployments
• MS Access (ODBC)
• Apache Derby (JDBC): 10
• Generic (JDBC)
In-Memory Databases
• SAP HANA (JDBC): 1
• Oracle TimesTen (JDBC): 11g
• Oracle 12c In-Memory
Parallel databases and appliances
• GreenPlum (JDBC): 4.2
• HP Vertica (JDBC): 7, 8
• Oracle Exadata (JDBC): X5-2
• ParAccel 8.0.2 (using ParAccel 2.5.0.0 JDBC3g/SSL
driver)
• Netezza (JDBC): 4.6, 5.0, 6.0, 7.0
• SybaseIQ (JDBC) 12.x, 15.x
• Teradata (JDBC): 12, 13, 14, 15
Multi-Dimensional Sources
• SAP BW (BAPI/XMLA): 3.x
• SAP BI 7.x (BAPI): 7.x
• Mondrian (XMLA): 3.x
• MS SQL Server Analysis Services 200x
• Essbase (XMLA): 9, 11
Cloud Data Warehouse
• Amazon Redshift (JDBC)
• Amazon Athena (JDBC)
• Amazon Aurora (JDBC)
• Snowflake (JDBC)
• Amazon DynamoDB
• Azure SQL Data Warehouse
• Azure CosmosDB (SQL API and MongoDB API)
Big Data/NoSQL
• Apache Hive (JDBC): 0.12, 1.1.0, 1.1.0 for Cloudera
1.2.1 for Hortonworks 2.0.0
• MapR-XD, MapR-DB, MapR-ES, Hive, and Drill for
MapR 6.1
• Impala (JDBC): 2.3
• Spark SQL (JDBC): 1.5, 1.6
• Google BigQuery (JDBC)
• Presto (JDBC)
Web Automation
• Denodo’s ITPilot automates extraction from web
pages
Indexes and unstructured content
• CMS, file systems, pdf, word, text, email servers,
knowledge bases, indexes
• Elastic Search
Web Services
• SOAP
• REST (XML, RSS, ATOM, JSON)
• OData v2 and v4
Packaged Applications
• SAP ERP/ECC (BAPIs and RFC tables)
• Oracle E-Business Suite 12
• Siebel
• SAS (SAS JDBC Driver): 7 and higher
Semantic Repositories
• Semantic repositories in Triple Stores / RDF
accessed through SPARQL endpoints.
Flat and Binary Files
• CSV, pipe-delimited, Regular expression-parsed
• MS Excel xls 97-2003
• MS Excel xlsx 2007 or later
• MS Access
• XML
• JSON
All files can be locally accessible or in remote
filesystems, through FTP/ SFTP/FTPS, and in clear,
zipped and/or encrypted format.
Active Directory as source or leveraging security
• LDAP v3
• Microsoft Active Directory 2003, 2008
Cloud, SaaS, Web Sources with Simplified OAuth
Security
• Amazon
• Google
• Facebook
• LinkedIn
• MS Azure Data Lake
• MS SharePoint (by using the OData connector)
• MS Dynamics
• ServiceNow
• Marketo
• Salesforce
• Twitter via APIs with simplified Oauth integration
(1.0, 1.0a and 2.0)
• Workday
MS Queues as data source and Delivery
• MQSeries
• SonicMQ
• ActiveMQ
• Tibco EMS
Denodo SDK for Custom Connectors
• CouchDB
• Lotus Domino
• MongoDB and Mongo Atlas DBaaS
Mainframe
• IMS
• IBM IMS native drivers: 8, 9
• IMS Universal Drivers: 11
Hierarchical databases
• Adabas (SOA Gateway and Denodo’s SOAP
connector): 5, 6
Legacy
• Microsoft FoxPro (ODBC)
The following data sources have been successfully
tested with Denodo using JDBC and ODBC drivers,
WS/SOAP and WS/REST, and DenodoConnect
adapters (not exhaustive list):
• Apache Solr
• Kafka Messages
• SAS Files
• Hadoop HBase
• Hadoop HCatalog
• Hadoop HDFS (Avro, CSV, Parquet)
• Files in Amazon S3 (incl. Parquet files)
• IBM BigInsights
• Pivotal HAWQ
15
(Almost) Any-to-Many Connectivity
Many Consumers
Protocols and Formats
• SQL Based access via JDBC, ODBC and ADO.NET
• Web Services
• SOAP (XML/JSON)
• REST (JSON/XML)
• OData
• Open API (a.k.a Swagger)
• Web Parts (for SharePoint), Portlets
• Kafka and JMS listeners for message queues
• Denodo Scheduler for batch process and ‘ETL lite’
Security Options
• Authentication using LDAP or Active Directory
• Kerberos for Single Sign-On (SSO)
• OAuth, OAuth 2.0 (JWT)
• SAML
• SSL/TLS
• WS-Security, X.509 certificates
BI/Reporting tools
• Microstrategy, Cognos, Business Objects, Oracle OBIEE
• Tableau, Qlikview, Spotfire, Microsoft PowerBI
• Excel
Analytical Tools/Languages
• SAS, Statistica, SPSS, MatLab
• R, Python, Java, Scala, etc.
• Azure ML Studio, Amazon Machine Learning
Portals
• SharePoint, Enterprise portals, Web/mobile apps
Enterprise Service Bus
• Oracle Service Bus, Azure Service Bus, TIBCO Active Matrix
Bus
ETL tools
• SAP Data Services, Informatica Powercenter, IBM Data
Stage, Talend ETL
API Management tools
• CA (Layer 7), TIBCO Mashery, Apigee
16
Data Blending – Semantic Silos
17
Data Blending Silos
Q: Is SAP planning to release SAP Universe connections for Power BI and Tableau?
A: The answer is no. No. There are no plans for this.
Gregory Botticchio, Director of Product Management, SAP BusinessObjects
Suite 360 webinar for SAP BusinessObjects 4.3 Release Preview
Beside SAP BusinessObjects, are you
using other analytics solution(s)?
18
Data Blending Limitations
Shared Dataset
(Import Mode)
Shared Dataset
(Direct Mode)
Direct mode is limited
to 1 data source
and 1 million rows
19
Francois Ajenstat, Chief Product Officer, Tableau Software
There are two flows; the ad-hoc and the operational…where we are
coming from is…I just want to integrate these two sources. It's not
formalized, per se, it's not a project. I just want to connect this and this
and I want to analyze it. How do we go from data to analysis as quickly as
possible? And when you want to formalize it, operationalize it, make it
repeatable, then [you use other tools].
The Proof is in the Pudding
21
Denodo’s Coronavirus Data Portal
File
Denodo Express
COVID-19 Edition
Data
Catalog
Data
Portal
JDBC
ODBC
API
GraphQL
GeoJSON
Sandbox
Sandbox
Sandbox
22
Connected Data Sources
Australian Bureau of Statistics Labor Force
Survey
ACAPS
Air Quality Open Data Platform
Allen Institute for AI
ArcGIS Hub
Becker Friedman Institute for Research in
Economics, University of Chicago
California Health and Human Services (CHHS)
Carnegie Mellon University
Centraal Bureau voor de Statistiek (CBS),
Netherlands
COVID19-India (covid19india.org)
Data Science for Social Impact Research Group
(DSFSI), University of Pretoria
Dipartimento della Protezione Civile, Italy
Europa Press
European Centre for Disease Prevention and
Control (ECDC)
Federal Ministry of Social Affairs, Health, Care
and Consumer Protection (BMSGPK), Austria
France GEOJSON
French Government Open Data (data.gouv.fr)
GlobalHealth 50/50
Google - COVID-19 Community Mobility
Reports
Hong Kong Department of Health
Humanitarian Data Exchange
Institute for Health Metrics and Evaluation
(IHME)
Instituto de Salud Carlos III
International Monetary Fund (IMF)
Istituto Nazionale di Statistica, Italy
Johns Hopkins University (JHU) Center for
Systems Science and Engineering (CSSE)
Junta de Castilla y Léon
Kaiser Family Foundation (KFF)
Ministerio de Sanidad, Spain
Ministry of Health of New Zealand
Ministry of Health, Brazil
Ministry of Health, Consumer Affairs and
Social Welfare, Spain
Ministry of Health, Labor and Welfare, Japan
National Institute for Health (NIH) - National
Library of Medicine (NLM)
Netherlands National Institute for Public
Health and the Environment (RIVM)
New York City Department of Health and
Mental Hygiene (DOHMH)
Office for National Statistics, UK
Organisation for Economic Co-operation and
Development (OECD)
Our World in Data
Public Health England
Robert Koch Institute (RKI)
RSS News Feeds
San Francisco Department of Public Health
(SFDPH)
Servicio Publico de Empleo Estatal (SEPE),
Spain
Statista.com
Statistics Austria
Statistics Canada
Statistics Norway
Statistics Sweden
Taiwan Centers for Disease Control
Texas Department of State, Health Services
Thailand Department for Disease Control
The COVID Tracking Project
The Economist
The Government of the Hong Kong Special
Administrative Region - Census and Statistics
Department
The New York Times
The World Bank
United Kingdom Government Open Data
(gov.uk)
United Nations Educational, Scientific and
Cultural Organization (UNESCO)
United Nations Population Division, Department
of Economic and Social Affairs
US Department of Labor
Wharton School of Business, University of
Pennsylvania
World Health Organization (WHO)
23
So, Let’s Have a Look…
https://coronavirusdataportal.com
Summary & Conclusions
25
Comparing Apples to Oranges
• Data Virtualization and ‘Data Blending’ serve two different purposes
• Data Blending is focused on a single vendor’s toolset
• It makes it easier for ‘citizen analysts’ to use a specific BI Tool
• It provides a semantic layer for that specific toolset
• It has limitations on real-time use
• Data Virtualization provides an enterprise-wide data fabric layer
• Supports many different consuming tools
• Creates a general purpose semantic layer for all users
• Can mix data delivery modes without limitations
• Use the right tool for the right task
26
Myth #2:
BI Tools and Data Virtualization
are Interchangeable.
Q&A
Thanks!
www.denodo.com info@denodo.com
© Copyright Denodo Technologies. All rights reserved
Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm,
without prior the written authorization from Denodo Technologies.

More Related Content

What's hot (20)

PPTX
Cepta The Future of Data with Power BI
Kellyn Pot'Vin-Gorman
 
PPTX
Power BI for Big Data and the New Look of Big Data Solutions
James Serra
 
PPTX
Power bi
jainema23
 
PDF
Data Virtualization Primer - Introduction
Kenneth Peeples
 
PDF
Data Integration through Data Virtualization (SQL Server Konferenz 2019)
Cathrine Wilhelmsen
 
PPTX
Free Training: How to Build a Lakehouse
Databricks
 
PDF
Data visualization with sql analytics
Databricks
 
PDF
Microsoft and Hortonworks Delivers the Modern Data Architecture for Big Data
Hortonworks
 
PPTX
Data Lake Overview
James Serra
 
PDF
Introduction to Data Vault Modeling
Kent Graziano
 
PDF
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Kent Graziano
 
PPTX
Building a Big Data Solution
James Serra
 
PPTX
Introduction to Microsoft SQL Server 2008 R2 Analysis Service
Quang Nguyễn Bá
 
PPTX
Data Virtualization and ETL
Lily Luo
 
PPTX
Introduction to Microsoft’s Master Data Services (MDS)
James Serra
 
PPTX
Power BI Overview, Deployment and Governance
James Serra
 
PDF
An introduction to data virtualization in business intelligence
David Walker
 
PPT
Big data insights with Red Hat JBoss Data Virtualization
Kenneth Peeples
 
PDF
Demystifying Data Warehousing as a Service - DFW
Kent Graziano
 
PPTX
Top Five Cool Features in Oracle SQL Developer Data Modeler
Kent Graziano
 
Cepta The Future of Data with Power BI
Kellyn Pot'Vin-Gorman
 
Power BI for Big Data and the New Look of Big Data Solutions
James Serra
 
Power bi
jainema23
 
Data Virtualization Primer - Introduction
Kenneth Peeples
 
Data Integration through Data Virtualization (SQL Server Konferenz 2019)
Cathrine Wilhelmsen
 
Free Training: How to Build a Lakehouse
Databricks
 
Data visualization with sql analytics
Databricks
 
Microsoft and Hortonworks Delivers the Modern Data Architecture for Big Data
Hortonworks
 
Data Lake Overview
James Serra
 
Introduction to Data Vault Modeling
Kent Graziano
 
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Kent Graziano
 
Building a Big Data Solution
James Serra
 
Introduction to Microsoft SQL Server 2008 R2 Analysis Service
Quang Nguyễn Bá
 
Data Virtualization and ETL
Lily Luo
 
Introduction to Microsoft’s Master Data Services (MDS)
James Serra
 
Power BI Overview, Deployment and Governance
James Serra
 
An introduction to data virtualization in business intelligence
David Walker
 
Big data insights with Red Hat JBoss Data Virtualization
Kenneth Peeples
 
Demystifying Data Warehousing as a Service - DFW
Kent Graziano
 
Top Five Cool Features in Oracle SQL Developer Data Modeler
Kent Graziano
 

Similar to Myth Busters II: BI Tools and Data Virtualization are Interchangeable (20)

PDF
Getting Started with Data Virtualization – What problems DV solves
Denodo
 
PDF
Denodo Partner Connect: Technical Webinar - Ask Me Anything
Denodo
 
PDF
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Denodo
 
PDF
Virtualisation de données : Enjeux, Usages & Bénéfices
Denodo
 
PDF
Next Gen Analytics Going Beyond Data Warehouse
Denodo
 
PDF
How to Place Data at the Center of Digital Transformation in BFSI
Denodo
 
PDF
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Denodo
 
PDF
Data Marketplace - Rethink the Data
Denodo
 
PPT
Why Data Virtualization? An Introduction by Denodo
Justo Hidalgo
 
PDF
Reinvent Your Data Management Strategy for Successful Digital Transformation
Denodo
 
PDF
Cloud Modernization and Data as a Service Option
Denodo
 
PDF
Modern Data Management for Federal Modernization
Denodo
 
PDF
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Denodo
 
PDF
Myth Busters IV: I Access My Data Through APIs–Data Virtualization Can't Do This
Denodo
 
PDF
Future of Data Strategy (ASEAN)
Denodo
 
PDF
Self-Service Analytics with Guard Rails
Denodo
 
PDF
Introduction to Modern Data Virtualization 2021 (APAC)
Denodo
 
PDF
Three Dimensions of Data as a Service
Denodo
 
PDF
The Perfect Storm: The Impact of Analytics, Big Data and Analytics
Inside Analysis
 
PDF
Denodo Platform 7.0: What's New?
Denodo
 
Getting Started with Data Virtualization – What problems DV solves
Denodo
 
Denodo Partner Connect: Technical Webinar - Ask Me Anything
Denodo
 
Self Service Analytics and a Modern Data Architecture with Data Virtualizatio...
Denodo
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Denodo
 
Next Gen Analytics Going Beyond Data Warehouse
Denodo
 
How to Place Data at the Center of Digital Transformation in BFSI
Denodo
 
Belgium & Luxembourg dedicated online Data Virtualization discovery workshop
Denodo
 
Data Marketplace - Rethink the Data
Denodo
 
Why Data Virtualization? An Introduction by Denodo
Justo Hidalgo
 
Reinvent Your Data Management Strategy for Successful Digital Transformation
Denodo
 
Cloud Modernization and Data as a Service Option
Denodo
 
Modern Data Management for Federal Modernization
Denodo
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Denodo
 
Myth Busters IV: I Access My Data Through APIs–Data Virtualization Can't Do This
Denodo
 
Future of Data Strategy (ASEAN)
Denodo
 
Self-Service Analytics with Guard Rails
Denodo
 
Introduction to Modern Data Virtualization 2021 (APAC)
Denodo
 
Three Dimensions of Data as a Service
Denodo
 
The Perfect Storm: The Impact of Analytics, Big Data and Analytics
Inside Analysis
 
Denodo Platform 7.0: What's New?
Denodo
 
Ad

More from Denodo (20)

PDF
Enterprise Monitoring and Auditing in Denodo
Denodo
 
PDF
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Denodo
 
PDF
Achieving Self-Service Analytics with a Governed Data Services Layer
Denodo
 
PDF
What you need to know about Generative AI and Data Management?
Denodo
 
PDF
Mastering Data Compliance in a Dynamic Business Landscape
Denodo
 
PDF
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Denodo
 
PDF
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Denodo
 
PDF
Drive Data Privacy Regulatory Compliance
Denodo
 
PDF
Знакомство с виртуализацией данных для профессионалов в области данных
Denodo
 
PDF
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo
 
PDF
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Denodo
 
PDF
Lunch and Learn ANZ: Key Takeaways for 2023!
Denodo
 
PDF
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Denodo
 
PDF
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Denodo
 
PDF
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
Denodo
 
PDF
How to Build Your Data Marketplace with Data Virtualization?
Denodo
 
PDF
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Denodo
 
PDF
Enabling Data Catalog users with advanced usability
Denodo
 
PDF
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
Denodo
 
PDF
GenAI y el futuro de la gestión de datos: mitos y realidades
Denodo
 
Enterprise Monitoring and Auditing in Denodo
Denodo
 
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Denodo
 
Achieving Self-Service Analytics with a Governed Data Services Layer
Denodo
 
What you need to know about Generative AI and Data Management?
Denodo
 
Mastering Data Compliance in a Dynamic Business Landscape
Denodo
 
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Denodo
 
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Denodo
 
Drive Data Privacy Regulatory Compliance
Denodo
 
Знакомство с виртуализацией данных для профессионалов в области данных
Denodo
 
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo
 
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Denodo
 
Lunch and Learn ANZ: Key Takeaways for 2023!
Denodo
 
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Denodo
 
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Denodo
 
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
Denodo
 
How to Build Your Data Marketplace with Data Virtualization?
Denodo
 
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Denodo
 
Enabling Data Catalog users with advanced usability
Denodo
 
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
Denodo
 
GenAI y el futuro de la gestión de datos: mitos y realidades
Denodo
 
Ad

Recently uploaded (20)

PPTX
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PDF
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
PPTX
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
PPTX
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
PDF
apidays Singapore 2025 - Trustworthy Generative AI: The Role of Observability...
apidays
 
PDF
NIS2 Compliance for MSPs: Roadmap, Benefits & Cybersecurity Trends (2025 Guide)
GRC Kompas
 
PDF
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
PDF
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
PDF
Data Retrieval and Preparation Business Analytics.pdf
kayserrakib80
 
PPT
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
PDF
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
PPTX
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
PDF
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
PDF
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
PDF
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
PPT
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
PDF
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
PPTX
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 
apidays Helsinki & North 2025 - From Chaos to Clarity: Designing (AI-Ready) A...
apidays
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
apidays Singapore 2025 - The API Playbook for AI by Shin Wee Chuang (PAND AI)
apidays
 
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
apidays Singapore 2025 - Trustworthy Generative AI: The Role of Observability...
apidays
 
NIS2 Compliance for MSPs: Roadmap, Benefits & Cybersecurity Trends (2025 Guide)
GRC Kompas
 
OOPs with Java_unit2.pdf. sarthak bookkk
Sarthak964187
 
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
Data Retrieval and Preparation Business Analytics.pdf
kayserrakib80
 
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
ER_Model_with_Diagrams_Presentation.pptx
dharaadhvaryu1992
 
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
apidays Helsinki & North 2025 - APIs at Scale: Designing for Alignment, Trust...
apidays
 

Myth Busters II: BI Tools and Data Virtualization are Interchangeable

  • 1. W E B I N A R S E R I E S BI Tools and Data Virtualization are Interchangeable
  • 2. W E B I N A R S E R I E S BI Tools and Data Virtualization are Interchangeable Paul Moxon SVP Data Architectures & Chief Evangelist Denodo 17nd June 2020
  • 3. Paul Moxon SVP Data Architectures & Chief Evangelist, Denodo Speakers
  • 4. 1. Today’s Myth 2. Origins of the Myth 3. Just the Facts Ma’am 4. The Proof is in the Pudding 5. Conclusions 6. Q&A 7. Next Steps Agenda
  • 5. 5 Myth #2: BI Tools and Data Virtualization are Interchangeable
  • 7. 7 Welcome to my Universe • BusinessObjects added Universe as semantic layer to BI tool • Special tools to design business-oriented data objects • Hide technical nature of physical data storage • Initially use Data Federator to access multiple data sources • Multi-source Universe capability subsumed Data Federator tool • Made BusinessObjects the leading BI Tool vendor • Increased usability and appeal to ‘citizen analysts’
  • 8. 8 Follow the Leader • Other vendors followed this approach • MicroStrategy, Cognos, etc. • New entrants initially focused on visualization and analysis of data • Tableau, Qlik, Power BI • Quickly added ‘data blending’ capabilities • Support multiple data source integration • With limitations 
  • 9. 9 Data Blending Everywhere • Most reporting tools now offer capabilities to create reports with data coming from multiple data sources • Some in real time, with their own federation engines (e.g. Tableau, MicroStrategy, Business Objects, etc.) • Some based on replication in the reporting tool engine (Qlik, SiSense, ThoughtSpot, etc.) • Some of them also provide data modeling capabilities (Looker, Business Objects, MicroStrategy, PowerBI, etc.) So if I can have multi-source queries and define a logical model in my reporting tool, why would I need Data Virtualization?
  • 10. Just the Facts, Ma’am
  • 11. 11 Source: “Gartner Market Guide for Data Virtualization, November 16, 2018” Data virtualization can be used to create virtualized and integrated views of data in-memory rather than executing data movement and physically storing integrated views in a target data structure. It provides a layer of abstraction above the physical implementation of data, to simplify query logic.
  • 12. 12 What is Data Virtualization? Consume in business applications Combine related data into views Connect to disparate data sources 2 3 1 DATA CONSUMERS DISPARATE DATA SOURCES Enterprise Applications, Reporting, BI, Portals, ESB, Mobile, Web, Users Databases & Warehouses, Cloud/Saas Applications, Big Data, NoSQL, Web, XML, Excel, PDF, Word... Analytical Operational Less StructuredMore Structured CONNECT COMBINE PUBLISH Multiple Protocols, Formats Query, Search, Browse Request/Reply, Event Driven Secure Delivery SQL, MDX Web Services Big Data APIs Web Automation and Indexing CONNECT COMBINE CONSUME Share, Deliver, Publish, Govern, Collaborate Discover, Transform, Prepare, Improve Quality, Integrate Normalized views of disparate data “Data virtualization integrates disparate data sources in real time or near-real time to meet demands for analytics and transactional data.” – Create a Road Map For A Real-time, Agile, Self- Service Data Platform, Forrester Research, Dec 16, 2015
  • 13. 13 What is Data Virtualization? 1. Single Access Point to all Data at any location 2. Semantic Layer – Expose Data in Business-Friendly form, adapted to the needs of each consumer 3. Abstract changes in the underlying infrastructure 4. Single entry point to apply security and governance policies 5. Avoid data replication: Up to 80% reduction in integration costs, in terms of resources and technology data
  • 14. 14 (Almost) Any-to-Many Connectivity Relational Databases • MS SQL*Server (JDBC, ODBC): 2000, 2005, 2008, 2008R2, 2012, 2014, 2016 • Oracle (JDBC): 8i, 9i, 10g, 11g, 12c, 18 • Oracle E-Business Suite (JDBC): 12 • IBM DB2 (JDBC): 8, 9, 10, 11, 12 for LUW; 9,10 for z/OS • Informix (JDBC): 7, 12 • Sybase Adaptive Server Enterprise (JDBC): 12, 15 • MySQL (JDBC): 4, 5 • PostgreSQL (JDBC): 8, 9 • Denodo Platform (JDBC): 5.5, 6.0, 7.0 - For multi-location architecture deployments • MS Access (ODBC) • Apache Derby (JDBC): 10 • Generic (JDBC) In-Memory Databases • SAP HANA (JDBC): 1 • Oracle TimesTen (JDBC): 11g • Oracle 12c In-Memory Parallel databases and appliances • GreenPlum (JDBC): 4.2 • HP Vertica (JDBC): 7, 8 • Oracle Exadata (JDBC): X5-2 • ParAccel 8.0.2 (using ParAccel 2.5.0.0 JDBC3g/SSL driver) • Netezza (JDBC): 4.6, 5.0, 6.0, 7.0 • SybaseIQ (JDBC) 12.x, 15.x • Teradata (JDBC): 12, 13, 14, 15 Multi-Dimensional Sources • SAP BW (BAPI/XMLA): 3.x • SAP BI 7.x (BAPI): 7.x • Mondrian (XMLA): 3.x • MS SQL Server Analysis Services 200x • Essbase (XMLA): 9, 11 Cloud Data Warehouse • Amazon Redshift (JDBC) • Amazon Athena (JDBC) • Amazon Aurora (JDBC) • Snowflake (JDBC) • Amazon DynamoDB • Azure SQL Data Warehouse • Azure CosmosDB (SQL API and MongoDB API) Big Data/NoSQL • Apache Hive (JDBC): 0.12, 1.1.0, 1.1.0 for Cloudera 1.2.1 for Hortonworks 2.0.0 • MapR-XD, MapR-DB, MapR-ES, Hive, and Drill for MapR 6.1 • Impala (JDBC): 2.3 • Spark SQL (JDBC): 1.5, 1.6 • Google BigQuery (JDBC) • Presto (JDBC) Web Automation • Denodo’s ITPilot automates extraction from web pages Indexes and unstructured content • CMS, file systems, pdf, word, text, email servers, knowledge bases, indexes • Elastic Search Web Services • SOAP • REST (XML, RSS, ATOM, JSON) • OData v2 and v4 Packaged Applications • SAP ERP/ECC (BAPIs and RFC tables) • Oracle E-Business Suite 12 • Siebel • SAS (SAS JDBC Driver): 7 and higher Semantic Repositories • Semantic repositories in Triple Stores / RDF accessed through SPARQL endpoints. Flat and Binary Files • CSV, pipe-delimited, Regular expression-parsed • MS Excel xls 97-2003 • MS Excel xlsx 2007 or later • MS Access • XML • JSON All files can be locally accessible or in remote filesystems, through FTP/ SFTP/FTPS, and in clear, zipped and/or encrypted format. Active Directory as source or leveraging security • LDAP v3 • Microsoft Active Directory 2003, 2008 Cloud, SaaS, Web Sources with Simplified OAuth Security • Amazon • Google • Facebook • LinkedIn • MS Azure Data Lake • MS SharePoint (by using the OData connector) • MS Dynamics • ServiceNow • Marketo • Salesforce • Twitter via APIs with simplified Oauth integration (1.0, 1.0a and 2.0) • Workday MS Queues as data source and Delivery • MQSeries • SonicMQ • ActiveMQ • Tibco EMS Denodo SDK for Custom Connectors • CouchDB • Lotus Domino • MongoDB and Mongo Atlas DBaaS Mainframe • IMS • IBM IMS native drivers: 8, 9 • IMS Universal Drivers: 11 Hierarchical databases • Adabas (SOA Gateway and Denodo’s SOAP connector): 5, 6 Legacy • Microsoft FoxPro (ODBC) The following data sources have been successfully tested with Denodo using JDBC and ODBC drivers, WS/SOAP and WS/REST, and DenodoConnect adapters (not exhaustive list): • Apache Solr • Kafka Messages • SAS Files • Hadoop HBase • Hadoop HCatalog • Hadoop HDFS (Avro, CSV, Parquet) • Files in Amazon S3 (incl. Parquet files) • IBM BigInsights • Pivotal HAWQ
  • 15. 15 (Almost) Any-to-Many Connectivity Many Consumers Protocols and Formats • SQL Based access via JDBC, ODBC and ADO.NET • Web Services • SOAP (XML/JSON) • REST (JSON/XML) • OData • Open API (a.k.a Swagger) • Web Parts (for SharePoint), Portlets • Kafka and JMS listeners for message queues • Denodo Scheduler for batch process and ‘ETL lite’ Security Options • Authentication using LDAP or Active Directory • Kerberos for Single Sign-On (SSO) • OAuth, OAuth 2.0 (JWT) • SAML • SSL/TLS • WS-Security, X.509 certificates BI/Reporting tools • Microstrategy, Cognos, Business Objects, Oracle OBIEE • Tableau, Qlikview, Spotfire, Microsoft PowerBI • Excel Analytical Tools/Languages • SAS, Statistica, SPSS, MatLab • R, Python, Java, Scala, etc. • Azure ML Studio, Amazon Machine Learning Portals • SharePoint, Enterprise portals, Web/mobile apps Enterprise Service Bus • Oracle Service Bus, Azure Service Bus, TIBCO Active Matrix Bus ETL tools • SAP Data Services, Informatica Powercenter, IBM Data Stage, Talend ETL API Management tools • CA (Layer 7), TIBCO Mashery, Apigee
  • 16. 16 Data Blending – Semantic Silos
  • 17. 17 Data Blending Silos Q: Is SAP planning to release SAP Universe connections for Power BI and Tableau? A: The answer is no. No. There are no plans for this. Gregory Botticchio, Director of Product Management, SAP BusinessObjects Suite 360 webinar for SAP BusinessObjects 4.3 Release Preview Beside SAP BusinessObjects, are you using other analytics solution(s)?
  • 18. 18 Data Blending Limitations Shared Dataset (Import Mode) Shared Dataset (Direct Mode) Direct mode is limited to 1 data source and 1 million rows
  • 19. 19 Francois Ajenstat, Chief Product Officer, Tableau Software There are two flows; the ad-hoc and the operational…where we are coming from is…I just want to integrate these two sources. It's not formalized, per se, it's not a project. I just want to connect this and this and I want to analyze it. How do we go from data to analysis as quickly as possible? And when you want to formalize it, operationalize it, make it repeatable, then [you use other tools].
  • 20. The Proof is in the Pudding
  • 21. 21 Denodo’s Coronavirus Data Portal File Denodo Express COVID-19 Edition Data Catalog Data Portal JDBC ODBC API GraphQL GeoJSON Sandbox Sandbox Sandbox
  • 22. 22 Connected Data Sources Australian Bureau of Statistics Labor Force Survey ACAPS Air Quality Open Data Platform Allen Institute for AI ArcGIS Hub Becker Friedman Institute for Research in Economics, University of Chicago California Health and Human Services (CHHS) Carnegie Mellon University Centraal Bureau voor de Statistiek (CBS), Netherlands COVID19-India (covid19india.org) Data Science for Social Impact Research Group (DSFSI), University of Pretoria Dipartimento della Protezione Civile, Italy Europa Press European Centre for Disease Prevention and Control (ECDC) Federal Ministry of Social Affairs, Health, Care and Consumer Protection (BMSGPK), Austria France GEOJSON French Government Open Data (data.gouv.fr) GlobalHealth 50/50 Google - COVID-19 Community Mobility Reports Hong Kong Department of Health Humanitarian Data Exchange Institute for Health Metrics and Evaluation (IHME) Instituto de Salud Carlos III International Monetary Fund (IMF) Istituto Nazionale di Statistica, Italy Johns Hopkins University (JHU) Center for Systems Science and Engineering (CSSE) Junta de Castilla y Léon Kaiser Family Foundation (KFF) Ministerio de Sanidad, Spain Ministry of Health of New Zealand Ministry of Health, Brazil Ministry of Health, Consumer Affairs and Social Welfare, Spain Ministry of Health, Labor and Welfare, Japan National Institute for Health (NIH) - National Library of Medicine (NLM) Netherlands National Institute for Public Health and the Environment (RIVM) New York City Department of Health and Mental Hygiene (DOHMH) Office for National Statistics, UK Organisation for Economic Co-operation and Development (OECD) Our World in Data Public Health England Robert Koch Institute (RKI) RSS News Feeds San Francisco Department of Public Health (SFDPH) Servicio Publico de Empleo Estatal (SEPE), Spain Statista.com Statistics Austria Statistics Canada Statistics Norway Statistics Sweden Taiwan Centers for Disease Control Texas Department of State, Health Services Thailand Department for Disease Control The COVID Tracking Project The Economist The Government of the Hong Kong Special Administrative Region - Census and Statistics Department The New York Times The World Bank United Kingdom Government Open Data (gov.uk) United Nations Educational, Scientific and Cultural Organization (UNESCO) United Nations Population Division, Department of Economic and Social Affairs US Department of Labor Wharton School of Business, University of Pennsylvania World Health Organization (WHO)
  • 23. 23 So, Let’s Have a Look… https://coronavirusdataportal.com
  • 25. 25 Comparing Apples to Oranges • Data Virtualization and ‘Data Blending’ serve two different purposes • Data Blending is focused on a single vendor’s toolset • It makes it easier for ‘citizen analysts’ to use a specific BI Tool • It provides a semantic layer for that specific toolset • It has limitations on real-time use • Data Virtualization provides an enterprise-wide data fabric layer • Supports many different consuming tools • Creates a general purpose semantic layer for all users • Can mix data delivery modes without limitations • Use the right tool for the right task
  • 26. 26 Myth #2: BI Tools and Data Virtualization are Interchangeable.
  • 27. Q&A
  • 28. Thanks! www.denodo.com [email protected] © Copyright Denodo Technologies. All rights reserved Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm, without prior the written authorization from Denodo Technologies.