SlideShare a Scribd company logo
How To Manage Large
Deployments
Juan Lozano, Principal Technical Account
Manager
Agenda1.Introduction
2.Environment Management
3.Metadata Synchronization and migrations
4.Monitoring
5.Resource Management
Introduction
4
Introduction
Denodo deployments can go from a few developers and users
executing thousands of queries a day to hundreds of developers and
users executing hundreds of thousands of queries a day. In order to
manage both types of deployments Denodo provides multiple
features that simplify this task.
These features will help with the management of environments,
metadata synchronization, migrations, resource management and
monitoring.
5
Challenges in Large Deployments
Managing multiple environments with different dependent parameters such as
connectivity credentials
Migration process between different stages of the release lifecycle can be time
consuming and prone to errors
Resources can be over loaded based on the number of users or queries
Finding and diagnosing an issue in one of the servers of the deployment can
be difficult as there are multiple environments and/or servers that can be part
of a cluster
6
Solutions
Denodo fully integrates with a version control system
Dependent parameters are organized by environments
Simple migration process between different environments by providing both a GUI and
a set of scripts that can be fully automated and integrated with a current migration
protocol
Resource manager integrated in the administration tool to control the system
resources with a big level of granularity
Denodo monitoring and diagnosing web allows to configure a set of environments or
servers so all the information for each node, including historical, is available from a
single interface
7
Deployment Overview
Administration and
Development Tool DEVELOPMENT
• Integrated Version
Control for multiple
environments
• Includes
Import/Export
functionalities
• Resources are
managed from the same
tool
Load Balancer
STAGING PRODUCTION
CLUSTER
• Integrated in the
Denodo Server
• It provides a web
interface for easy
access
Monitoring and
diagnostic
Replicator &
automated scripts
• GUI
• Scripts that can be
integrated with other
processes
Testing Tool
• Automated Testing
• No programming
needed
Integrated
Version
Control
MS TFSSubversion git
Local Development
Server
Environment Management
9
Version Control
Built-in integration with version control systems: Subversion, Git and MS TFS
 Support for team development and multiple environment migrations
 Integrated dependency management and conflicts control
10
Version Control
Bob Alice
Development
Server
Staging
Server
Demo_svn_Bob Demo_svn_Alice
Demo_svn Demo_svn
VCS
Repository
Demo_svn
(consolidated
Development
database)
Replicator
& Scripts
11
Dependent parameters and Environments
Denodo includes the concept of environment for manage dependent metadata
such as the data sources connection properties (e.g. login/passwords)
Environments can represent, for example, different development servers for
different groups and/or different geographical locations
When loading metadata from a different environment, the process can be
configured to use the properties of the target environment instead of the origin
ones
12
Version Control Environments
By following this approach we can easily maintain all our dependent
metadata organized and ready to be used
13
Version Control
Bob Alice
Development
Server in New York
Staging
Server
Demo_svn_Bob Demo_svn_Alice
Demo_svn Demo_svn
VCS
Repository
Demo_svn
(consolidated
Development
database)
Replicator
& Scripts
Local Development
Server in
Palo Alto
PA Environment
Properties
NY Environment
Properties
Staging Environment
Properties
Metadata Synchronization
and Migrations
15
Metadata Synchronization and Migrations
Denodo provides different tools to help with metadata synchronization and migration process:
VDP Administration Tool GUI
Shell Scripts
Denodo Replicator
Denodo Testing Tool
16
VDP Administration Tool GUI
Denodo admin tool provides graphical wizards to export/import
metadata and environment dependent properties
17
Shell Scripts
Denodo provides scripts for performing periodical backup copies and import metadata
to a list of servers
These scripts offer the same options as the VDP admin interface for import and export
The scripts can be found in the Denodo_Homebin folder but also as an standalone tool
in the DENODO_HOMEtoolsdbdenodo-db-tools folder
These scripts can be used to automate the promotion/migration process adding any
validation rules as needed
18
Denodo Testing Tool
18
The Denodo Testing Tool allows Denodo users to easily automate the testing of their
data virtualization scenarios.
Tests are specified in text files and organized in folders. No programming needed.
Test sets are executed in a completely automated manner.
19
Creating an automated script
We can combine the shell scripts with the Testing Tool and create a fully automated
process, for example our script can:
Obtain the consolidated version (VCS) from the development server
Load that version into the staging server but using the environment dependent
parameters file
Execute the Denodo Testing Tool group of tests
In case of error inform of tests that failed
In other case tag the version
20
Denodo Replicator
Denodo Replicator can be used to graphically manage an
environment and synchronize the metadata among its members
Metadata replication
Monitoring and diagnosis
22
■ See current sessions, queries, connections, cache
load processes…
■ See resources usage in each server (CPU,
memory, connections,…)
■ Inspect data sources and cache statistics
(connection pools, response times, active
requests…)
■ Inspect errors, warnings and system logs
■ Integrates Monitoring Functionality from VDP
Admin Tool, Denodo Dashboard and Denodo
Monitor
■ Go “back in time” to the moment where a
problem happened
■ Graphically inspect and browse all the
information provided by the Denodo Monitor
and server logs:
■ Active requests and sessions
■ Resources Usage
■ Data source statistics
■ Integrates diagnosing information currently
dispersed in logs:
■ Graphical Analysis of incidents
Monitoring and Diagnosing Tool (I)
Graphical Monitoring of Servers and Clusters; Graphical Problem Diagnosing
23
Monitoring and Diagnosing Tool (II)
24
Monitoring and Diagnosing Tool (III)
Environments and servers can be created
Servers can be added to an environment
Start monitoring a server/environment by double clicking
in an item or by right clicking in the desired item and
choosing the desired option
Diagnostic information about an specific server can be
loaded by an option of the right context menu (new
“child” of this server appears in the tree)
25
Monitoring and Diagnosing Tool (and IV)
Data of monitoring and diagnosing is organized in 8 categories:
State: Summary information of the state of the server/environment
Resources: Information about physical resources (memory, cpu,…)
Requests: Information about requests, including real-time execution trace
Session: Currently opened sessions, including client application
Cache: Information about cache load processes and cache contents
Datasources:Information about activity in the datasources
Threads: Information about threads (priorities, CPU usage)
Errors: Inspect logged errors and warnings
… and many others
Filter and sort information by any criteria: e.g. see data sources used by
a query, see requests from a given session,…
Use Case Scenario: Slow queries (I)
This sample scenario shows a possible use case in diagnostic
mode
Users have reported performance problems with VDP server
‘server1’ during query execution times are bigger than expected
The issues were reported between 19:15 and 21:00 on March 3
The goal is to detect the root cause of the problem
26
Use Case Scenario: Slow queries (II)
The diagnostic logs saved during March 3 are loaded
A new main tab “Server1 March 3” is opened with data for the loaded
diagnostic data
27
Use Case Scenario: Slow queries (III)
A time filter is created for the conflicting time period
Now you can inspect all the metrics of the system during that
period
28
Use Case Scenario: Slow queries (IV)
The ‘State’ tab shows a clear increment of the ‘waiting requests’
29
Use Case Scenario: Slow queries (V)
A filter on ‘Requests’ Tab shows that a significant number of
queries failed or finished with timeout in that period
We explore the data sources used by some of the failed queries
30
Use Case Scenario: Slow queries (VI)
The connection pool of the data source is frequently full during
the period -> requests are being queued
Suggests pool is too small or more queries than usual
31
Use Case Scenario: Slow queries (VII)
We continue the analysis and order the requests by client
application. We notice the application ’single_customer’ view
executed many long-running requests during the period
32
Use Case Scenario: Slow queries (VIII)
We examine the query plan of some of the queries and notice that
they are complex, long-running reports
33
Use Case Scenario: Slow queries (and IX)
Possible solutions:
Use Resource Manager to limit the maximum number of concurrent
requests from the “Single Customer View” application, so they do not
monopolize the data source
Ask the administrators of the “Single Customer View” application to
distribute the execution of reports in a longer time period
Cache part of the data to distribute the workload
Increase the size of the pool if the data source can support the workload
34
Resources Management
Workload Management
Ensure fair distribution of resources among applications / users
Allocate available resources according to business priorities
Sessions classified into groups according to criteria such as user/role,
application, time,…
Sessions are assigned to resource groups, which establish restrictions:
Apply always or only under heavy CPU usage
Example restrictions:
Change execution priorities
Max. number of concurrent queries
Limit execution time
36
■ Create plan restrictions.
Denodo Resource Manager (I)
■ Rules Classify Sessions into Groups (e.g. by user, application,…)
■ E.g. Sessions from application ‘single customer view’ are assigned to plan
called ‘limit_concurrent_queries_to_10’
Denodo Resource Manager (II)
Thanks!
www.denodo.com info@denodo.com
© Copyright Denodo Technologies. All rights reserved
Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical,
including photocopying and microfilm, without prior the written authorization from Denodo Technologies.

More Related Content

What's hot (20)

PDF
Denodo Data Virtualization Platform Architecture: Performance (session 2 from...
Denodo
 
PDF
Advanced Analytics and Machine Learning with Data Virtualization
Denodo
 
PDF
Denodo DataFest 2017: Business Needs for a Fast Data Strategy
Denodo
 
PDF
Denodo Data Virtualization Platform architecture: Data Discovery and Data Gov...
Denodo
 
PDF
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Denodo
 
PDF
Data Virtualization: The Agile Delivery Platform
Denodo
 
PDF
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Denodo
 
PDF
Logical Data Warehouse: The Foundation of Modern Data and Analytics (APAC)
Denodo
 
PDF
Can data virtualization uphold performance with complex queries?
Denodo
 
PPTX
Fast Data Strategy Houston Roadshow Presentation
Denodo
 
PDF
Virtualisation de données : Enjeux, Usages & Bénéfices
Denodo
 
PDF
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Denodo
 
PDF
Data Virtualization: An Essential Component of a Cloud Data Lake
Denodo
 
PDF
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
Denodo
 
PPTX
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
Denodo
 
PDF
Enabling Cloud Data Integration (EMEA)
Denodo
 
PDF
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Denodo
 
PDF
GDPR Noncompliance: Avoid the Risk with Data Virtualization
Denodo
 
PDF
Data Virtualization: From Zero to Hero
Denodo
 
PDF
Why Data Mesh Needs Data Virtualization (ASEAN)
Denodo
 
Denodo Data Virtualization Platform Architecture: Performance (session 2 from...
Denodo
 
Advanced Analytics and Machine Learning with Data Virtualization
Denodo
 
Denodo DataFest 2017: Business Needs for a Fast Data Strategy
Denodo
 
Denodo Data Virtualization Platform architecture: Data Discovery and Data Gov...
Denodo
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Denodo
 
Data Virtualization: The Agile Delivery Platform
Denodo
 
Data Virtualization, a Strategic IT Investment to Build Modern Enterprise Dat...
Denodo
 
Logical Data Warehouse: The Foundation of Modern Data and Analytics (APAC)
Denodo
 
Can data virtualization uphold performance with complex queries?
Denodo
 
Fast Data Strategy Houston Roadshow Presentation
Denodo
 
Virtualisation de données : Enjeux, Usages & Bénéfices
Denodo
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Denodo
 
Data Virtualization: An Essential Component of a Cloud Data Lake
Denodo
 
Denodo DataFest 2016: What’s New in Denodo Platform – Demo and Roadmap
Denodo
 
Denodo Data Virtualization - IT Days in Luxembourg with Oktopus
Denodo
 
Enabling Cloud Data Integration (EMEA)
Denodo
 
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Denodo
 
GDPR Noncompliance: Avoid the Risk with Data Virtualization
Denodo
 
Data Virtualization: From Zero to Hero
Denodo
 
Why Data Mesh Needs Data Virtualization (ASEAN)
Denodo
 

Viewers also liked (13)

PPTX
SQL In/On/Around Hadoop
DataWorks Summit
 
PDF
Denodo DataFest 2016: Big Data Virtualization in the Cloud
Denodo
 
PDF
Denodo DataFest 2016: ROI Justification in Data Virtualization
Denodo
 
PDF
Denodo DataFest 2016: The Role of Data Virtualization in IoT Integration
Denodo
 
PPTX
Powering Self Service Business Intelligence with Hadoop and Data Virtualization
Denodo
 
PDF
いまさら聞けないインフラ勉強会Vol.3~システム監視ツールバトル2013開催案内
Shu Takeda
 
PDF
Getting Started with Data Virtualization – What problems DV solves
Denodo
 
PPTX
Data Virtualization: An Introduction
Denodo
 
PDF
Big Data Fabric: A Recipe for Big Data Initiatives
Denodo
 
PDF
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
Denodo
 
PDF
Teradata - Presentation at Hortonworks Booth - Strata 2014
Hortonworks
 
PDF
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Denodo
 
PDF
[data analytics showcase] B12: サーバー1,000台を監視するということ by 株式会社インサイトテクノロジー 小幡 一郎
Insight Technology, Inc.
 
SQL In/On/Around Hadoop
DataWorks Summit
 
Denodo DataFest 2016: Big Data Virtualization in the Cloud
Denodo
 
Denodo DataFest 2016: ROI Justification in Data Virtualization
Denodo
 
Denodo DataFest 2016: The Role of Data Virtualization in IoT Integration
Denodo
 
Powering Self Service Business Intelligence with Hadoop and Data Virtualization
Denodo
 
いまさら聞けないインフラ勉強会Vol.3~システム監視ツールバトル2013開催案内
Shu Takeda
 
Getting Started with Data Virtualization – What problems DV solves
Denodo
 
Data Virtualization: An Introduction
Denodo
 
Big Data Fabric: A Recipe for Big Data Initiatives
Denodo
 
Education Seminar: Self-service BI, Logical Data Warehouse and Data Lakes
Denodo
 
Teradata - Presentation at Hortonworks Booth - Strata 2014
Hortonworks
 
Extended Data Warehouse - A New Data Architecture for Modern BI with Claudia ...
Denodo
 
[data analytics showcase] B12: サーバー1,000台を監視するということ by 株式会社インサイトテクノロジー 小幡 一郎
Insight Technology, Inc.
 
Ad

Similar to Data Virtualization Deployments: How to Manage Very Large Deployments (20)

PDF
Solution Manager in Denodo Platform 7.0: Admin Made Simple
Denodo
 
PDF
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
Denodo
 
PDF
How Does the Denodo Platform Accelerate Your Time to Insights?
Denodo
 
PDF
Enabling Fast Data Strategy: What’s new in Denodo Platform 6.0
Denodo
 
PDF
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Denodo
 
PDF
Partner Enablement: Key Differentiators of Denodo Platform 6.0 for the Field
Denodo
 
PPTX
Take your Data Management Practice to the Next Level with Denodo 7
Denodo
 
PDF
Why Data Virtualization? An Introduction
Denodo
 
PDF
Wie beschleunigt die Denodo Plattform Ihre Zeit der Erkenntnisgewinnung?
Denodo
 
PDF
Denodo Data Virtualization Platform: Scalability (session 3 from Architect to...
Denodo
 
PDF
Performance Acceleration: Summaries, Recommendation, MPP and more
Denodo
 
PDF
Denodo Platform 7.0: What's New?
Denodo
 
PDF
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Denodo
 
PDF
Maximizing Data Lake ROI with Data Virtualization: A Technical Demonstration
Denodo
 
PDF
Partner Engagement Webinar Series: Highlights from DataFest North America
Denodo
 
PDF
In Memory Parallel Processing for Big Data Scenarios
Denodo
 
PDF
Scaling Multi-Cloud Deployments with Denodo: Automated Infrastructure Management
Denodo
 
PDF
Best Practices for Migrating from Denodo 6.x to 7.0
Denodo
 
PDF
Introduction to Modern Data Virtualization 2021 (APAC)
Denodo
 
PDF
Denodo Platform 7.0: Redefine Analytics with In-Memory Parallel Processing an...
Denodo
 
Solution Manager in Denodo Platform 7.0: Admin Made Simple
Denodo
 
Denodo 6.0: Self Service Search, Discovery & Governance using an Universal Se...
Denodo
 
How Does the Denodo Platform Accelerate Your Time to Insights?
Denodo
 
Enabling Fast Data Strategy: What’s new in Denodo Platform 6.0
Denodo
 
Building a Single Logical Data Lake: For Advanced Analytics, Data Science, an...
Denodo
 
Partner Enablement: Key Differentiators of Denodo Platform 6.0 for the Field
Denodo
 
Take your Data Management Practice to the Next Level with Denodo 7
Denodo
 
Why Data Virtualization? An Introduction
Denodo
 
Wie beschleunigt die Denodo Plattform Ihre Zeit der Erkenntnisgewinnung?
Denodo
 
Denodo Data Virtualization Platform: Scalability (session 3 from Architect to...
Denodo
 
Performance Acceleration: Summaries, Recommendation, MPP and more
Denodo
 
Denodo Platform 7.0: What's New?
Denodo
 
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Denodo
 
Maximizing Data Lake ROI with Data Virtualization: A Technical Demonstration
Denodo
 
Partner Engagement Webinar Series: Highlights from DataFest North America
Denodo
 
In Memory Parallel Processing for Big Data Scenarios
Denodo
 
Scaling Multi-Cloud Deployments with Denodo: Automated Infrastructure Management
Denodo
 
Best Practices for Migrating from Denodo 6.x to 7.0
Denodo
 
Introduction to Modern Data Virtualization 2021 (APAC)
Denodo
 
Denodo Platform 7.0: Redefine Analytics with In-Memory Parallel Processing an...
Denodo
 
Ad

More from Denodo (20)

PDF
Enterprise Monitoring and Auditing in Denodo
Denodo
 
PDF
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Denodo
 
PDF
Achieving Self-Service Analytics with a Governed Data Services Layer
Denodo
 
PDF
What you need to know about Generative AI and Data Management?
Denodo
 
PDF
Mastering Data Compliance in a Dynamic Business Landscape
Denodo
 
PDF
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Denodo
 
PDF
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Denodo
 
PDF
Drive Data Privacy Regulatory Compliance
Denodo
 
PDF
Знакомство с виртуализацией данных для профессионалов в области данных
Denodo
 
PDF
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo
 
PDF
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Denodo
 
PDF
Lunch and Learn ANZ: Key Takeaways for 2023!
Denodo
 
PDF
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Denodo
 
PDF
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Denodo
 
PDF
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
Denodo
 
PDF
How to Build Your Data Marketplace with Data Virtualization?
Denodo
 
PDF
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Denodo
 
PDF
Enabling Data Catalog users with advanced usability
Denodo
 
PDF
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
Denodo
 
PDF
GenAI y el futuro de la gestión de datos: mitos y realidades
Denodo
 
Enterprise Monitoring and Auditing in Denodo
Denodo
 
Lunch and Learn ANZ: Mastering Cloud Data Cost Control: A FinOps Approach
Denodo
 
Achieving Self-Service Analytics with a Governed Data Services Layer
Denodo
 
What you need to know about Generative AI and Data Management?
Denodo
 
Mastering Data Compliance in a Dynamic Business Landscape
Denodo
 
Denodo Partner Connect: Business Value Demo with Denodo Demo Lite
Denodo
 
Expert Panel: Overcoming Challenges with Distributed Data to Maximize Busines...
Denodo
 
Drive Data Privacy Regulatory Compliance
Denodo
 
Знакомство с виртуализацией данных для профессионалов в области данных
Denodo
 
Data Democratization: A Secret Sauce to Say Goodbye to Data Fragmentation
Denodo
 
Denodo Partner Connect - Technical Webinar - Ask Me Anything
Denodo
 
Lunch and Learn ANZ: Key Takeaways for 2023!
Denodo
 
It’s a Wrap! 2023 – A Groundbreaking Year for AI and The Way Forward
Denodo
 
Quels sont les facteurs-clés de succès pour appliquer au mieux le RGPD à votr...
Denodo
 
Lunch and Learn ANZ: Achieving Self-Service Analytics with a Governed Data Se...
Denodo
 
How to Build Your Data Marketplace with Data Virtualization?
Denodo
 
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Denodo
 
Enabling Data Catalog users with advanced usability
Denodo
 
Denodo Partner Connect: Technical Webinar - Architect Associate Certification...
Denodo
 
GenAI y el futuro de la gestión de datos: mitos y realidades
Denodo
 

Recently uploaded (20)

PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
DOCX
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
PPTX
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
PDF
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PPTX
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PDF
SIZING YOUR AIR CONDITIONER---A PRACTICAL GUIDE.pdf
Muhammad Rizwan Akram
 
PPTX
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
PDF
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PDF
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
PPTX
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PPTX
Digital Circuits, important subject in CS
contactparinay1
 
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
Python coding for beginners !! Start now!#
Rajni Bhardwaj Grover
 
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
COMPARISON OF RASTER ANALYSIS TOOLS OF QGIS AND ARCGIS
Sharanya Sarkar
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
SIZING YOUR AIR CONDITIONER---A PRACTICAL GUIDE.pdf
Muhammad Rizwan Akram
 
Q2 FY26 Tableau User Group Leader Quarterly Call
lward7
 
How do you fast track Agentic automation use cases discovery?
DianaGray10
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
The 2025 InfraRed Report - Redpoint Ventures
Razin Mustafiz
 
AI Penetration Testing Essentials: A Cybersecurity Guide for 2025
defencerabbit Team
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
Digital Circuits, important subject in CS
contactparinay1
 
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Safe Software
 

Data Virtualization Deployments: How to Manage Very Large Deployments

  • 1. How To Manage Large Deployments Juan Lozano, Principal Technical Account Manager
  • 2. Agenda1.Introduction 2.Environment Management 3.Metadata Synchronization and migrations 4.Monitoring 5.Resource Management
  • 4. 4 Introduction Denodo deployments can go from a few developers and users executing thousands of queries a day to hundreds of developers and users executing hundreds of thousands of queries a day. In order to manage both types of deployments Denodo provides multiple features that simplify this task. These features will help with the management of environments, metadata synchronization, migrations, resource management and monitoring.
  • 5. 5 Challenges in Large Deployments Managing multiple environments with different dependent parameters such as connectivity credentials Migration process between different stages of the release lifecycle can be time consuming and prone to errors Resources can be over loaded based on the number of users or queries Finding and diagnosing an issue in one of the servers of the deployment can be difficult as there are multiple environments and/or servers that can be part of a cluster
  • 6. 6 Solutions Denodo fully integrates with a version control system Dependent parameters are organized by environments Simple migration process between different environments by providing both a GUI and a set of scripts that can be fully automated and integrated with a current migration protocol Resource manager integrated in the administration tool to control the system resources with a big level of granularity Denodo monitoring and diagnosing web allows to configure a set of environments or servers so all the information for each node, including historical, is available from a single interface
  • 7. 7 Deployment Overview Administration and Development Tool DEVELOPMENT • Integrated Version Control for multiple environments • Includes Import/Export functionalities • Resources are managed from the same tool Load Balancer STAGING PRODUCTION CLUSTER • Integrated in the Denodo Server • It provides a web interface for easy access Monitoring and diagnostic Replicator & automated scripts • GUI • Scripts that can be integrated with other processes Testing Tool • Automated Testing • No programming needed Integrated Version Control MS TFSSubversion git Local Development Server
  • 9. 9 Version Control Built-in integration with version control systems: Subversion, Git and MS TFS  Support for team development and multiple environment migrations  Integrated dependency management and conflicts control
  • 10. 10 Version Control Bob Alice Development Server Staging Server Demo_svn_Bob Demo_svn_Alice Demo_svn Demo_svn VCS Repository Demo_svn (consolidated Development database) Replicator & Scripts
  • 11. 11 Dependent parameters and Environments Denodo includes the concept of environment for manage dependent metadata such as the data sources connection properties (e.g. login/passwords) Environments can represent, for example, different development servers for different groups and/or different geographical locations When loading metadata from a different environment, the process can be configured to use the properties of the target environment instead of the origin ones
  • 12. 12 Version Control Environments By following this approach we can easily maintain all our dependent metadata organized and ready to be used
  • 13. 13 Version Control Bob Alice Development Server in New York Staging Server Demo_svn_Bob Demo_svn_Alice Demo_svn Demo_svn VCS Repository Demo_svn (consolidated Development database) Replicator & Scripts Local Development Server in Palo Alto PA Environment Properties NY Environment Properties Staging Environment Properties
  • 15. 15 Metadata Synchronization and Migrations Denodo provides different tools to help with metadata synchronization and migration process: VDP Administration Tool GUI Shell Scripts Denodo Replicator Denodo Testing Tool
  • 16. 16 VDP Administration Tool GUI Denodo admin tool provides graphical wizards to export/import metadata and environment dependent properties
  • 17. 17 Shell Scripts Denodo provides scripts for performing periodical backup copies and import metadata to a list of servers These scripts offer the same options as the VDP admin interface for import and export The scripts can be found in the Denodo_Homebin folder but also as an standalone tool in the DENODO_HOMEtoolsdbdenodo-db-tools folder These scripts can be used to automate the promotion/migration process adding any validation rules as needed
  • 18. 18 Denodo Testing Tool 18 The Denodo Testing Tool allows Denodo users to easily automate the testing of their data virtualization scenarios. Tests are specified in text files and organized in folders. No programming needed. Test sets are executed in a completely automated manner.
  • 19. 19 Creating an automated script We can combine the shell scripts with the Testing Tool and create a fully automated process, for example our script can: Obtain the consolidated version (VCS) from the development server Load that version into the staging server but using the environment dependent parameters file Execute the Denodo Testing Tool group of tests In case of error inform of tests that failed In other case tag the version
  • 20. 20 Denodo Replicator Denodo Replicator can be used to graphically manage an environment and synchronize the metadata among its members Metadata replication
  • 22. 22 ■ See current sessions, queries, connections, cache load processes… ■ See resources usage in each server (CPU, memory, connections,…) ■ Inspect data sources and cache statistics (connection pools, response times, active requests…) ■ Inspect errors, warnings and system logs ■ Integrates Monitoring Functionality from VDP Admin Tool, Denodo Dashboard and Denodo Monitor ■ Go “back in time” to the moment where a problem happened ■ Graphically inspect and browse all the information provided by the Denodo Monitor and server logs: ■ Active requests and sessions ■ Resources Usage ■ Data source statistics ■ Integrates diagnosing information currently dispersed in logs: ■ Graphical Analysis of incidents Monitoring and Diagnosing Tool (I) Graphical Monitoring of Servers and Clusters; Graphical Problem Diagnosing
  • 24. 24 Monitoring and Diagnosing Tool (III) Environments and servers can be created Servers can be added to an environment Start monitoring a server/environment by double clicking in an item or by right clicking in the desired item and choosing the desired option Diagnostic information about an specific server can be loaded by an option of the right context menu (new “child” of this server appears in the tree)
  • 25. 25 Monitoring and Diagnosing Tool (and IV) Data of monitoring and diagnosing is organized in 8 categories: State: Summary information of the state of the server/environment Resources: Information about physical resources (memory, cpu,…) Requests: Information about requests, including real-time execution trace Session: Currently opened sessions, including client application Cache: Information about cache load processes and cache contents Datasources:Information about activity in the datasources Threads: Information about threads (priorities, CPU usage) Errors: Inspect logged errors and warnings … and many others Filter and sort information by any criteria: e.g. see data sources used by a query, see requests from a given session,…
  • 26. Use Case Scenario: Slow queries (I) This sample scenario shows a possible use case in diagnostic mode Users have reported performance problems with VDP server ‘server1’ during query execution times are bigger than expected The issues were reported between 19:15 and 21:00 on March 3 The goal is to detect the root cause of the problem 26
  • 27. Use Case Scenario: Slow queries (II) The diagnostic logs saved during March 3 are loaded A new main tab “Server1 March 3” is opened with data for the loaded diagnostic data 27
  • 28. Use Case Scenario: Slow queries (III) A time filter is created for the conflicting time period Now you can inspect all the metrics of the system during that period 28
  • 29. Use Case Scenario: Slow queries (IV) The ‘State’ tab shows a clear increment of the ‘waiting requests’ 29
  • 30. Use Case Scenario: Slow queries (V) A filter on ‘Requests’ Tab shows that a significant number of queries failed or finished with timeout in that period We explore the data sources used by some of the failed queries 30
  • 31. Use Case Scenario: Slow queries (VI) The connection pool of the data source is frequently full during the period -> requests are being queued Suggests pool is too small or more queries than usual 31
  • 32. Use Case Scenario: Slow queries (VII) We continue the analysis and order the requests by client application. We notice the application ’single_customer’ view executed many long-running requests during the period 32
  • 33. Use Case Scenario: Slow queries (VIII) We examine the query plan of some of the queries and notice that they are complex, long-running reports 33
  • 34. Use Case Scenario: Slow queries (and IX) Possible solutions: Use Resource Manager to limit the maximum number of concurrent requests from the “Single Customer View” application, so they do not monopolize the data source Ask the administrators of the “Single Customer View” application to distribute the execution of reports in a longer time period Cache part of the data to distribute the workload Increase the size of the pool if the data source can support the workload 34
  • 36. Workload Management Ensure fair distribution of resources among applications / users Allocate available resources according to business priorities Sessions classified into groups according to criteria such as user/role, application, time,… Sessions are assigned to resource groups, which establish restrictions: Apply always or only under heavy CPU usage Example restrictions: Change execution priorities Max. number of concurrent queries Limit execution time 36
  • 37. ■ Create plan restrictions. Denodo Resource Manager (I)
  • 38. ■ Rules Classify Sessions into Groups (e.g. by user, application,…) ■ E.g. Sessions from application ‘single customer view’ are assigned to plan called ‘limit_concurrent_queries_to_10’ Denodo Resource Manager (II)
  • 39. Thanks! www.denodo.com [email protected] © Copyright Denodo Technologies. All rights reserved Unless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm, without prior the written authorization from Denodo Technologies.