SlideShare a Scribd company logo
Axibase Time Series Database
Axibase Time Series Database
2 Prepared by Axibase
Axibase Time-Series Database (ATSD) is a clustered non-relational database for the storage of
various information coming out of the IT infrastructure. ATSD is specifically designed to store and
analyze large amounts of statistical data collected at high frequency.
Database History
3 Prepared by Axibase
• 1970 – IBM introduced relational algebra for data processing.
• Cambrian explosion of relational database management systems:
• 2000 – first large-scale applications emerge, such as Google Search.
• 2004 – Google Big Table – first non-relational database using distributed file system.
• Currently we are experiencing Cambrian explosion of non-relational (a.k.a. NoSQL) databases:
Key Differences Between SQL and NoSQL
4 Prepared by Axibase
SQL NoSQL
High-level Programming Language SQL
Transactions
Query Optimizer
Non-key indexes
Key Differences Between SQL and NoSQL
5 Prepared by Axibase
SQL NoSQL
Scalability TB PB
Maximum Cluster Size 48 (Oracle RAC) 1000+
Distributed
Read Time
Depends on table size and
indexes
Linear
Write Time
Depends on table size and
indexes
Linear
Table Schema (column names,
data types)
Predetermined
Raw bytes. Schema
determined by application
How Proven Is NoSQL Technology
6 Prepared by Axibase
NoSQL is the leading technology behind big data applications.
• Google – search, gmail, AppEngine
• Yahoo/Microsoft – search
• Amazon – e-commerce, search, cloud computing (AWS DynamoDB)
• IBM Big Insights, Microsoft Azure HD Insight
Big Data Adoption
7 Prepared by Axibase
HBase behind Facebook Messages:
• 6+ billion messages per day
• 75+ billion R/W operations per day
• Peak throughput: 1.5 million R/W operations per second
• 2+ petabytes of data (6+ PB including replicas) with data growth of over 8 TB per day
Big Data Adoption
8 Prepared by Axibase
IBM BigInsights behind Vestas:
• A wind energy company in Denmark is reducing the time to analyze petabytes of data from
several weeks to 15 minutes to improve the accuracy of wind turbine placement.
• Stores 2.8 PB of company historical data together with over 178 external parameters:
temperature, barometric pressure, humidity, precipitation, wind direction, wind velocity etc.
• Stores precise data on weather over the past 11 years.
• Collects data from over 35,000 meteorological stations.
Big Data Adoption
9 Prepared by Axibase
HBase behind Explorys:
• Explorys uses HBase to enable search and analysis of patient populations, treatment protocols,
and clinical outcomes.
• Stores over 275 billion clinical, financial and operational data elements.
• 48 million unique patient files.
• Collecting data from over 340 hospitals and 300,000 healthcare providers.
• Pull data from 22 integrated major healthcare systems.
Axibase Time Series Database
10 Prepared by Axibase
Scalability & Speed
• Collects billions of samples per day. Retains detailed data forever.
Features
• Combines database, rule engine, and visualization in one product.
Analytical Rule Engine
• Applies aggregate functions and filters on streaming data.
Integration
• Accepts data from any source based on industry-standard protocols.
Visualization
• Built-in portals with smart widgets.
11 Prepared by Axibase
Big Data for IT Monitoring
12 Prepared by Axibase
• Retain detailed data forever.
• Collect statistics at high-frequency, for example every 15 seconds.
• Consolidate performance statistics from all systems into one database: facilities, network,
storage, servers, applications, databases, transactions, service providers, user activity etc.
• Monitor infrastructure based on abnormal deviations instead of manual thresholds.
• Apply statistical formulas to predict outages.
• Take advantage of schema-less database to collect data from any source.
Big Data for Developers
13 Prepared by Axibase
• Support for annotation-style instrumentation.
• Alternative to byte-code instrumentation and
file logging.
• Collect detailed performance and usage
statistics for reporting and analytics, without
writing custom monitors.
Big Data for Operations
14 Prepared by Axibase
• Gather and analyze statistical data generated by the various systems and sensors.
• Analytics that can support decision control systems.
• Allows for better real‐time operations decision‐support.
• Generate accurate forecasts of upcoming issues:
• Delays
• Scheduled maintenance based on product usage and sensor data instead of warranty
periods
• Improved customer service times and standards.
ATSD Architecture
15 Prepared by Axibase
• ATSD architecture combines database,
analytics and reporting tools into one
complete product.
• Data locality makes analytics run faster.
• Application server layer is simplified to
provide core shared services
ATSD Components
16 Prepared by Axibase
• Pluggable driver provides support for
different storage engines
• Compute, persistence and data
collection layers scaled independently
Fault Tolerance
17 Prepared by Axibase
• ATSD is a distributed system,
with high fault tolerance.
• Each data sample is
automatically replicated 3
times for recovery.
ATSD Scalability
18 Prepared by Axibase
• ATSD is a distributed, non-relational database with high throughput, fault tolerance and reading
speed.
• ATSD can collect billions of metrics per day and store petabytes of data.
• ATSD supports millisecond resolution and sampling intervals of up to several measurements per
second. The data is stored without losing accuracy.
• Additional nodes can be added at runtime to handle increasing volumes. ATSD automatically
distributes the table across active nodes.
• New nodes can be added in remote data centers to minimize network traffic.
Supported Data Types
19 Prepared by Axibase
• Two types of data ingestion: push and pull.
• ATSD supports numeric values, log messages and properties (collection of key-values).
• ATSD uses collectors for retrieving structured and unstructured data from remote sources.
• Support for standard protocols: Telnet, ICMP, CSV/TSV, FILE, JMX, HTTP, and JSON.
Data Collection
20 Prepared by Axibase
• Collection is agentless; data is pushed by external systems into ATSD.
• New metrics are auto-registered. No need to update schema or restart any server components.
• Existing monitoring tools can be instrumented to stream data into ATSD.
• Each data sample can be tagged (key = value) at source for subsequent querying, aggregations,
and roll-ups.
Data Storage
21 Prepared by Axibase
• Built-in data compression provides 70%-80% disk space savings over raw data.
• No data needs to be deleted. Seek time is almost linear regardless of the dataset size.
• Data storage is sparse and efficient. ATSD stores only what is collected instead of long rows with
NULLs or zeros, as is the case in relational model.
• VMware VMFS-attached disks are sufficient for small to medium clusters.
• Direct attached disks with JBOD are recommended for larger clusters.
• JBOD alternatives to minimize node recovery time are available from leading storage vendors,
such as NetApp E-Series.
Built-in Instruments
22 Prepared by Axibase
Unlike conventional data warehouses, ATSD comes with a set of built-in tools for data analysis:
• Analytical Rule Engine
• Forecasting
• Visualization
Analytical Rule Engine
23 Prepared by Axibase
• Evaluates incoming data in memory based on statistical rules.
• Statistical rules are applied to the incoming data stream before data is
stored on disk.
• As data is ingested by ATSD server, a subset of samples that match rule
queries are routed to the rule engine for processing.
• Rule Engine supports both time- and count- based data windows.
• Rule expressions and filters can reference not just numeric values but also
tags such as system type, location, priority to ensure that alerts are raised
only for critical issues.
• Multiple metrics and entities can be correlated within the same rule.
Analytical Rule Engine – Rule Examples
24 Prepared by Axibase
Type Window Example Description
threshold none value > 75 Raise an alert if last metric value exceeds threshold
range none value > 50 AND value <= 75 Raise an alert if value is outside of specified range
statistical-count count(10) avg(value) > 75 Raise an alert if average value of the last 10 samples exceeds threshold
statistical-time time('15 min') avg(value) > 75 Raise an alert if average value for the last 15 minutes exceeds threshold
statistical-deviation time('15 min') avg(value) / avg(value(time: '1 hour')) >
1.25
Raise an alert if 15-minute average exceeds 1-hour average by more than 25%
statistical-ungrouped time('15 min') avg(value) > 75 Raise an alert if 15-minute average values for all entities in the group exceeds threshold
metric correlation time('15 min') avg(value) > 75 AND avg(value(metric:
'loadavg.1m')) > 0.5
Raise an alert if average values for two separate metrics for the last 15 minutes exceed predefined
thresholds
entity correlation time('15 min') avg(value) > 75 AND avg(value(entity:
'host2')) > 75
Raise an alert if average values for two entities for the last 15 minutes exceed thresholds
threshold override time('15 min') avg(value) >= entity.groupTag('cpu
_avg').min()
Raise an alert if 15-minute average value exceeds minimum threshold specified for groups to which
the entity belongs
cpu forecast deviation time('5 min') abs(forecast_deviation(wavg())) > 2 Raise an alert if 5-minute average deviates from forecast by more than two standard deviations
cpu forecast diff time('10 min') abs(wavg() - forecast()) > 25 Raise alert if absolute forecast deviates from average by more than specified value
disk threshold time('15 min') new_maximum() &&
threshold_linear_time(99) < 120
Raise alert if last value is the highest observed and linear threshold is expected to violate the 99%
threshold in less than 120 minutes
Analytical Rule Engine
25 Prepared by Axibase
Analytical Rule Engine
26 Prepared by Axibase
Forecasting
27 Prepared by Axibase
• Customers have a growing need to predict problems before they occur. The accuracy of
predictions and the percentage of false positives/negatives highly depends on the frequency of
data collection, the retention interval, and algorithms.
• The use of built-in autoregressive time-series extrapolation algorithms (Holt-Winters, ARIMA,
etc.) in ATSD allows predicting of system failures at early stages.
• The forecasting process is resource intensive and is most effective in a clustered system with
data locality such as ATSD.
• Dynamic predictions eliminate the need to set manual thresholds.
Forecasting Example
28 Prepared by Axibase
Forecasting Example
29 Prepared by Axibase
Forecast Settings
30 Prepared by Axibase
• ATSD selects the most accurate
forecasting algorithm for each
time-series separately based on a
ranking system.
• The winning algorithm is used to
compute forecast for the next day,
week or month.
• Pre-computed forecasts can be
used in rule engine.
Forecast Settings
31 Prepared by Axibase
Visualization
32 Prepared by Axibase
• ATSD can be integrated with Axibase Enterprise Reporting using the ATSD adapter
• ATSD comes with a wide variety of widgets for creating interactive portals directly in ATSD.
• ATSD widgets are designed from the ground-up to handle large data sets and calculations on the
client.
• ATSD visualization is supported on mobile devices and Smart TVs.
Visualization
33 Prepared by Axibase
Search
34 Prepared by Axibase
• Implemented in ATSD is log file search system to detect problems in distributed systems for the
purposes of security, audit and change control.
Notifications
• Supports standard notification mechanisms: email, console, web service, and notification in the
environment.
• For example, Axibase LED lighting system - the "Data Cube", which changes colors depending on
the status of IT services.
ATSD Benefits
35 Prepared by Axibase
• Enables customers to extract value from data that already exists in their operational and IT
infrastructures.
• Delivers preemptive monitoring through identification of abnormal behaviors in production
systems.
• Eliminates most manually-defined rules from the customer’s monitoring catalog.
• Serves as a centralized repository for historical data.
• Directly supported by AER for Dashboards, Reports, Capacity Planning
System Requirements
36 Prepared by Axibase
• Operating Systems:
• Red Hat Enterprise Linux 5.6+
• Ubuntu 12.04+
• Suse Linux Enterprise Server 10+
• Computing Hardware:
Edition Community - FREE Standard Enterprise
ATSD Nodes 1 1 + 1 > 5
Processors 2 vCPU, 2+ GHz 4 vCPU, 2+ GHz 4 vCPU, 2+ GHz
Memory 4 GB (2GB for JVM) 16 GB (8GB for JVM) 16 GB (8GB for JVM)
Use Cases
37 Prepared by Axibase
• ITM long-term history extension
• nmon reporting for AIX, Linux and Solaris
• Minimize exceptions in monitoring catalog
• Collect environmental data from SCADA
• Predictive Maintenance – based on sensors
ITM History Extension
38 Prepared by Axibase
• ITM can be instrumented to write streaming data into CSV files.
• CSV can be instantly uploaded into ATSD using inotify utility and wget.
• Example: private history streaming in ITM
• KHD_CSV_OUTPUT_ACTIVATE = Y
ITM History Extension
39 Prepared by Axibase
• Warehouse Proxy Agent is setup to save history data to CSV file
on the local machine.
• ATSD ingests the CSV files for analytics and long-term storage.
• ATSD converts the data using built in parsers.
nmon Reporting
40 Prepared by Axibase
• Consolidate trusted statistics from UNIX systems in one database
• ATSD is able to collect, parse and analyze nmon files
• Analyze nmon data with forecasting algorithms
• Capitalize on nmon data with two predefined visualization portals or easily create your own
portals using built-in HTML5 widgets
nmon Predefined Portals
41 Prepared by Axibase
42 Prepared by Axibase
Predefined AIX Portal
43 Prepared by Axibase
Predefined Linux Portal
44 Prepared by Axibase
Contact Axibase
Axibase Contact Details:
• General - 408.973.7897
• Fax - 408.725.8885
• Email - sales@axibase.com
Our headquarters are located in Cupertino, Silicon Valley:
• 19925 Stevens Creek Blvd. Cupertino, CA 95014 USA

More Related Content

PDF
Enhanced Data Visualization provided for 200,000 Machines with OpenTSDB and C...
YASH Technologies
 
PDF
Enhanced Data Visualization provided for 200,000 Machines with OpenTSDB and ...
YASH Technologies
 
PPTX
Data Modeling IoT and Time Series data in NoSQL
Basho Technologies
 
PPTX
Using druid for interactive count distinct queries at scale
Itai Yaffe
 
PPTX
A Zen Journey to Database Management
Basho Technologies
 
PDF
Druid
Dori Waldman
 
PPTX
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
DataWorks Summit
 
PPTX
The key to unlocking the Value in the IoT? Managing the Data!
DataWorks Summit/Hadoop Summit
 
Enhanced Data Visualization provided for 200,000 Machines with OpenTSDB and C...
YASH Technologies
 
Enhanced Data Visualization provided for 200,000 Machines with OpenTSDB and ...
YASH Technologies
 
Data Modeling IoT and Time Series data in NoSQL
Basho Technologies
 
Using druid for interactive count distinct queries at scale
Itai Yaffe
 
A Zen Journey to Database Management
Basho Technologies
 
Transforming and Scaling Large Scale Data Analytics: Moving to a Cloud-based ...
DataWorks Summit
 
The key to unlocking the Value in the IoT? Managing the Data!
DataWorks Summit/Hadoop Summit
 

What's hot (20)

PDF
A Study Review of Common Big Data Architecture for Small-Medium Enterprise
Ridwan Fadjar
 
PDF
codecentric AG: CQRS and Event Sourcing Applications with Cassandra
DataStax Academy
 
PPTX
Building Scalable Aggregation Systems
Jared Winick
 
PDF
What's new in MariaDB TX 3.0
MariaDB plc
 
PPSX
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
Dataconomy Media
 
PPTX
Future Grid Overview 2018
Chris J Law
 
PPSX
The Analytics Data Store: Information Supply Framework
Martyn Richard Jones
 
PDF
Data lake analytics for the admin
Tillmann Eitelberg
 
PDF
Proofpoint: Fraud Detection and Security on Social Media
DataStax Academy
 
PDF
Bigdata Hadoop project payment gateway domain
Kamal A
 
PDF
Can My Inventory Survive Eventual Consistency?
DataStax
 
PDF
Lecture4 big data technology foundations
hktripathy
 
PPTX
Introducing Cloudian HyperStore 6.0
Cloudian
 
PPTX
Breakout: Hadoop and the Operational Data Store
Cloudera, Inc.
 
PDF
Power Your Delta Lake with Streaming Transactional Changes
Databricks
 
PDF
Cloudian HyperStore Features and Benefits
Cloudian
 
PPTX
Microsoft R - ScaleR Overview
Khalid Salama
 
PPTX
4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...
PROIDEA
 
PPTX
Informix - The Ideal Database for IoT
Pradeep Natarajan
 
A Study Review of Common Big Data Architecture for Small-Medium Enterprise
Ridwan Fadjar
 
codecentric AG: CQRS and Event Sourcing Applications with Cassandra
DataStax Academy
 
Building Scalable Aggregation Systems
Jared Winick
 
What's new in MariaDB TX 3.0
MariaDB plc
 
"Quantum Clustering - Physics Inspired Clustering Algorithm", Sigalit Bechler...
Dataconomy Media
 
Future Grid Overview 2018
Chris J Law
 
The Analytics Data Store: Information Supply Framework
Martyn Richard Jones
 
Data lake analytics for the admin
Tillmann Eitelberg
 
Proofpoint: Fraud Detection and Security on Social Media
DataStax Academy
 
Bigdata Hadoop project payment gateway domain
Kamal A
 
Can My Inventory Survive Eventual Consistency?
DataStax
 
Lecture4 big data technology foundations
hktripathy
 
Introducing Cloudian HyperStore 6.0
Cloudian
 
Breakout: Hadoop and the Operational Data Store
Cloudera, Inc.
 
Power Your Delta Lake with Streaming Transactional Changes
Databricks
 
Cloudian HyperStore Features and Benefits
Cloudian
 
Microsoft R - ScaleR Overview
Khalid Salama
 
4Developers 2018: Przetwarzanie Big Data w oparciu o architekturę Lambda na p...
PROIDEA
 
Informix - The Ideal Database for IoT
Pradeep Natarajan
 
Ad

Viewers also liked (6)

PPTX
Building a citizen sensor network in windows azure
Richard Conway
 
PPS
SQL & NoSQL
Ahmad Awsaf-uz-zaman
 
PPTX
Joining Large data at Scale
Sigmoid
 
PDF
Time series database by Harshil Ambagade
Sigmoid
 
PDF
Visualization of big time series data
Rob Hyndman
 
PDF
Chronix Time Series Database - The New Time Series Kid on the Block
QAware GmbH
 
Building a citizen sensor network in windows azure
Richard Conway
 
Joining Large data at Scale
Sigmoid
 
Time series database by Harshil Ambagade
Sigmoid
 
Visualization of big time series data
Rob Hyndman
 
Chronix Time Series Database - The New Time Series Kid on the Block
QAware GmbH
 
Ad

Similar to Axibase Time Series Database (20)

PPTX
Microsoft Azure Big Data Analytics
Mark Kromer
 
PPTX
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
Maya Lumbroso
 
PPTX
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
Dataconomy Media
 
PDF
Case study on redundant enterprise energy management system - AVReporter
Avreporter KONsys
 
PPTX
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Dataconomy Media
 
PPTX
Big Data Berlin v8.0 Stream Processing with Apache Apex
Apache Apex
 
PPTX
How to get Real-Time Value from your IoT Data - Datastax
DataStax
 
PDF
[db tech showcase Tokyo 2017] C37: MariaDB ColumnStore analytics engine : use...
Insight Technology, Inc.
 
PPT
SmartCloud Monitoring and Capacity Planning
IBM Danmark
 
PDF
Streaming Analytics Unit 1 notes for engineers
ManjuAppukuttan2
 
PDF
Re-Engineering PostgreSQL as a Time-Series Database
All Things Open
 
PDF
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...
Edgar Alejandro Villegas
 
PDF
AWS를 활용한 첫 빅데이터 프로젝트 시작하기(김일호)- AWS 웨비나 시리즈 2015
Amazon Web Services Korea
 
PDF
Logging, Metrics, and APM: The Operations Trifecta
Elasticsearch
 
PDF
Infrastructure monitoring made easy, from ingest to insight
Elasticsearch
 
PPTX
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
Mike Rossi
 
PPTX
Sizing MongoDB Clusters
MongoDB
 
PDF
Using the Open Science Data Cloud for Data Science Research
Robert Grossman
 
PDF
Why Gateways are Important in Your IoT Architecture
IBM Analytics
 
PPTX
Lessons From HPE: From Batch To Streaming For 20 Billion Sensors With Lightbe...
Lightbend
 
Microsoft Azure Big Data Analytics
Mark Kromer
 
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
Maya Lumbroso
 
"An introduction to Kx Technology - a Big Data solution", Kyra Coyne, Data Sc...
Dataconomy Media
 
Case study on redundant enterprise energy management system - AVReporter
Avreporter KONsys
 
Thomas Weise, Apache Apex PMC Member and Architect/Co-Founder, DataTorrent - ...
Dataconomy Media
 
Big Data Berlin v8.0 Stream Processing with Apache Apex
Apache Apex
 
How to get Real-Time Value from your IoT Data - Datastax
DataStax
 
[db tech showcase Tokyo 2017] C37: MariaDB ColumnStore analytics engine : use...
Insight Technology, Inc.
 
SmartCloud Monitoring and Capacity Planning
IBM Danmark
 
Streaming Analytics Unit 1 notes for engineers
ManjuAppukuttan2
 
Re-Engineering PostgreSQL as a Time-Series Database
All Things Open
 
Best Practices – Extreme Performance with Data Warehousing on Oracle Databa...
Edgar Alejandro Villegas
 
AWS를 활용한 첫 빅데이터 프로젝트 시작하기(김일호)- AWS 웨비나 시리즈 2015
Amazon Web Services Korea
 
Logging, Metrics, and APM: The Operations Trifecta
Elasticsearch
 
Infrastructure monitoring made easy, from ingest to insight
Elasticsearch
 
Supercharging Smart Meter BIG DATA Analytics with Microsoft Azure Cloud- SRP ...
Mike Rossi
 
Sizing MongoDB Clusters
MongoDB
 
Using the Open Science Data Cloud for Data Science Research
Robert Grossman
 
Why Gateways are Important in Your IoT Architecture
IBM Analytics
 
Lessons From HPE: From Batch To Streaming For 20 Billion Sensors With Lightbe...
Lightbend
 

Recently uploaded (20)

PDF
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
PPTX
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
PPTX
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
PPTX
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
PPT
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
PDF
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
PPTX
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
PDF
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
PPTX
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
PPTX
short term internship project on Data visualization
JMJCollegeComputerde
 
PPTX
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
PPTX
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
PPTX
INFO8116 -Big data architecture and analytics
guddipatel10
 
PPTX
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
PDF
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
PPTX
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
PDF
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
PPTX
Presentation on animal welfare a good topic
kidscream385
 
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
short term internship project on Data visualization
JMJCollegeComputerde
 
Data Security Breach: Immediate Action Plan
varmabhuvan266
 
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
INFO8116 -Big data architecture and analytics
guddipatel10
 
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
Data-Driven Machine Learning for Rail Infrastructure Health Monitoring
Sione Palu
 
202501214233242351219 QASS Session 2.pdf
lauramejiamillan
 
Presentation on animal welfare a good topic
kidscream385
 

Axibase Time Series Database

  • 2. Axibase Time Series Database 2 Prepared by Axibase Axibase Time-Series Database (ATSD) is a clustered non-relational database for the storage of various information coming out of the IT infrastructure. ATSD is specifically designed to store and analyze large amounts of statistical data collected at high frequency.
  • 3. Database History 3 Prepared by Axibase • 1970 – IBM introduced relational algebra for data processing. • Cambrian explosion of relational database management systems: • 2000 – first large-scale applications emerge, such as Google Search. • 2004 – Google Big Table – first non-relational database using distributed file system. • Currently we are experiencing Cambrian explosion of non-relational (a.k.a. NoSQL) databases:
  • 4. Key Differences Between SQL and NoSQL 4 Prepared by Axibase SQL NoSQL High-level Programming Language SQL Transactions Query Optimizer Non-key indexes
  • 5. Key Differences Between SQL and NoSQL 5 Prepared by Axibase SQL NoSQL Scalability TB PB Maximum Cluster Size 48 (Oracle RAC) 1000+ Distributed Read Time Depends on table size and indexes Linear Write Time Depends on table size and indexes Linear Table Schema (column names, data types) Predetermined Raw bytes. Schema determined by application
  • 6. How Proven Is NoSQL Technology 6 Prepared by Axibase NoSQL is the leading technology behind big data applications. • Google – search, gmail, AppEngine • Yahoo/Microsoft – search • Amazon – e-commerce, search, cloud computing (AWS DynamoDB) • IBM Big Insights, Microsoft Azure HD Insight
  • 7. Big Data Adoption 7 Prepared by Axibase HBase behind Facebook Messages: • 6+ billion messages per day • 75+ billion R/W operations per day • Peak throughput: 1.5 million R/W operations per second • 2+ petabytes of data (6+ PB including replicas) with data growth of over 8 TB per day
  • 8. Big Data Adoption 8 Prepared by Axibase IBM BigInsights behind Vestas: • A wind energy company in Denmark is reducing the time to analyze petabytes of data from several weeks to 15 minutes to improve the accuracy of wind turbine placement. • Stores 2.8 PB of company historical data together with over 178 external parameters: temperature, barometric pressure, humidity, precipitation, wind direction, wind velocity etc. • Stores precise data on weather over the past 11 years. • Collects data from over 35,000 meteorological stations.
  • 9. Big Data Adoption 9 Prepared by Axibase HBase behind Explorys: • Explorys uses HBase to enable search and analysis of patient populations, treatment protocols, and clinical outcomes. • Stores over 275 billion clinical, financial and operational data elements. • 48 million unique patient files. • Collecting data from over 340 hospitals and 300,000 healthcare providers. • Pull data from 22 integrated major healthcare systems.
  • 10. Axibase Time Series Database 10 Prepared by Axibase Scalability & Speed • Collects billions of samples per day. Retains detailed data forever. Features • Combines database, rule engine, and visualization in one product. Analytical Rule Engine • Applies aggregate functions and filters on streaming data. Integration • Accepts data from any source based on industry-standard protocols. Visualization • Built-in portals with smart widgets.
  • 11. 11 Prepared by Axibase
  • 12. Big Data for IT Monitoring 12 Prepared by Axibase • Retain detailed data forever. • Collect statistics at high-frequency, for example every 15 seconds. • Consolidate performance statistics from all systems into one database: facilities, network, storage, servers, applications, databases, transactions, service providers, user activity etc. • Monitor infrastructure based on abnormal deviations instead of manual thresholds. • Apply statistical formulas to predict outages. • Take advantage of schema-less database to collect data from any source.
  • 13. Big Data for Developers 13 Prepared by Axibase • Support for annotation-style instrumentation. • Alternative to byte-code instrumentation and file logging. • Collect detailed performance and usage statistics for reporting and analytics, without writing custom monitors.
  • 14. Big Data for Operations 14 Prepared by Axibase • Gather and analyze statistical data generated by the various systems and sensors. • Analytics that can support decision control systems. • Allows for better real‐time operations decision‐support. • Generate accurate forecasts of upcoming issues: • Delays • Scheduled maintenance based on product usage and sensor data instead of warranty periods • Improved customer service times and standards.
  • 15. ATSD Architecture 15 Prepared by Axibase • ATSD architecture combines database, analytics and reporting tools into one complete product. • Data locality makes analytics run faster. • Application server layer is simplified to provide core shared services
  • 16. ATSD Components 16 Prepared by Axibase • Pluggable driver provides support for different storage engines • Compute, persistence and data collection layers scaled independently
  • 17. Fault Tolerance 17 Prepared by Axibase • ATSD is a distributed system, with high fault tolerance. • Each data sample is automatically replicated 3 times for recovery.
  • 18. ATSD Scalability 18 Prepared by Axibase • ATSD is a distributed, non-relational database with high throughput, fault tolerance and reading speed. • ATSD can collect billions of metrics per day and store petabytes of data. • ATSD supports millisecond resolution and sampling intervals of up to several measurements per second. The data is stored without losing accuracy. • Additional nodes can be added at runtime to handle increasing volumes. ATSD automatically distributes the table across active nodes. • New nodes can be added in remote data centers to minimize network traffic.
  • 19. Supported Data Types 19 Prepared by Axibase • Two types of data ingestion: push and pull. • ATSD supports numeric values, log messages and properties (collection of key-values). • ATSD uses collectors for retrieving structured and unstructured data from remote sources. • Support for standard protocols: Telnet, ICMP, CSV/TSV, FILE, JMX, HTTP, and JSON.
  • 20. Data Collection 20 Prepared by Axibase • Collection is agentless; data is pushed by external systems into ATSD. • New metrics are auto-registered. No need to update schema or restart any server components. • Existing monitoring tools can be instrumented to stream data into ATSD. • Each data sample can be tagged (key = value) at source for subsequent querying, aggregations, and roll-ups.
  • 21. Data Storage 21 Prepared by Axibase • Built-in data compression provides 70%-80% disk space savings over raw data. • No data needs to be deleted. Seek time is almost linear regardless of the dataset size. • Data storage is sparse and efficient. ATSD stores only what is collected instead of long rows with NULLs or zeros, as is the case in relational model. • VMware VMFS-attached disks are sufficient for small to medium clusters. • Direct attached disks with JBOD are recommended for larger clusters. • JBOD alternatives to minimize node recovery time are available from leading storage vendors, such as NetApp E-Series.
  • 22. Built-in Instruments 22 Prepared by Axibase Unlike conventional data warehouses, ATSD comes with a set of built-in tools for data analysis: • Analytical Rule Engine • Forecasting • Visualization
  • 23. Analytical Rule Engine 23 Prepared by Axibase • Evaluates incoming data in memory based on statistical rules. • Statistical rules are applied to the incoming data stream before data is stored on disk. • As data is ingested by ATSD server, a subset of samples that match rule queries are routed to the rule engine for processing. • Rule Engine supports both time- and count- based data windows. • Rule expressions and filters can reference not just numeric values but also tags such as system type, location, priority to ensure that alerts are raised only for critical issues. • Multiple metrics and entities can be correlated within the same rule.
  • 24. Analytical Rule Engine – Rule Examples 24 Prepared by Axibase Type Window Example Description threshold none value > 75 Raise an alert if last metric value exceeds threshold range none value > 50 AND value <= 75 Raise an alert if value is outside of specified range statistical-count count(10) avg(value) > 75 Raise an alert if average value of the last 10 samples exceeds threshold statistical-time time('15 min') avg(value) > 75 Raise an alert if average value for the last 15 minutes exceeds threshold statistical-deviation time('15 min') avg(value) / avg(value(time: '1 hour')) > 1.25 Raise an alert if 15-minute average exceeds 1-hour average by more than 25% statistical-ungrouped time('15 min') avg(value) > 75 Raise an alert if 15-minute average values for all entities in the group exceeds threshold metric correlation time('15 min') avg(value) > 75 AND avg(value(metric: 'loadavg.1m')) > 0.5 Raise an alert if average values for two separate metrics for the last 15 minutes exceed predefined thresholds entity correlation time('15 min') avg(value) > 75 AND avg(value(entity: 'host2')) > 75 Raise an alert if average values for two entities for the last 15 minutes exceed thresholds threshold override time('15 min') avg(value) >= entity.groupTag('cpu _avg').min() Raise an alert if 15-minute average value exceeds minimum threshold specified for groups to which the entity belongs cpu forecast deviation time('5 min') abs(forecast_deviation(wavg())) > 2 Raise an alert if 5-minute average deviates from forecast by more than two standard deviations cpu forecast diff time('10 min') abs(wavg() - forecast()) > 25 Raise alert if absolute forecast deviates from average by more than specified value disk threshold time('15 min') new_maximum() && threshold_linear_time(99) < 120 Raise alert if last value is the highest observed and linear threshold is expected to violate the 99% threshold in less than 120 minutes
  • 25. Analytical Rule Engine 25 Prepared by Axibase
  • 26. Analytical Rule Engine 26 Prepared by Axibase
  • 27. Forecasting 27 Prepared by Axibase • Customers have a growing need to predict problems before they occur. The accuracy of predictions and the percentage of false positives/negatives highly depends on the frequency of data collection, the retention interval, and algorithms. • The use of built-in autoregressive time-series extrapolation algorithms (Holt-Winters, ARIMA, etc.) in ATSD allows predicting of system failures at early stages. • The forecasting process is resource intensive and is most effective in a clustered system with data locality such as ATSD. • Dynamic predictions eliminate the need to set manual thresholds.
  • 30. Forecast Settings 30 Prepared by Axibase • ATSD selects the most accurate forecasting algorithm for each time-series separately based on a ranking system. • The winning algorithm is used to compute forecast for the next day, week or month. • Pre-computed forecasts can be used in rule engine.
  • 32. Visualization 32 Prepared by Axibase • ATSD can be integrated with Axibase Enterprise Reporting using the ATSD adapter • ATSD comes with a wide variety of widgets for creating interactive portals directly in ATSD. • ATSD widgets are designed from the ground-up to handle large data sets and calculations on the client. • ATSD visualization is supported on mobile devices and Smart TVs.
  • 34. Search 34 Prepared by Axibase • Implemented in ATSD is log file search system to detect problems in distributed systems for the purposes of security, audit and change control. Notifications • Supports standard notification mechanisms: email, console, web service, and notification in the environment. • For example, Axibase LED lighting system - the "Data Cube", which changes colors depending on the status of IT services.
  • 35. ATSD Benefits 35 Prepared by Axibase • Enables customers to extract value from data that already exists in their operational and IT infrastructures. • Delivers preemptive monitoring through identification of abnormal behaviors in production systems. • Eliminates most manually-defined rules from the customer’s monitoring catalog. • Serves as a centralized repository for historical data. • Directly supported by AER for Dashboards, Reports, Capacity Planning
  • 36. System Requirements 36 Prepared by Axibase • Operating Systems: • Red Hat Enterprise Linux 5.6+ • Ubuntu 12.04+ • Suse Linux Enterprise Server 10+ • Computing Hardware: Edition Community - FREE Standard Enterprise ATSD Nodes 1 1 + 1 > 5 Processors 2 vCPU, 2+ GHz 4 vCPU, 2+ GHz 4 vCPU, 2+ GHz Memory 4 GB (2GB for JVM) 16 GB (8GB for JVM) 16 GB (8GB for JVM)
  • 37. Use Cases 37 Prepared by Axibase • ITM long-term history extension • nmon reporting for AIX, Linux and Solaris • Minimize exceptions in monitoring catalog • Collect environmental data from SCADA • Predictive Maintenance – based on sensors
  • 38. ITM History Extension 38 Prepared by Axibase • ITM can be instrumented to write streaming data into CSV files. • CSV can be instantly uploaded into ATSD using inotify utility and wget. • Example: private history streaming in ITM • KHD_CSV_OUTPUT_ACTIVATE = Y
  • 39. ITM History Extension 39 Prepared by Axibase • Warehouse Proxy Agent is setup to save history data to CSV file on the local machine. • ATSD ingests the CSV files for analytics and long-term storage. • ATSD converts the data using built in parsers.
  • 40. nmon Reporting 40 Prepared by Axibase • Consolidate trusted statistics from UNIX systems in one database • ATSD is able to collect, parse and analyze nmon files • Analyze nmon data with forecasting algorithms • Capitalize on nmon data with two predefined visualization portals or easily create your own portals using built-in HTML5 widgets
  • 41. nmon Predefined Portals 41 Prepared by Axibase
  • 42. 42 Prepared by Axibase Predefined AIX Portal
  • 43. 43 Prepared by Axibase Predefined Linux Portal
  • 44. 44 Prepared by Axibase Contact Axibase Axibase Contact Details: • General - 408.973.7897 • Fax - 408.725.8885 • Email - [email protected] Our headquarters are located in Cupertino, Silicon Valley: • 19925 Stevens Creek Blvd. Cupertino, CA 95014 USA