Dean Sheehan
Snr Director Sales Engineering
InfluxEnterprise
Architectural Patterns
What we will be
covering
✓ Enterprise Overview
✓ Other Features
✓ Ingestion & Query Rates
✓ Deployment Examples
✓ Replications Patterns
✓ General Advice
Why InfluxEnterprise?
Signs You’re Ready for InfluxEnterprise
6. Your CPU average is >=70%
1. The sales team starts calling you on weekends
5. Increasing throughput causing write drops errors
4. Sprawling number of single node deployments
3. Vertical scaling not providing further benefit
2. Data recording and availability matters
InfluxEnterprise
• Open Source Core
• High Availability
• Horizonal Scalability
• Enterprise Security
• Support from InfluxData
• OnPremise/Cloud Deployment Options
What Problem Are You
Trying to Solve?
What are you dealing with?
• Metrics
• Events
• Log Data
• Sensors
• Apps
• Servers
• Long-Term Storage
• Vendor Replacement
• Time-Series Alerts
• Visualization
• Network Data
• Custom Solution
• Real-Time Analytics
• Virtualization Monitoring
• Managed Service (InfluxCloud)
InfluxEnterprise Overview
InfluxEnterprise Cluster Architecture: Meta Nodes
InfluxEnterprise Clustering: Data Nodes
InfluxEnterprise Cluster Architecture
InfluxEnterprise Cluster Architecture
Features
Security
• LDAP Support
– Enterprise customers can configure the database to use LDAP as a backing
authentication source for users, roles and permissions.
– Connection between DB and LDAP server secured once connected
• Fine-grained authorization
– Used to control access at a measurement or series level
(compared to limiting access at the database level)
– Enable authentication in your configuration file
– Create users through the query API
– Grant users explicit read and/or write privileges
– Set restrictions which define a combination of database, measurement, and
tags which cannot be accessed without an explicit grant
© 2018 InfluxData. All rights reserved.© 2017 InfluxData. All rights reserved.
Eventual Consistency
• Anti-Entropy Service
– Expands on capabilities to
detect and copy full shards
– Now allows for detection and
repair of inconsistent shards
• Hinted-Handoff Queue
– Queue inbound points
destined to land on other
nodes in the cluster which
may currently be down
– Stored by node and shard
(10GB - default)
Backup and Restore
• Useful for: Disaster recovery, Debugging, Restoring clusters to a
consistent state
• What it does: Creates a copy of the metastore and the shard data
• Backup is compressed and is not human readable
• Export is not compressed but is human readable
• OSS and InfluxEnterprise ARE NOW compatible – aka portable
• Full or partial backup options
• Move data into a new database (with new Retention Policies, etc)
Ingestion & Query Rates
Cluster
Data Node 1 Data Node 2 Data Node 3 Data Node 4
Database X (Replication=1)
Shard 1 Shard 2 Shard 3 Shard 4
a b c d
X ≈ 4x ingest rate
≤ 1x concurrent query rate
a b c d
Cluster
Data Node 1 Data Node 2 Data Node 3 Data Node 4
Database X (Replication=4)
Shard 1 Shard 2 Shard 3 Shard 4
a a’ a’’ a’’’
X ≈ 1x ingest rate
replication
≈ 4x concurrent query rate
a
b
b b’ b’’ b’’’
Cluster
Data Node 1 Data Node 2 Data Node 3 Data Node 4
Database X (Replication=2)
Shard 1 Shard 2 Shard 3 Shard 4
a a’ b b’
X ≈ 2x ingest rate
replication replication
≤ 2x concurrent query rate
a b
Deployment Examples
How does
InfluxEnterprise Fit?
Example 1: Mothership
Data Center 1
Kapacitor
Telegraf InfluxDB
Ent
Enterprise Cluster
Data Node 1
Data Node 2
Data Node 3
Data Node n
Firewall/
LoadBalancer
Telegraf
Telegraf
Chronograf
Chronograf Kapacitor
Data Center 2
Kapacitor
Telegraf InfluxDB
EntTelegraf
Telegraf
Chronograf
Example 2: Durable Data Ingest
Telegraf Cluster
Telegraf
or other
source
Kafka
Queue
LoadBalancer
InfluxDB
Cluster
Telegraf
or other
source
Telegraf
or other
source
Telegraf
or other
sources
Telegraf
Telegraf
Telegraf
Telegraf
Put each Telegraf instance in
the same Kafka Consumer
Group
How Fast is Fast?
(eg): Six datanodes at 2.5M values per
second
Example 3: Influx with ElasticSearch
InfluxDB
Cluster
• Discover trends before and during the Error from metrics
• Perform Root Cause Analysis from Logs
LoadBalancer
Telegraf
ElasticSearch
Include common
Session ID
or other UID
Kapacitor
You
Metrics
Logs
Query using the common
Session ID or UID received
form Alert
Replication Patterns
How Are You Using InfluxDB?
Data Replication
Generally there are two types of data that we care about replicating:
• New Data – Data which is coming form our raw sources
• Derived Data – The output of a SELECT INTO, or TICK script
Replication of New Data – Pattern 1
Cluster 1
Firewall/
Load Balancer
Data Node 1
Data Node 2
Data Node 3
Data Node n
Telegraf
Cluster 2
Firewall/
Load Balancer
Data Node 1
Data Node 2
Data Node 3
Data Node n
Telegraf
Replication of New Data – Pattern 2
Cluster 1
Firewall/
Load Balancer
Data Node 1
Data Node 2
Data Node 3
Data Node n
Telegraf
Cluster 2
Firewall/
Load Balancer
Data Node 1
Data Node 2
Data Node 3
Data Node n
Telegraf
Kafka Queue
Replication of Derived Data – Pattern 3
Cluster 1
Load Balancer
Cluster 2
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Kapacitor
Load Balancer
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Data Nodes
Kapacitor
Uses output of
Kapacitor to other
cluster
Telegraf Telegraf
General Advice
General Cluster Advice
• Batch your writes!
• The number of data nodes should be a multiple of your replication
factor
• Use a single node of InfluxDB to monitor your cluster
• Put a load balancer in front of each of your data nodes
• Higher replication factors result in higher query concurrency, but
higher write latency.
• Use Fine Grained Authorization instead of multiple databases
Thank You!
InfluxEnterprise Architectural Patterns by Dean Sheehan, Senior Director, Pre & Post Sales | InfluxData

More Related Content

PDF
How to Enable Industrial Decarbonization with Node-RED and InfluxDB
PPTX
InfluxEnterprise Architecture Patterns by Tim Hall & Sam Dillard
PDF
From a Time-Series Database to a Key Operational Technology for the Enterprise
PDF
Spacecrafts Made Simple: How Loft Orbital Delivers Unparalleled Speed-to-Spac...
PPTX
Bridging the gap of Relational to Hadoop using Sqoop @ Expedia
PPTX
In Flux Limiting for a multi-tenant logging service
PDF
Accelerate Analytics and ML in the Hybrid Cloud Era
PDF
How Sysbee Manages Infrastructures and Provides Advanced Monitoring by Using ...
How to Enable Industrial Decarbonization with Node-RED and InfluxDB
InfluxEnterprise Architecture Patterns by Tim Hall & Sam Dillard
From a Time-Series Database to a Key Operational Technology for the Enterprise
Spacecrafts Made Simple: How Loft Orbital Delivers Unparalleled Speed-to-Spac...
Bridging the gap of Relational to Hadoop using Sqoop @ Expedia
In Flux Limiting for a multi-tenant logging service
Accelerate Analytics and ML in the Hybrid Cloud Era
How Sysbee Manages Infrastructures and Provides Advanced Monitoring by Using ...

What's hot (20)

PDF
How to Develop and Operate Cloud First Data Platforms
PDF
High Performance Data Lake with Apache Hudi and Alluxio at T3Go
PDF
InfluxDB 2.0: Dashboarding 101 by David G. Simmons
PDF
Cassandra on Google Cloud Platform (Ravi Madasu, Google / Ben Lackey, DataSta...
PDF
WRITING QUERIES (INFLUXQL AND TICK)
PDF
Lessons Learned: Running InfluxDB Cloud and Other Cloud Services at Scale | T...
PDF
Apache Iceberg - A Table Format for Hige Analytic Datasets
PDF
Hoodie: How (And Why) We built an analytical datastore on Spark
PDF
Capital One: Using Cassandra In Building A Reporting Platform
PDF
Setting up InfluxData for IoT
PDF
It's Time To Stop Using Lambda Architecture | Yaroslav Tkachenko, Shopify
PDF
The Future of Computing is Distributed
PPTX
Zabbix at scale with Elasticsearch
PDF
Kafka for begginer
PDF
How a Time Series Database Contributes to a Decentralized Cloud Object Storag...
PDF
Stsg17 speaker yousunjeong
PPTX
HBaseConAsia2018 Track3-2: HBase at China Telecom
PDF
Building a high-performance data lake analytics engine at Alibaba Cloud with ...
PPTX
High cardinality time series search: A new level of scale - Data Day Texas 2016
PDF
Alluxio Use Cases and Future Directions
How to Develop and Operate Cloud First Data Platforms
High Performance Data Lake with Apache Hudi and Alluxio at T3Go
InfluxDB 2.0: Dashboarding 101 by David G. Simmons
Cassandra on Google Cloud Platform (Ravi Madasu, Google / Ben Lackey, DataSta...
WRITING QUERIES (INFLUXQL AND TICK)
Lessons Learned: Running InfluxDB Cloud and Other Cloud Services at Scale | T...
Apache Iceberg - A Table Format for Hige Analytic Datasets
Hoodie: How (And Why) We built an analytical datastore on Spark
Capital One: Using Cassandra In Building A Reporting Platform
Setting up InfluxData for IoT
It's Time To Stop Using Lambda Architecture | Yaroslav Tkachenko, Shopify
The Future of Computing is Distributed
Zabbix at scale with Elasticsearch
Kafka for begginer
How a Time Series Database Contributes to a Decentralized Cloud Object Storag...
Stsg17 speaker yousunjeong
HBaseConAsia2018 Track3-2: HBase at China Telecom
Building a high-performance data lake analytics engine at Alibaba Cloud with ...
High cardinality time series search: A new level of scale - Data Day Texas 2016
Alluxio Use Cases and Future Directions
Ad

Similar to InfluxEnterprise Architectural Patterns by Dean Sheehan, Senior Director, Pre & Post Sales | InfluxData (20)

PPTX
Taking Splunk to the Next Level - Architecture Breakout Session
PPTX
Paul Dix [InfluxData] | InfluxDays Opening Keynote | InfluxDays Virtual Exper...
PDF
20150704 benchmark and user experience in sahara weiting
PDF
Data Streaming For Big Data
PPTX
Data Architectures for Robust Decision Making
PPT
MYSQL
PDF
WarsawITDays_ ApacheNiFi202
PPTX
BigData Developers MeetUp
PDF
Streaming Analytics with Spark, Kafka, Cassandra and Akka by Helena Edelson
PDF
Building Big Data Streaming Architectures
PDF
Orion NTA Customer Training
PDF
Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018
PPTX
Benchmarking Solr Performance at Scale
PDF
Data Grids with Oracle Coherence
PDF
SnappyData at Spark Summit 2017
PPTX
SnappyData, the Spark Database. A unified cluster for streaming, transactions...
PPTX
Taking Splunk to the Next Level – Architecture
PPTX
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
PDF
What's New in Apache Hive
PPTX
Always On: Building Highly Available Applications on Cassandra
Taking Splunk to the Next Level - Architecture Breakout Session
Paul Dix [InfluxData] | InfluxDays Opening Keynote | InfluxDays Virtual Exper...
20150704 benchmark and user experience in sahara weiting
Data Streaming For Big Data
Data Architectures for Robust Decision Making
MYSQL
WarsawITDays_ ApacheNiFi202
BigData Developers MeetUp
Streaming Analytics with Spark, Kafka, Cassandra and Akka by Helena Edelson
Building Big Data Streaming Architectures
Orion NTA Customer Training
Managing your Black Friday Logs - Antonio Bonuccelli - Codemotion Rome 2018
Benchmarking Solr Performance at Scale
Data Grids with Oracle Coherence
SnappyData at Spark Summit 2017
SnappyData, the Spark Database. A unified cluster for streaming, transactions...
Taking Splunk to the Next Level – Architecture
Calum McCrea, Software Engineer at Kx Systems, "Kx: How Wall Street Tech can ...
What's New in Apache Hive
Always On: Building Highly Available Applications on Cassandra
Ad

More from InfluxData (20)

PPTX
Announcing InfluxDB Clustered
PDF
Best Practices for Leveraging the Apache Arrow Ecosystem
PDF
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
PDF
Power Your Predictive Analytics with InfluxDB
PDF
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
PDF
Build an Edge-to-Cloud Solution with the MING Stack
PDF
Meet the Founders: An Open Discussion About Rewriting Using Rust
PDF
Introducing InfluxDB Cloud Dedicated
PDF
Gain Better Observability with OpenTelemetry and InfluxDB
PPTX
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
PDF
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
PPTX
Introducing InfluxDB’s New Time Series Database Storage Engine
PDF
Start Automating InfluxDB Deployments at the Edge with balena
PDF
Understanding InfluxDB’s New Storage Engine
PDF
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
PPTX
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
PDF
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
PDF
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
PDF
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
PDF
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022
Announcing InfluxDB Clustered
Best Practices for Leveraging the Apache Arrow Ecosystem
How Bevi Uses InfluxDB and Grafana to Improve Predictive Maintenance and Redu...
Power Your Predictive Analytics with InfluxDB
How Teréga Replaces Legacy Data Historians with InfluxDB, AWS and IO-Base
Build an Edge-to-Cloud Solution with the MING Stack
Meet the Founders: An Open Discussion About Rewriting Using Rust
Introducing InfluxDB Cloud Dedicated
Gain Better Observability with OpenTelemetry and InfluxDB
How a Heat Treating Plant Ensures Tight Process Control and Exceptional Quali...
How Delft University's Engineering Students Make Their EV Formula-Style Race ...
Introducing InfluxDB’s New Time Series Database Storage Engine
Start Automating InfluxDB Deployments at the Edge with balena
Understanding InfluxDB’s New Storage Engine
Streamline and Scale Out Data Pipelines with Kubernetes, Telegraf, and InfluxDB
Ward Bowman [PTC] | ThingWorx Long-Term Data Storage with InfluxDB | InfluxDa...
Scott Anderson [InfluxData] | New & Upcoming Flux Features | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Closing Thoughts | InfluxDays 2022
Steinkamp, Clifford [InfluxData] | Welcome to InfluxDays 2022 - Day 2 | Influ...
Steinkamp, Clifford [InfluxData] | Closing Thoughts Day 1 | InfluxDays 2022

Recently uploaded (20)

PDF
AI.gov: A Trojan Horse in the Age of Artificial Intelligence
PDF
giants, standing on the shoulders of - by Daniel Stenberg
PPTX
Module 1 Introduction to Web Programming .pptx
PPTX
Microsoft User Copilot Training Slide Deck
PDF
IT-ITes Industry bjjbnkmkhkhknbmhkhmjhjkhj
PDF
4 layer Arch & Reference Arch of IoT.pdf
PDF
LMS bot: enhanced learning management systems for improved student learning e...
PDF
INTERSPEECH 2025 「Recent Advances and Future Directions in Voice Conversion」
PPTX
Internet of Everything -Basic concepts details
PDF
Convolutional neural network based encoder-decoder for efficient real-time ob...
PDF
A symptom-driven medical diagnosis support model based on machine learning te...
PDF
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
PPTX
agenticai-neweraofintelligence-250529192801-1b5e6870.pptx
PDF
Electrocardiogram sequences data analytics and classification using unsupervi...
PPTX
Training Program for knowledge in solar cell and solar industry
PDF
A hybrid framework for wild animal classification using fine-tuned DenseNet12...
PDF
Aug23rd - Mulesoft Community Workshop - Hyd, India.pdf
PDF
Auditboard EB SOX Playbook 2023 edition.
PDF
Early detection and classification of bone marrow changes in lumbar vertebrae...
PDF
Advancing precision in air quality forecasting through machine learning integ...
AI.gov: A Trojan Horse in the Age of Artificial Intelligence
giants, standing on the shoulders of - by Daniel Stenberg
Module 1 Introduction to Web Programming .pptx
Microsoft User Copilot Training Slide Deck
IT-ITes Industry bjjbnkmkhkhknbmhkhmjhjkhj
4 layer Arch & Reference Arch of IoT.pdf
LMS bot: enhanced learning management systems for improved student learning e...
INTERSPEECH 2025 「Recent Advances and Future Directions in Voice Conversion」
Internet of Everything -Basic concepts details
Convolutional neural network based encoder-decoder for efficient real-time ob...
A symptom-driven medical diagnosis support model based on machine learning te...
5-Ways-AI-is-Revolutionizing-Telecom-Quality-Engineering.pdf
agenticai-neweraofintelligence-250529192801-1b5e6870.pptx
Electrocardiogram sequences data analytics and classification using unsupervi...
Training Program for knowledge in solar cell and solar industry
A hybrid framework for wild animal classification using fine-tuned DenseNet12...
Aug23rd - Mulesoft Community Workshop - Hyd, India.pdf
Auditboard EB SOX Playbook 2023 edition.
Early detection and classification of bone marrow changes in lumbar vertebrae...
Advancing precision in air quality forecasting through machine learning integ...

InfluxEnterprise Architectural Patterns by Dean Sheehan, Senior Director, Pre & Post Sales | InfluxData

  • 1. Dean Sheehan Snr Director Sales Engineering InfluxEnterprise Architectural Patterns
  • 2. What we will be covering ✓ Enterprise Overview ✓ Other Features ✓ Ingestion & Query Rates ✓ Deployment Examples ✓ Replications Patterns ✓ General Advice
  • 4. Signs You’re Ready for InfluxEnterprise 6. Your CPU average is >=70% 1. The sales team starts calling you on weekends 5. Increasing throughput causing write drops errors 4. Sprawling number of single node deployments 3. Vertical scaling not providing further benefit 2. Data recording and availability matters
  • 5. InfluxEnterprise • Open Source Core • High Availability • Horizonal Scalability • Enterprise Security • Support from InfluxData • OnPremise/Cloud Deployment Options
  • 6. What Problem Are You Trying to Solve?
  • 7. What are you dealing with? • Metrics • Events • Log Data • Sensors • Apps • Servers • Long-Term Storage • Vendor Replacement • Time-Series Alerts • Visualization • Network Data • Custom Solution • Real-Time Analytics • Virtualization Monitoring • Managed Service (InfluxCloud)
  • 14. Security • LDAP Support – Enterprise customers can configure the database to use LDAP as a backing authentication source for users, roles and permissions. – Connection between DB and LDAP server secured once connected • Fine-grained authorization – Used to control access at a measurement or series level (compared to limiting access at the database level) – Enable authentication in your configuration file – Create users through the query API – Grant users explicit read and/or write privileges – Set restrictions which define a combination of database, measurement, and tags which cannot be accessed without an explicit grant
  • 15. © 2018 InfluxData. All rights reserved.© 2017 InfluxData. All rights reserved. Eventual Consistency • Anti-Entropy Service – Expands on capabilities to detect and copy full shards – Now allows for detection and repair of inconsistent shards • Hinted-Handoff Queue – Queue inbound points destined to land on other nodes in the cluster which may currently be down – Stored by node and shard (10GB - default)
  • 16. Backup and Restore • Useful for: Disaster recovery, Debugging, Restoring clusters to a consistent state • What it does: Creates a copy of the metastore and the shard data • Backup is compressed and is not human readable • Export is not compressed but is human readable • OSS and InfluxEnterprise ARE NOW compatible – aka portable • Full or partial backup options • Move data into a new database (with new Retention Policies, etc)
  • 18. Cluster Data Node 1 Data Node 2 Data Node 3 Data Node 4 Database X (Replication=1) Shard 1 Shard 2 Shard 3 Shard 4 a b c d X ≈ 4x ingest rate ≤ 1x concurrent query rate a b c d
  • 19. Cluster Data Node 1 Data Node 2 Data Node 3 Data Node 4 Database X (Replication=4) Shard 1 Shard 2 Shard 3 Shard 4 a a’ a’’ a’’’ X ≈ 1x ingest rate replication ≈ 4x concurrent query rate a b b b’ b’’ b’’’
  • 20. Cluster Data Node 1 Data Node 2 Data Node 3 Data Node 4 Database X (Replication=2) Shard 1 Shard 2 Shard 3 Shard 4 a a’ b b’ X ≈ 2x ingest rate replication replication ≤ 2x concurrent query rate a b
  • 23. Example 1: Mothership Data Center 1 Kapacitor Telegraf InfluxDB Ent Enterprise Cluster Data Node 1 Data Node 2 Data Node 3 Data Node n Firewall/ LoadBalancer Telegraf Telegraf Chronograf Chronograf Kapacitor Data Center 2 Kapacitor Telegraf InfluxDB EntTelegraf Telegraf Chronograf
  • 24. Example 2: Durable Data Ingest Telegraf Cluster Telegraf or other source Kafka Queue LoadBalancer InfluxDB Cluster Telegraf or other source Telegraf or other source Telegraf or other sources Telegraf Telegraf Telegraf Telegraf Put each Telegraf instance in the same Kafka Consumer Group How Fast is Fast? (eg): Six datanodes at 2.5M values per second
  • 25. Example 3: Influx with ElasticSearch InfluxDB Cluster • Discover trends before and during the Error from metrics • Perform Root Cause Analysis from Logs LoadBalancer Telegraf ElasticSearch Include common Session ID or other UID Kapacitor You Metrics Logs Query using the common Session ID or UID received form Alert
  • 27. How Are You Using InfluxDB?
  • 28. Data Replication Generally there are two types of data that we care about replicating: • New Data – Data which is coming form our raw sources • Derived Data – The output of a SELECT INTO, or TICK script
  • 29. Replication of New Data – Pattern 1 Cluster 1 Firewall/ Load Balancer Data Node 1 Data Node 2 Data Node 3 Data Node n Telegraf Cluster 2 Firewall/ Load Balancer Data Node 1 Data Node 2 Data Node 3 Data Node n Telegraf
  • 30. Replication of New Data – Pattern 2 Cluster 1 Firewall/ Load Balancer Data Node 1 Data Node 2 Data Node 3 Data Node n Telegraf Cluster 2 Firewall/ Load Balancer Data Node 1 Data Node 2 Data Node 3 Data Node n Telegraf Kafka Queue
  • 31. Replication of Derived Data – Pattern 3 Cluster 1 Load Balancer Cluster 2 Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Kapacitor Load Balancer Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Data Nodes Kapacitor Uses output of Kapacitor to other cluster Telegraf Telegraf
  • 33. General Cluster Advice • Batch your writes! • The number of data nodes should be a multiple of your replication factor • Use a single node of InfluxDB to monitor your cluster • Put a load balancer in front of each of your data nodes • Higher replication factors result in higher query concurrency, but higher write latency. • Use Fine Grained Authorization instead of multiple databases