SlideShare a Scribd company logo
1Confidential
Securing your Streaming
Data Platform
Operational considerations for a secure deployment
Andrew Lance, Vormetric
David Tucker, Confluent
2Confidential
Agenda
• Introduction to Apache Kafka and Confluent
• Overview of Vormetric and its policy-driven security solution
• Confluent Platform deployment architecture
• Security considerations and solutions
• Q&A
3Confidential
About Confluent and Apache Kafka
• Founded by the creators of Apache Kafka
• Founded September2014
• Technology developed while atLinkedIn
• 73%of active Kafka committers
Cheryl Dalrymple
CFO
Jay Kreps
CEO
Neha Narkhede
CTO, VP Engineering
Luanne Dauber
CMO
Leadership
Todd Barnett
VP WW Sales
Jabari Norton
VP Business Dev
4Confidential
Before: Many Ad Hoc Pipelines
5Confidential
After: Stream Data Platform with Kafka
 Distribute
d
 Fault
Tolerant
 Stores
Messages
Search Security
Fraud Detection Application
User Tracking Operational Logs Operational MetricsMySQL Cassandra Oracle
Hadoop Elastic Search Splunk
Data
Warehouse
Kafka
 Processes Streams
6Confidential
What is a Stream Data Platform?
Kafka
Stream Data
Platform
Search
NoSQL
RDBMS Monitoring
Stream ProcessingReal-time Analytics Data Warehouse
Apps
Apps
Hadoop
Synchronous Req/Response
0 – 100s ms
Near Real Time
> 100s ms
Offline Batch
> 1 hour
Build streaming applications
Deploy streaming applications at scale
Monitor and manage streaming applications
Common Kafka Use Cases
• Log data
• Database changes
• Sensors and device data
• Monitoring streams
• Call data records
• Real-time Monitoring
• Asynchronous applications
• Fraud and security
• Bridge to Cloud
7Confidential
People Using Kafka Today
Financial Services
Entertainment & Media
Consumer Tech
Travel & Leisure
Enterprise Tech
Telecom Retail
8 of the top 10 insurance companies &
7 of the top 10 banks in the Fortune 500
9 of the top 10 telcos
in the Fortune 500
6 of the top 10 travel companies in the Fortune
500
8Confidential
Confluent Platform: It’s Kafka ++
Feature Benefit Apache Kafka Confluent Platform 3.0 Confluent Enterprise 3.0
Apache Kafka
High throughput, low latency, high availability, secure distributed message
system
Kafka Connect
Advanced framework for connecting external sources
and destinations into Kafka
Java Client Provides easy integration into Java applications
Kafka Streams
Simple library that enables streaming application development within the
Kafka framework
Additional Clients Supports non-Java clients; C, C++, Python, Go, etc.
Rest Proxy
Provides universal access to Kafka from any network connected device via
HTTP
Schema Registry
Central registry for the format of Kafka data – guarantees all data is always
consumable
Pre-Built Connectors
HDFS, JDBC, Elastic and other connectors fully Certified
and fully supported by Confluent
Confluent Control Center Includes Connector Management and Stream Monitoring
Support
Connection and Monitoring command center provides advanced
functionality and control
Community Community 24x7x365
Free Free Subscription
12Confidential
Agenda
• Introduction to Apache Kafka and Confluent
• Overview of Vormetric and its policy-driven security solution
• Confluent Platform deployment architecture
• Security considerations and solutions
• Q&A
13
Vormetric Company Overview
13
Smart Cloud
Enterprise Plus
Global Customers
• Over 1,500 customers
• 17 of the Fortune 30
Most Security Conscious Brands
• Largest financial institutions
• Largest retail companies
• Major manufacturers
• Third party business service providers
• Government agencies
Cloud Service Providers Trust Vormetric
Business Drivers
• Executive mandates
o Data breach, insider threat
• Compliance
• SLAs
”With Vormetric, people have no idea it’s even
running. Vormetric Encryption also saved us at least
nine months of application rewrite effort, and its
installation was one of the easiest we’ve ever
experienced. ”
-Karl Mudra, CIO, Delta Dental of Missouri
15
Vormetric Data Security Platform
Application
Encryption
Vormetric
Data Security
Manager
Tokenization
Data
Masking
Key
Management
Security
Intelligence
Transparent
Encryption
Encryption
Gateway
KMaaS
16
How do we Encrypt?
Sensitive Data Protection Technologies
▌ SSL, SSH,
HTTPS, IPSEC
Data in Motion
Between Devices
Data at Rest
ENCRYPTION,TOKENIZATION,
DATA MASKING
Application/Database
File System
Disk
Application/Database
File System
Disk
17
Vormetric Transparent Encryption
Policy is used to restrict access
to sensitive data by user and
process information provided
by the Operating System.
Users
Application
Database
Operating System
FS Agent
File
Systems
Volume
Managers
SSL/TLS
*communication is only
required at system boot
18
Policy Example: Kafka
Policy Summary:
Only the specified Kafka user, using only the
verified Java process has full read/write &
automatic encrypt/decrypt access to the
protected topic data.
Privileged admins and root accounts are
allowed to manage the protected data
without seeing the sensitive contents.
All other data requests are denied and
audited.
# Resource User Process Action Effects
any Kafka User Java Read / Write Permit
Encrypt / Decrypt
(audit optional)
any Root Whitelisted
management
processes
Metadata
Only
Permit
Audit
any * * * Deny & Audit
1
2
3
1
2
3
Policy Benefits
 Data-at-rest encryption without changing configs
or application code.
 Remove custodial risk of privileged root users
19
Vormetric Security Intelligence
▌Log all access and attempted access to what
matters – the data
Reveals unauthorized access attempts to protected data
Find unusual access patterns
Identify compromised users, administrators and applications
Identify attacks on data such as APTs or malicious insiders
Prebuilt integrations: Splunk, ArcSight, Qradar, LogRhythm
20Confidential
Agenda
• Introduction to Apache Kafka and Confluent
• Overview of Vormetric and its policy-driven security solution
• Confluent Platform deployment architecture
• Security considerations and solutions
• Q&A
21Confidential
22Confidential
Kafka Topics
Topic == Distributed Commit Log
• Immutable (persisted to broker storage)
• Ordered
• Sequential Offset
• Partitioned (for scalability)
23Confidential
Kafka Deployment Architecture (simplified)
Zookeeper
Producer /
ConsumerProducer /
ConsumerProducer /
Consumer
Producer /
Consumer
Broker
Broker
Broker
Broker
Broker
Zookeeper
Zookeeper
• Zookeeper quorum
manages metadata
• Broker nodes manage (and
store) topic data
• Brokers and Clients access
ZK nodes
• Brokers communicate
directly for replication
(many-to-many)
• Broker and Zookeeper
nodes utilize local storage.
24Confidential
Kafka Deployment Architecture
Zookeeper
Producer /
ConsumerProducer /
ConsumerProducer /
Consumer
Producer /
Consumer
Broker
Broker
Broker
Broker
Broker
Zookeeper
Zookeeper
• Zookeeper quorum
manages metadata
• Broker nodes manage (and
store) topic data
• Brokers and Clients access
ZK nodes
• Brokers communicate
directly for replication
(many-to-many)
• Broker and Zookeeper
nodes utilize local storage.
25Confidential
Security Options
• Authentication
• SSL certificates support for 1-way (broker-only) or 2-way (broker and client) authentication
• SASL challenge/response support via Kerberos
• Mix-n-match : SSL for wire-level encryption, SASL for authentication
• Authorization
• Access Control Lists
• Operations: Read, Write, Create, Describe, ClusterAction, ALL
• Resources: Topic, Cluster, ConsumerGroup
• NOTE: ACL’s stored in zookeeper (along with all topic metadata)
• Data Encryption
• Vormetric policy management
26Confidential
Secure Deployments: Step by Step
• SSL Configuration
• Identify / deploy Certificate Authority
• Generate certificates (brokers, clients, or both)
• Share / Install certificates on brokers and/or clients
• Set Kafka broker properties to restrict communication to SSL channels
• Kerberos Configuration (SASL)
• Identify / deploy Kerberos principal
27Confidential
Secure Deployments: Step by Step (continued)
• Data Encryption
• Identify / Deploy Vormetric DSM
• Configure cluster brokers and ZK nodes into DSM domain
• Create and distributed keys (could be coordinated with keys used by brokers and clients)
• Define encryption policy and apply policy to the storage directories
• (test/dev best-practice: exclude metadata operations from policy enforcement)
• References:
• http://docs.confluent.io/3.0.0/kafka/security.html
• <vormetric>
28Confidential
Solution Benefits
• End-to-end security management … from Kafka topic to storage layer
• Robust access controls across all layers
• Fine grained access control
• Logical constraints on privileged users
• Alerting regarding in-band and out-of-band access attempts
29Confidential
Any questions ?
30Confidential
Thank You

More Related Content

What's hot (20)

PDF
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies...
HostedbyConfluent
 
PDF
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Kai Wähner
 
PDF
Some Iceberg Basics for Beginners (CDP).pdf
Michael Kogan
 
PDF
The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...
Databricks
 
PPTX
Introduction to Elasticsearch
Ismaeel Enjreny
 
PDF
Intro to Delta Lake
Databricks
 
PDF
Combining logs, metrics, and traces for unified observability
Elasticsearch
 
PDF
Building a fully managed stream processing platform on Flink at scale for Lin...
Flink Forward
 
PDF
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
 
PDF
Building an open data platform with apache iceberg
Alluxio, Inc.
 
PDF
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
Andrew Lamb
 
PDF
Iceberg: a fast table format for S3
DataWorks Summit
 
PDF
Redis + Kafka = Performance at Scale | Julien Ruaux, Redis Labs
HostedbyConfluent
 
PDF
ksqlDB: A Stream-Relational Database System
confluent
 
PDF
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
PPTX
The Evolution of Integration
Software AG
 
PPTX
Kafka 101
Aparna Pillai
 
PDF
Introduction to influx db
Roberto Gaudenzi
 
PPTX
Choosing an HDFS data storage format- Avro vs. Parquet and more - StampedeCon...
StampedeCon
 
PDF
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
Kai Wähner
 
Apache Kafka and API Management / API Gateway – Friends, Enemies or Frenemies...
HostedbyConfluent
 
Real-Life Use Cases & Architectures for Event Streaming with Apache Kafka
Kai Wähner
 
Some Iceberg Basics for Beginners (CDP).pdf
Michael Kogan
 
The Future of Data Science and Machine Learning at Scale: A Look at MLflow, D...
Databricks
 
Introduction to Elasticsearch
Ismaeel Enjreny
 
Intro to Delta Lake
Databricks
 
Combining logs, metrics, and traces for unified observability
Elasticsearch
 
Building a fully managed stream processing platform on Flink at scale for Lin...
Flink Forward
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
 
Building an open data platform with apache iceberg
Alluxio, Inc.
 
2022-06-23 Apache Arrow and DataFusion_ Changing the Game for implementing Da...
Andrew Lamb
 
Iceberg: a fast table format for S3
DataWorks Summit
 
Redis + Kafka = Performance at Scale | Julien Ruaux, Redis Labs
HostedbyConfluent
 
ksqlDB: A Stream-Relational Database System
confluent
 
Massive Data Processing in Adobe Using Delta Lake
Databricks
 
The Evolution of Integration
Software AG
 
Kafka 101
Aparna Pillai
 
Introduction to influx db
Roberto Gaudenzi
 
Choosing an HDFS data storage format- Avro vs. Parquet and more - StampedeCon...
StampedeCon
 
Confluent REST Proxy and Schema Registry (Concepts, Architecture, Features)
Kai Wähner
 

Viewers also liked (20)

PPTX
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...
confluent
 
PDF
Securing Kafka
confluent
 
PDF
Leveraging Mainframe Data for Modern Analytics
confluent
 
PPTX
Streaming in Practice - Putting Apache Kafka in Production
confluent
 
PDF
What's new in Confluent 3.2 and Apache Kafka 0.10.2
confluent
 
PPTX
Confluent building a real-time streaming platform using kafka streams and k...
Thomas Alex
 
PPTX
Microservices in the Apache Kafka Ecosystem
confluent
 
PPTX
Deep Dive into Apache Kafka
confluent
 
PDF
A Practical Guide to Selecting a Stream Processing Technology
confluent
 
PPTX
Apache Kafka Security
DataWorks Summit/Hadoop Summit
 
PPTX
Building an Event-oriented Data Platform with Kafka, Eric Sammer
confluent
 
PDF
Real-Time Analytics with Confluent and MemSQL
SingleStore
 
PDF
Power of the Log: LSM & Append Only Data Structures
confluent
 
PDF
Building Realtime Data Pipelines with Kafka Connect and Spark Streaming
Jen Aman
 
PPTX
Real-time Data Integration with Kafka and Cassandra (Ewen Cheslack-Postava, C...
DataStax
 
PDF
Confluent kafka meetupseattle jan2017
Nitin Kumar
 
PDF
Confluent Enterprise Datasheet
confluent
 
PPTX
Building a real-time streaming platform using Kafka Connect + Kafka Streams
confluent
 
PPTX
Introducing Kafka Streams, the new stream processing library of Apache Kafka,...
Michael Noll
 
PDF
Apache kafka-a distributed streaming platform
confluent
 
Building Large-Scale Stream Infrastructures Across Multiple Data Centers with...
confluent
 
Securing Kafka
confluent
 
Leveraging Mainframe Data for Modern Analytics
confluent
 
Streaming in Practice - Putting Apache Kafka in Production
confluent
 
What's new in Confluent 3.2 and Apache Kafka 0.10.2
confluent
 
Confluent building a real-time streaming platform using kafka streams and k...
Thomas Alex
 
Microservices in the Apache Kafka Ecosystem
confluent
 
Deep Dive into Apache Kafka
confluent
 
A Practical Guide to Selecting a Stream Processing Technology
confluent
 
Apache Kafka Security
DataWorks Summit/Hadoop Summit
 
Building an Event-oriented Data Platform with Kafka, Eric Sammer
confluent
 
Real-Time Analytics with Confluent and MemSQL
SingleStore
 
Power of the Log: LSM & Append Only Data Structures
confluent
 
Building Realtime Data Pipelines with Kafka Connect and Spark Streaming
Jen Aman
 
Real-time Data Integration with Kafka and Cassandra (Ewen Cheslack-Postava, C...
DataStax
 
Confluent kafka meetupseattle jan2017
Nitin Kumar
 
Confluent Enterprise Datasheet
confluent
 
Building a real-time streaming platform using Kafka Connect + Kafka Streams
confluent
 
Introducing Kafka Streams, the new stream processing library of Apache Kafka,...
Michael Noll
 
Apache kafka-a distributed streaming platform
confluent
 
Ad

Similar to Protecting your data at rest with Apache Kafka by Confluent and Vormetric (20)

PDF
A Tour of Apache Kafka
confluent
 
PDF
Apache Kafka 2.3 + Confluent Platform 5.3 => What's New?
Kai Wähner
 
PDF
What's new in confluent platform 5.4 online talk
confluent
 
PDF
Kafka Security 101 and Real-World Tips
confluent
 
PDF
Benefits of Stream Processing and Apache Kafka Use Cases
confluent
 
PDF
Team Collaboration in Kafka Clusters With Maria Berinde-Tampanariu | Current ...
HostedbyConfluent
 
PPTX
Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Michael Noll
 
PDF
Au delà des brokers, un tour de l’environnement Kafka | Florent Ramière
confluent
 
PDF
Paris FOD meetup - kafka security 101
Abdelkrim Hadjidj
 
PDF
Confluent Partner Tech Talk with Synthesis
confluent
 
PDF
Introduction to Apache Kafka and why it matters - Madrid
Paolo Castagna
 
PPTX
Data Pipelines with Kafka Connect
Kaufman Ng
 
PDF
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
Kai Wähner
 
PDF
Kafka 2018 - Securing Kafka the Right Way
Saylor Twift
 
PDF
Beyond the brokers - Un tour de l'écosystème Kafka
Florent Ramiere
 
PPTX
Kafka Security
DataWorks Summit/Hadoop Summit
 
PPTX
Confluent and Syncsort Webinar August 2016
Precisely
 
PDF
Apache Kafka® Security Overview
confluent
 
PDF
Why Build an Apache Kafka® Connector
confluent
 
PPTX
Westpac Bank Tech Talk 1: Dive into Apache Kafka
confluent
 
A Tour of Apache Kafka
confluent
 
Apache Kafka 2.3 + Confluent Platform 5.3 => What's New?
Kai Wähner
 
What's new in confluent platform 5.4 online talk
confluent
 
Kafka Security 101 and Real-World Tips
confluent
 
Benefits of Stream Processing and Apache Kafka Use Cases
confluent
 
Team Collaboration in Kafka Clusters With Maria Berinde-Tampanariu | Current ...
HostedbyConfluent
 
Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Michael Noll
 
Au delà des brokers, un tour de l’environnement Kafka | Florent Ramière
confluent
 
Paris FOD meetup - kafka security 101
Abdelkrim Hadjidj
 
Confluent Partner Tech Talk with Synthesis
confluent
 
Introduction to Apache Kafka and why it matters - Madrid
Paolo Castagna
 
Data Pipelines with Kafka Connect
Kaufman Ng
 
Confluent Platform 5.4 + Apache Kafka 2.4 Overview (RBAC, Tiered Storage, Mul...
Kai Wähner
 
Kafka 2018 - Securing Kafka the Right Way
Saylor Twift
 
Beyond the brokers - Un tour de l'écosystème Kafka
Florent Ramiere
 
Confluent and Syncsort Webinar August 2016
Precisely
 
Apache Kafka® Security Overview
confluent
 
Why Build an Apache Kafka® Connector
confluent
 
Westpac Bank Tech Talk 1: Dive into Apache Kafka
confluent
 
Ad

More from confluent (20)

PDF
Stream Processing Handson Workshop - Flink SQL Hands-on Workshop (Korean)
confluent
 
PPTX
Webinar Think Right - Shift Left - 19-03-2025.pptx
confluent
 
PDF
Migration, backup and restore made easy using Kannika
confluent
 
PDF
Five Things You Need to Know About Data Streaming in 2025
confluent
 
PDF
Data in Motion Tour Seoul 2024 - Keynote
confluent
 
PDF
Data in Motion Tour Seoul 2024 - Roadmap Demo
confluent
 
PDF
From Stream to Screen: Real-Time Data Streaming to Web Frontends with Conflue...
confluent
 
PDF
Confluent per il settore FSI: Accelerare l'Innovazione con il Data Streaming...
confluent
 
PDF
Data in Motion Tour 2024 Riyadh, Saudi Arabia
confluent
 
PDF
Build a Real-Time Decision Support Application for Financial Market Traders w...
confluent
 
PDF
Strumenti e Strategie di Stream Governance con Confluent Platform
confluent
 
PDF
Compose Gen-AI Apps With Real-Time Data - In Minutes, Not Weeks
confluent
 
PDF
Building Real-Time Gen AI Applications with SingleStore and Confluent
confluent
 
PDF
Unlocking value with event-driven architecture by Confluent
confluent
 
PDF
Il Data Streaming per un’AI real-time di nuova generazione
confluent
 
PDF
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
confluent
 
PDF
Break data silos with real-time connectivity using Confluent Cloud Connectors
confluent
 
PDF
Building API data products on top of your real-time data infrastructure
confluent
 
PDF
Speed Wins: From Kafka to APIs in Minutes
confluent
 
PDF
Evolving Data Governance for the Real-time Streaming and AI Era
confluent
 
Stream Processing Handson Workshop - Flink SQL Hands-on Workshop (Korean)
confluent
 
Webinar Think Right - Shift Left - 19-03-2025.pptx
confluent
 
Migration, backup and restore made easy using Kannika
confluent
 
Five Things You Need to Know About Data Streaming in 2025
confluent
 
Data in Motion Tour Seoul 2024 - Keynote
confluent
 
Data in Motion Tour Seoul 2024 - Roadmap Demo
confluent
 
From Stream to Screen: Real-Time Data Streaming to Web Frontends with Conflue...
confluent
 
Confluent per il settore FSI: Accelerare l'Innovazione con il Data Streaming...
confluent
 
Data in Motion Tour 2024 Riyadh, Saudi Arabia
confluent
 
Build a Real-Time Decision Support Application for Financial Market Traders w...
confluent
 
Strumenti e Strategie di Stream Governance con Confluent Platform
confluent
 
Compose Gen-AI Apps With Real-Time Data - In Minutes, Not Weeks
confluent
 
Building Real-Time Gen AI Applications with SingleStore and Confluent
confluent
 
Unlocking value with event-driven architecture by Confluent
confluent
 
Il Data Streaming per un’AI real-time di nuova generazione
confluent
 
Unleashing the Future: Building a Scalable and Up-to-Date GenAI Chatbot with ...
confluent
 
Break data silos with real-time connectivity using Confluent Cloud Connectors
confluent
 
Building API data products on top of your real-time data infrastructure
confluent
 
Speed Wins: From Kafka to APIs in Minutes
confluent
 
Evolving Data Governance for the Real-time Streaming and AI Era
confluent
 

Recently uploaded (20)

PDF
Continouous failure - Why do we make our lives hard?
Papp Krisztián
 
PDF
Alarm in Android-Scheduling Timed Tasks Using AlarmManager in Android.pdf
Nabin Dhakal
 
PPTX
Perfecting XM Cloud for Multisite Setup.pptx
Ahmed Okour
 
PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked} 2025
hashhshs786
 
PDF
Salesforce CRM Services.VALiNTRY360
VALiNTRY360
 
PPTX
Migrating Millions of Users with Debezium, Apache Kafka, and an Acyclic Synch...
MD Sayem Ahmed
 
PPTX
3uTools Full Crack Free Version Download [Latest] 2025
muhammadgurbazkhan
 
PPT
MergeSortfbsjbjsfk sdfik k
RafishaikIT02044
 
PDF
Executive Business Intelligence Dashboards
vandeslie24
 
PPTX
Human Resources Information System (HRIS)
Amity University, Patna
 
PDF
Unlock Efficiency with Insurance Policy Administration Systems
Insurance Tech Services
 
PPTX
Engineering the Java Web Application (MVC)
abhishekoza1981
 
PDF
Mobile CMMS Solutions Empowering the Frontline Workforce
CryotosCMMSSoftware
 
PPTX
Tally_Basic_Operations_Presentation.pptx
AditiBansal54083
 
PDF
Understanding the Need for Systemic Change in Open Source Through Intersectio...
Imma Valls Bernaus
 
PPTX
MailsDaddy Outlook OST to PST converter.pptx
abhishekdutt366
 
PDF
GridView,Recycler view, API, SQLITE& NetworkRequest.pdf
Nabin Dhakal
 
PPTX
Hardware(Central Processing Unit ) CU and ALU
RizwanaKalsoom2
 
PPTX
Revolutionizing Code Modernization with AI
KrzysztofKkol1
 
PPTX
How Apagen Empowered an EPC Company with Engineering ERP Software
SatishKumar2651
 
Continouous failure - Why do we make our lives hard?
Papp Krisztián
 
Alarm in Android-Scheduling Timed Tasks Using AlarmManager in Android.pdf
Nabin Dhakal
 
Perfecting XM Cloud for Multisite Setup.pptx
Ahmed Okour
 
Capcut Pro Crack For PC Latest Version {Fully Unlocked} 2025
hashhshs786
 
Salesforce CRM Services.VALiNTRY360
VALiNTRY360
 
Migrating Millions of Users with Debezium, Apache Kafka, and an Acyclic Synch...
MD Sayem Ahmed
 
3uTools Full Crack Free Version Download [Latest] 2025
muhammadgurbazkhan
 
MergeSortfbsjbjsfk sdfik k
RafishaikIT02044
 
Executive Business Intelligence Dashboards
vandeslie24
 
Human Resources Information System (HRIS)
Amity University, Patna
 
Unlock Efficiency with Insurance Policy Administration Systems
Insurance Tech Services
 
Engineering the Java Web Application (MVC)
abhishekoza1981
 
Mobile CMMS Solutions Empowering the Frontline Workforce
CryotosCMMSSoftware
 
Tally_Basic_Operations_Presentation.pptx
AditiBansal54083
 
Understanding the Need for Systemic Change in Open Source Through Intersectio...
Imma Valls Bernaus
 
MailsDaddy Outlook OST to PST converter.pptx
abhishekdutt366
 
GridView,Recycler view, API, SQLITE& NetworkRequest.pdf
Nabin Dhakal
 
Hardware(Central Processing Unit ) CU and ALU
RizwanaKalsoom2
 
Revolutionizing Code Modernization with AI
KrzysztofKkol1
 
How Apagen Empowered an EPC Company with Engineering ERP Software
SatishKumar2651
 

Protecting your data at rest with Apache Kafka by Confluent and Vormetric

  • 1. 1Confidential Securing your Streaming Data Platform Operational considerations for a secure deployment Andrew Lance, Vormetric David Tucker, Confluent
  • 2. 2Confidential Agenda • Introduction to Apache Kafka and Confluent • Overview of Vormetric and its policy-driven security solution • Confluent Platform deployment architecture • Security considerations and solutions • Q&A
  • 3. 3Confidential About Confluent and Apache Kafka • Founded by the creators of Apache Kafka • Founded September2014 • Technology developed while atLinkedIn • 73%of active Kafka committers Cheryl Dalrymple CFO Jay Kreps CEO Neha Narkhede CTO, VP Engineering Luanne Dauber CMO Leadership Todd Barnett VP WW Sales Jabari Norton VP Business Dev
  • 5. 5Confidential After: Stream Data Platform with Kafka  Distribute d  Fault Tolerant  Stores Messages Search Security Fraud Detection Application User Tracking Operational Logs Operational MetricsMySQL Cassandra Oracle Hadoop Elastic Search Splunk Data Warehouse Kafka  Processes Streams
  • 6. 6Confidential What is a Stream Data Platform? Kafka Stream Data Platform Search NoSQL RDBMS Monitoring Stream ProcessingReal-time Analytics Data Warehouse Apps Apps Hadoop Synchronous Req/Response 0 – 100s ms Near Real Time > 100s ms Offline Batch > 1 hour Build streaming applications Deploy streaming applications at scale Monitor and manage streaming applications Common Kafka Use Cases • Log data • Database changes • Sensors and device data • Monitoring streams • Call data records • Real-time Monitoring • Asynchronous applications • Fraud and security • Bridge to Cloud
  • 7. 7Confidential People Using Kafka Today Financial Services Entertainment & Media Consumer Tech Travel & Leisure Enterprise Tech Telecom Retail 8 of the top 10 insurance companies & 7 of the top 10 banks in the Fortune 500 9 of the top 10 telcos in the Fortune 500 6 of the top 10 travel companies in the Fortune 500
  • 8. 8Confidential Confluent Platform: It’s Kafka ++ Feature Benefit Apache Kafka Confluent Platform 3.0 Confluent Enterprise 3.0 Apache Kafka High throughput, low latency, high availability, secure distributed message system Kafka Connect Advanced framework for connecting external sources and destinations into Kafka Java Client Provides easy integration into Java applications Kafka Streams Simple library that enables streaming application development within the Kafka framework Additional Clients Supports non-Java clients; C, C++, Python, Go, etc. Rest Proxy Provides universal access to Kafka from any network connected device via HTTP Schema Registry Central registry for the format of Kafka data – guarantees all data is always consumable Pre-Built Connectors HDFS, JDBC, Elastic and other connectors fully Certified and fully supported by Confluent Confluent Control Center Includes Connector Management and Stream Monitoring Support Connection and Monitoring command center provides advanced functionality and control Community Community 24x7x365 Free Free Subscription
  • 9. 12Confidential Agenda • Introduction to Apache Kafka and Confluent • Overview of Vormetric and its policy-driven security solution • Confluent Platform deployment architecture • Security considerations and solutions • Q&A
  • 10. 13 Vormetric Company Overview 13 Smart Cloud Enterprise Plus Global Customers • Over 1,500 customers • 17 of the Fortune 30 Most Security Conscious Brands • Largest financial institutions • Largest retail companies • Major manufacturers • Third party business service providers • Government agencies Cloud Service Providers Trust Vormetric Business Drivers • Executive mandates o Data breach, insider threat • Compliance • SLAs ”With Vormetric, people have no idea it’s even running. Vormetric Encryption also saved us at least nine months of application rewrite effort, and its installation was one of the easiest we’ve ever experienced. ” -Karl Mudra, CIO, Delta Dental of Missouri
  • 11. 15 Vormetric Data Security Platform Application Encryption Vormetric Data Security Manager Tokenization Data Masking Key Management Security Intelligence Transparent Encryption Encryption Gateway KMaaS
  • 12. 16 How do we Encrypt? Sensitive Data Protection Technologies ▌ SSL, SSH, HTTPS, IPSEC Data in Motion Between Devices Data at Rest ENCRYPTION,TOKENIZATION, DATA MASKING Application/Database File System Disk Application/Database File System Disk
  • 13. 17 Vormetric Transparent Encryption Policy is used to restrict access to sensitive data by user and process information provided by the Operating System. Users Application Database Operating System FS Agent File Systems Volume Managers SSL/TLS *communication is only required at system boot
  • 14. 18 Policy Example: Kafka Policy Summary: Only the specified Kafka user, using only the verified Java process has full read/write & automatic encrypt/decrypt access to the protected topic data. Privileged admins and root accounts are allowed to manage the protected data without seeing the sensitive contents. All other data requests are denied and audited. # Resource User Process Action Effects any Kafka User Java Read / Write Permit Encrypt / Decrypt (audit optional) any Root Whitelisted management processes Metadata Only Permit Audit any * * * Deny & Audit 1 2 3 1 2 3 Policy Benefits  Data-at-rest encryption without changing configs or application code.  Remove custodial risk of privileged root users
  • 15. 19 Vormetric Security Intelligence ▌Log all access and attempted access to what matters – the data Reveals unauthorized access attempts to protected data Find unusual access patterns Identify compromised users, administrators and applications Identify attacks on data such as APTs or malicious insiders Prebuilt integrations: Splunk, ArcSight, Qradar, LogRhythm
  • 16. 20Confidential Agenda • Introduction to Apache Kafka and Confluent • Overview of Vormetric and its policy-driven security solution • Confluent Platform deployment architecture • Security considerations and solutions • Q&A
  • 18. 22Confidential Kafka Topics Topic == Distributed Commit Log • Immutable (persisted to broker storage) • Ordered • Sequential Offset • Partitioned (for scalability)
  • 19. 23Confidential Kafka Deployment Architecture (simplified) Zookeeper Producer / ConsumerProducer / ConsumerProducer / Consumer Producer / Consumer Broker Broker Broker Broker Broker Zookeeper Zookeeper • Zookeeper quorum manages metadata • Broker nodes manage (and store) topic data • Brokers and Clients access ZK nodes • Brokers communicate directly for replication (many-to-many) • Broker and Zookeeper nodes utilize local storage.
  • 20. 24Confidential Kafka Deployment Architecture Zookeeper Producer / ConsumerProducer / ConsumerProducer / Consumer Producer / Consumer Broker Broker Broker Broker Broker Zookeeper Zookeeper • Zookeeper quorum manages metadata • Broker nodes manage (and store) topic data • Brokers and Clients access ZK nodes • Brokers communicate directly for replication (many-to-many) • Broker and Zookeeper nodes utilize local storage.
  • 21. 25Confidential Security Options • Authentication • SSL certificates support for 1-way (broker-only) or 2-way (broker and client) authentication • SASL challenge/response support via Kerberos • Mix-n-match : SSL for wire-level encryption, SASL for authentication • Authorization • Access Control Lists • Operations: Read, Write, Create, Describe, ClusterAction, ALL • Resources: Topic, Cluster, ConsumerGroup • NOTE: ACL’s stored in zookeeper (along with all topic metadata) • Data Encryption • Vormetric policy management
  • 22. 26Confidential Secure Deployments: Step by Step • SSL Configuration • Identify / deploy Certificate Authority • Generate certificates (brokers, clients, or both) • Share / Install certificates on brokers and/or clients • Set Kafka broker properties to restrict communication to SSL channels • Kerberos Configuration (SASL) • Identify / deploy Kerberos principal
  • 23. 27Confidential Secure Deployments: Step by Step (continued) • Data Encryption • Identify / Deploy Vormetric DSM • Configure cluster brokers and ZK nodes into DSM domain • Create and distributed keys (could be coordinated with keys used by brokers and clients) • Define encryption policy and apply policy to the storage directories • (test/dev best-practice: exclude metadata operations from policy enforcement) • References: • http://docs.confluent.io/3.0.0/kafka/security.html • <vormetric>
  • 24. 28Confidential Solution Benefits • End-to-end security management … from Kafka topic to storage layer • Robust access controls across all layers • Fine grained access control • Logical constraints on privileged users • Alerting regarding in-band and out-of-band access attempts

Editor's Notes

  • #8: 35% of the Fortune 500 7 out of 10 of the top 10 Fortune 500 global banks 8 of the top 10 insurance companies 9 of the 10 top telecom companies 6 of the top 10 travel companies
  • #10: Talking Points: 1. Companies are faced with very complex environments with difficult to manage parts. They want to organize large amounts of data into a well managed, unified stream data platform. 2. Customers use Confluent Platform for realtime, batch operational and analytical purposes. Take away the costly and labor intensive process of developing proprietary data replication practices and allow the Confluent Platform to make data available in realtime streams. 3. Our platform has Kafka at the core (same build as open source Kafka but with additional bug fixes applied) with components and tools that allow you successfully deploy to production, including: Kafka Schema management layer (ensures data compatibility across applications Java and Rest clients that integrate with our schema management layer Kafka Connect Kafka Streams Authentication and Authorization Confluent Control Center
  • #14: 1,300 customers, including many of the largest enterprise customers trust us. IBM and Symantec also deliver our products through OEM arrangements.
  • #17: Separate Application and Database TDE/Columnar
  • #18: Vormetric Encryption Expert Agents are software agents that insert above the file system logical volume layers. The agents evaluate any attempt to access the protected data and apply predetermined policies to either grant or deny such attempts. This is a proven high-performance solution that transparently integrates into: Linux, UNIX, and Windows operating systems   to protect data in physical, virtual, and cloud environments. across all leading applications, databases, operating systems, and storage devices.
  • #26: SASL : simply authentication and Security Layer