SlideShare a Scribd company logo
Cybersecurity with Apache
Metron and Apache Solr
Ward Bekker - Solutions Engineer, Hortonworks
Scott Cote - Senior Software Engineer, Lucidworks
1
Agenda for today’s talk
• Introduction
• Why Apache Metron?
• What is Apache Metron?
• What does Apache Metron look like?
• Who’s using Apache Metron?
• Apache Metron Ecosystem
• Demo!
2
Ward Bekker
• Hortonworks Solutions Engineer NEMEA
• SME Cybersecurity
• Apache Metron Contributor
• Twitter: @wardbekker
Ward Bekker
Ward Bekker
•Lucidworks Senior Software Engineer - Fusion Server
•Core Engineering
•Founder of DFW Data Science User Group
•Twitter: @scottccote & @DFWDataScience
Scott Cote
SCARY
haveibeenpwned.com
Cybersecurity with Apache Metron and Apache Solr - Ward Bekker, Hortonworks & Scott Cote, Lucidworks
Cybersecurity with Apache Metron and Apache Solr - Ward Bekker, Hortonworks & Scott Cote, Lucidworks
• I commit code/docs/patches
• Running in production
• Testing in lab
• Just read/researched not touched
Why Apache Metron?
10
Months until
breach
noticed
Avg.
months log
retention
9 6
VS
3
Months
missing
==
28 Months
Police
One/Berkut
Yahoo/? FB/Cambridge
Analytica
35 Months 48 Months
Time until breach actually noticed
“Sometime in the next few years we're
going to have our first category-one
cyber-incident; one that will need a
national response.”
2018 so far...
340M Records
150M Records
92M Records
And many, many, many more..
https://en.wikipedia.org/wiki/List_of_data_
breaches
Already drowning in Data
Explosion of (new) source of data
SIEMAlarm Fatigue
18
Realtime
Verizon cyber report:
“Breaches happen in
minutes, not hours,
days, weeks or months”
"Lightspeed" by Sergei Vavinov is licensed under
CC BY 2.0
What is Apache Metron?
19
An architecture for real-time cybersecurity analytics
Telemetry Data Source
Telemetry Data Collectors
Cyber Security Stream Processing Pipeline
Apache Metron Modules
Apache SOLR usage in metron
Apache Metron
Stream Processing
pipeline
WARM/COLD INDEX LAYER: Data
Vault, Data Science workbench,
PCAP forensics,...
HOT INDEX LAYER: Real-time
search
Visualisation &
investigation
Apache Metron
Investigator
Apache Zeppelin
Built on top on proven open source big data technology
Profiling by Time
t = 1 t = 2 t = 3 t = n
⬢
⬢
⬢
⬢
⬢
⬢
⬢
⬢
What does Apache Metron look like?
28
Cybersecurity with Apache Metron and Apache Solr - Ward Bekker, Hortonworks & Scott Cote, Lucidworks
Cybersecurity with Apache Metron and Apache Solr - Ward Bekker, Hortonworks & Scott Cote, Lucidworks
Cybersecurity with Apache Metron and Apache Solr - Ward Bekker, Hortonworks & Scott Cote, Lucidworks
Cybersecurity with Apache Metron and Apache Solr - Ward Bekker, Hortonworks & Scott Cote, Lucidworks
Who is using Apache Metron?
33
Who is using Apache Metron?
34
• Managed security service providers
• Telstra
• QSight/KPN
• Financial institutions
• Capital One
• Telecom providers
• Automotive industry
• Defense ministries
• Country-wide government initiatives
Apache Metron Ecosystem
35
Apache Metron Ecosystem
36
• Visualisation and Exploration
• Real-time interactive dashboarding
• Infrastructure
• Pre-built appliances optimized for Metron
• Reporting and Compliance
• NIST and other frameworks
https://hortonworks.com/blog/building-a-cybersecurity-eco-system-on-a-shared-data-platform/
Demo
37
Demo Steps
• Uploading blobs to an Fusion Application
• One of those blobs is actually malware
• Fusion logs are ingested in Metron
• Compare the md5 signature of blobs to
know malware
• Metron Investigator UI to triage the scored
events
38
Questions?
Thank you!

More Related Content

What's hot (20)

PDF
Apache Metron in the Real World
DataWorks Summit
 
PPTX
Solving Cyber at Scale
DataWorks Summit/Hadoop Summit
 
PDF
Bringing it All Together: Apache Metron (Incubating) as a Case Study of a Mod...
DataWorks Summit
 
PPTX
Scalable and adaptable typosquatting detection in Apache Metron
DataWorks Summit
 
PPTX
Apache Spot
Austin Leahy
 
PPTX
Designing and Implementing your IOT Solutions with Open Source
DataWorks Summit/Hadoop Summit
 
PDF
Application Programming Interface
Seculert
 
PPTX
Data Science Crash Course
DataWorks Summit
 
PPTX
Mool - Automated Log Analysis using Data Science and ML
DataWorks Summit/Hadoop Summit
 
PPTX
Accelerating TensorFlow with RDMA for high-performance deep learning
DataWorks Summit
 
PPTX
Omid: scalable and highly available transaction processing for Apache Phoenix
DataWorks Summit
 
PPTX
Why is my Hadoop* job slow?
DataWorks Summit/Hadoop Summit
 
PPTX
QCon London 2015 - Wrangling Data at the IOT Rodeo
Damien Dallimore
 
PPTX
Security From The Big Data and Analytics Perspective
All Things Open
 
PPTX
Security event logging and monitoring techniques
DataWorks Summit
 
PPTX
mcubed london - data science at the edge
Simon Elliston Ball
 
PPTX
Detecting Hacks: Anomaly Detection on Networking Data
DataWorks Summit
 
PPTX
Manage democratization of the data - Data Replication in Hadoop
DataWorks Summit
 
PDF
Open Security Operations Center - OpenSOC
Sheetal Dolas
 
PDF
Fighting cybersecurity threats with Apache Spot
markgrover
 
Apache Metron in the Real World
DataWorks Summit
 
Solving Cyber at Scale
DataWorks Summit/Hadoop Summit
 
Bringing it All Together: Apache Metron (Incubating) as a Case Study of a Mod...
DataWorks Summit
 
Scalable and adaptable typosquatting detection in Apache Metron
DataWorks Summit
 
Apache Spot
Austin Leahy
 
Designing and Implementing your IOT Solutions with Open Source
DataWorks Summit/Hadoop Summit
 
Application Programming Interface
Seculert
 
Data Science Crash Course
DataWorks Summit
 
Mool - Automated Log Analysis using Data Science and ML
DataWorks Summit/Hadoop Summit
 
Accelerating TensorFlow with RDMA for high-performance deep learning
DataWorks Summit
 
Omid: scalable and highly available transaction processing for Apache Phoenix
DataWorks Summit
 
Why is my Hadoop* job slow?
DataWorks Summit/Hadoop Summit
 
QCon London 2015 - Wrangling Data at the IOT Rodeo
Damien Dallimore
 
Security From The Big Data and Analytics Perspective
All Things Open
 
Security event logging and monitoring techniques
DataWorks Summit
 
mcubed london - data science at the edge
Simon Elliston Ball
 
Detecting Hacks: Anomaly Detection on Networking Data
DataWorks Summit
 
Manage democratization of the data - Data Replication in Hadoop
DataWorks Summit
 
Open Security Operations Center - OpenSOC
Sheetal Dolas
 
Fighting cybersecurity threats with Apache Spot
markgrover
 

Similar to Cybersecurity with Apache Metron and Apache Solr - Ward Bekker, Hortonworks & Scott Cote, Lucidworks (20)

PPTX
Apache Metron Meetup May 4, 2016 - Big data cybersecurity
Hortonworks
 
PPTX
Why HTTP Won't Work For The Internet of Things (Dreamforce 2014)
kellogh
 
PDF
Test Execution Infrastructure for IoT Quality analysis
Axel Rennoch
 
PPTX
The Art of Container Monitoring
Derek Chen
 
PPTX
ThroughTheLookingGlass_EffectiveObservability.pptx
Grace Jansen
 
PDF
Proactive ops for container orchestration environments
Docker, Inc.
 
PDF
IPv6 Security Talk mit Joe Klein
Digicomp Academy AG
 
PDF
FIWARE Global Summit - Connecting Sensors to FIWARE with IDAS: An Overview
FIWARE
 
PPTX
Technology Behind IoT (JNTUK - Unit - 1)
FabMinds
 
PPTX
Gustavo Zastrow - Introduction to AWS IoT Core and MQTT
GustavoRuizZastrow
 
PDF
Io t data streaming
ratthaslip ranokphanuwat
 
PPTX
Analysis of exposed ICS//SCADA/IoT systems in Europe
Francesco Faenzi
 
PDF
Iot meets Serverless
Narendran R
 
PDF
5º MeetUP ARQconf 2016 - IoT: What is it really and how does it work?
GlobalLogic Latinoamérica
 
PDF
Devoxx university - Kafka de haut en bas
Florent Ramiere
 
PPTX
20160000 Cloud Discovery Event - Cloud Access Security Brokers
Robin Vermeirsch
 
PPTX
IoT on azure
Joanna Lamch
 
PDF
ATT&CKing the Sentinel – deploying a threat hunting capability on Azure Senti...
CloudVillage
 
PPTX
All Things Open SDN, NFV and Open Daylight
Mark Hinkle
 
PDF
IoT overview 2014
Mirko Presser
 
Apache Metron Meetup May 4, 2016 - Big data cybersecurity
Hortonworks
 
Why HTTP Won't Work For The Internet of Things (Dreamforce 2014)
kellogh
 
Test Execution Infrastructure for IoT Quality analysis
Axel Rennoch
 
The Art of Container Monitoring
Derek Chen
 
ThroughTheLookingGlass_EffectiveObservability.pptx
Grace Jansen
 
Proactive ops for container orchestration environments
Docker, Inc.
 
IPv6 Security Talk mit Joe Klein
Digicomp Academy AG
 
FIWARE Global Summit - Connecting Sensors to FIWARE with IDAS: An Overview
FIWARE
 
Technology Behind IoT (JNTUK - Unit - 1)
FabMinds
 
Gustavo Zastrow - Introduction to AWS IoT Core and MQTT
GustavoRuizZastrow
 
Io t data streaming
ratthaslip ranokphanuwat
 
Analysis of exposed ICS//SCADA/IoT systems in Europe
Francesco Faenzi
 
Iot meets Serverless
Narendran R
 
5º MeetUP ARQconf 2016 - IoT: What is it really and how does it work?
GlobalLogic Latinoamérica
 
Devoxx university - Kafka de haut en bas
Florent Ramiere
 
20160000 Cloud Discovery Event - Cloud Access Security Brokers
Robin Vermeirsch
 
IoT on azure
Joanna Lamch
 
ATT&CKing the Sentinel – deploying a threat hunting capability on Azure Senti...
CloudVillage
 
All Things Open SDN, NFV and Open Daylight
Mark Hinkle
 
IoT overview 2014
Mirko Presser
 
Ad

More from Lucidworks (20)

PDF
Search is the Tip of the Spear for Your B2B eCommerce Strategy
Lucidworks
 
PDF
Drive Agent Effectiveness in Salesforce
Lucidworks
 
PPTX
How Crate & Barrel Connects Shoppers with Relevant Products
Lucidworks
 
PPTX
Lucidworks & IMRG Webinar – Best-In-Class Retail Product Discovery
Lucidworks
 
PPTX
Connected Experiences Are Personalized Experiences
Lucidworks
 
PDF
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
Lucidworks
 
PPTX
[Webinar] Intelligent Policing. Leveraging Data to more effectively Serve Com...
Lucidworks
 
PPTX
Preparing for Peak in Ecommerce | eTail Asia 2020
Lucidworks
 
PPTX
Accelerate The Path To Purchase With Product Discovery at Retail Innovation C...
Lucidworks
 
PPTX
AI-Powered Linguistics and Search with Fusion and Rosette
Lucidworks
 
PDF
The Service Industry After COVID-19: The Soul of Service in a Virtual Moment
Lucidworks
 
PPTX
Webinar: Smart answers for employee and customer support after covid 19 - Europe
Lucidworks
 
PDF
Smart Answers for Employee and Customer Support After COVID-19
Lucidworks
 
PPTX
Applying AI & Search in Europe - featuring 451 Research
Lucidworks
 
PPTX
Webinar: Accelerate Data Science with Fusion 5.1
Lucidworks
 
PDF
Webinar: 5 Must-Have Items You Need for Your 2020 Ecommerce Strategy
Lucidworks
 
PPTX
Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...
Lucidworks
 
PPTX
Apply Knowledge Graphs and Search for Real-World Decision Intelligence
Lucidworks
 
PPTX
Webinar: Building a Business Case for Enterprise Search
Lucidworks
 
PPTX
Why Insight Engines Matter in 2020 and Beyond
Lucidworks
 
Search is the Tip of the Spear for Your B2B eCommerce Strategy
Lucidworks
 
Drive Agent Effectiveness in Salesforce
Lucidworks
 
How Crate & Barrel Connects Shoppers with Relevant Products
Lucidworks
 
Lucidworks & IMRG Webinar – Best-In-Class Retail Product Discovery
Lucidworks
 
Connected Experiences Are Personalized Experiences
Lucidworks
 
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
Lucidworks
 
[Webinar] Intelligent Policing. Leveraging Data to more effectively Serve Com...
Lucidworks
 
Preparing for Peak in Ecommerce | eTail Asia 2020
Lucidworks
 
Accelerate The Path To Purchase With Product Discovery at Retail Innovation C...
Lucidworks
 
AI-Powered Linguistics and Search with Fusion and Rosette
Lucidworks
 
The Service Industry After COVID-19: The Soul of Service in a Virtual Moment
Lucidworks
 
Webinar: Smart answers for employee and customer support after covid 19 - Europe
Lucidworks
 
Smart Answers for Employee and Customer Support After COVID-19
Lucidworks
 
Applying AI & Search in Europe - featuring 451 Research
Lucidworks
 
Webinar: Accelerate Data Science with Fusion 5.1
Lucidworks
 
Webinar: 5 Must-Have Items You Need for Your 2020 Ecommerce Strategy
Lucidworks
 
Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...
Lucidworks
 
Apply Knowledge Graphs and Search for Real-World Decision Intelligence
Lucidworks
 
Webinar: Building a Business Case for Enterprise Search
Lucidworks
 
Why Insight Engines Matter in 2020 and Beyond
Lucidworks
 
Ad

Recently uploaded (20)

PDF
July Patch Tuesday
Ivanti
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PDF
Timothy Rottach - Ramp up on AI Use Cases, from Vector Search to AI Agents wi...
AWS Chicago
 
PDF
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PPTX
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PDF
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
PDF
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
PDF
SFWelly Summer 25 Release Highlights July 2025
Anna Loughnan Colquhoun
 
PDF
Windsurf Meetup Ottawa 2025-07-12 - Planning Mode at Reliza.pdf
Pavel Shukhman
 
PDF
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
PDF
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PDF
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
July Patch Tuesday
Ivanti
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
Timothy Rottach - Ramp up on AI Use Cases, from Vector Search to AI Agents wi...
AWS Chicago
 
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
Building Search Using OpenSearch: Limitations and Workarounds
Sease
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
"AI Transformation: Directions and Challenges", Pavlo Shaternik
Fwdays
 
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
SFWelly Summer 25 Release Highlights July 2025
Anna Loughnan Colquhoun
 
Windsurf Meetup Ottawa 2025-07-12 - Planning Mode at Reliza.pdf
Pavel Shukhman
 
"Beyond English: Navigating the Challenges of Building a Ukrainian-language R...
Fwdays
 
HCIP-Data Center Facility Deployment V2.0 Training Material (Without Remarks ...
mcastillo49
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 

Cybersecurity with Apache Metron and Apache Solr - Ward Bekker, Hortonworks & Scott Cote, Lucidworks