SlideShare a Scribd company logo
Tracing Your Security Telemetry
With Apache Metron
Justin Leet
Systems Architect
June 29, 2016
2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
What is Apache Metron?
3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
What Apache Metron Does?
“Apache Metron provides a scalable advanced
security analytics framework built with the Hadoop
Community evolving from the Cisco OpenSOC
Project.
A cyber security application framework that provides
organizations the ability to detect cyber anomalies
and enable organizations to rapidly respond to
identified anomalies.”
4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Apache Metron Timeline
Sep 2014 •OpenSOC Beta
June 2015
•OpenSOC Community Edition
Dec 2015
•Metron enters Apache Incubator
April 2016
•Apache Metron 0.1
Now
•Working towards 0.2 release
5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Who is Metron for?
6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Core Capabilities
7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Architecture
8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Streaming Parsing and Enrichment
9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
 Metron’s parsing bolt can be configured two ways
– And outputs JSON
 Grok Parser
– Less work to implement
– Regex-like syntax
– Good for lower volumes of data
 Java Parser
– More work to implement
– Good for higher volumes of data
Parsing
10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Enrichment / Threat Intel
11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
 Add additional information to raw source during streaming
 Adding it during streaming allows ML models to score in real time instead of
batch
 Primarily stored in HBase
 Several enrichments
– GeoIP
– Host
– Threat Intelligence
Enrichment
12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
 Occurs in the same Storm topology as enrichment
 Very similar process and flow
 Use a threat feed aggregator!
– Soltra adapter is provided to read feed and stream into HBase
– Flat File loader and Stix bulk loader available without threat feed aggregator
Threat Intel
13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Field Description
ip_src_addr Octet source IP
ip_dest_addr Octet destination IP
ip_src_port Integer source port
ip_dest_port Integer destination port
protocol String protocol (e.g. TCP)
timestamp Sensor epoch timestamp
source.type yaf, snort, etc.
start_time Metron epoch timestamp
end_time Metron epoch timestamp
Metron JSON
14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
 Standalone Storm topology
 Reads from Kafka
 Writes packets to HDFS
 Kibana panel forwards request to REST PCAP service
– MR Job launched
– Delivers results back to Kibana
PCAP
15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
PCAP
16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Tracing a Source Through Metron
17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Sensor to Parser
18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
 Caching proxy
– Mostly useful as a source of easy to get and easily readable logs
Squid
1467125585.752 5288 127.0.0.1 TCP_MISS/200 32250 GET https://news.ycombinator.com/ - DIRECT/104.20.43.44 text/html
Time Elapsed Remote Host Code/Statu
s
Bytes Metho
d
URL rfc931 Peer Status/ Peer Host Type
1467125585.752 5288 127.0.0.1 TCP_MISS/2
00
32250 GET https://news.ycombinator.com/ - DIRECT/104.20.43.44 text/html
19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Squid - Grok
Time Elapsed Remote Host Code/Statu
s
Bytes Metho
d
URL rfc931 Peer Status/ Peer Host Type
1467125585.752 5288 127.0.0.1 TCP_MISS/2
00
32250 GET https://news.ycombinator.com/ - DIRECT/104.20.43.44 text/html
SQUID_DELIMITED %{NUMBER:timestamp}%{SPACE:UNWANTED}
%{INT:elapsed}%{SPACE:UNWANTED}%{IPV4:ip_src_addr} %{WORD:action}/%{NUMBER:code}
%{NUMBER:bytes} %{WORD:method} %{NOTSPACE:url} -
%{WORD:UNWANTED}/%{IPV4:ip_dst_addr} %{WORD:UNWANTED}/%{WORD:UNWANTED}
20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Squid – Topology Definition
{ "parserClassName": "org.apache.metron.parsers.GrokParser", "sensorTopic": "squid", "pars
erConfig":
{ "grokPath": "/apps/metron/patterns/squid", "patternLabel": "SQUID_DELIMITED", "tim
estampField": "timestamp" },
"fieldTransformations" : [
{
"transformation" : "MTL" ,"output" : [ "full_hostname",
"domain_without_subdomains" ] ,"config" : { "full_hostname" :
"URL_TO_HOST(url)" ,"domain_without_subdomains" :
"DOMAIN_REMOVE_SUBDOMAINS(full_hostname)" } } ] }
21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Squid – Topology Result
22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Enrichment Topology
23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
 Loading some WHOIS derived data.
– Not directly making WHOIS query, just using a CSV containing a few rows of data.
Squid – Enrichment Definition
{
"zkQuorum" : ”localhost:2181"
,"sensorToFieldList" : {
"squid" : {
"type" : "ENRICHMENT"
,"fieldToEnrichmentTypes" : {
"domain_without_subdomains" : [ "whois" ]
}
}
}
}
24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Squid – Enrichment Result
25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Enrichment Topology
26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
 Loading a list of malicious domains
– ZeuS tracker
Squid – Enrichment Definition
{
"zkQuorum": "localhost:2181",
"sensorToFieldList": {
"squid": {
"type": "THREAT_INTEL",
"fieldToEnrichmentTypes": {
"url": ["zeusList”]
}
}
}
}
27 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Squid – Threat Intel Result
28 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Questions?
Justin Leet
Systems Architect
jleet@hortonworks.com
justinjleet@gmail.com

More Related Content

What's hot (20)

PPTX
An Overview on Optimization in Apache Hive: Past, Present, Future
DataWorks Summit
 
PPTX
Omid: scalable and highly available transaction processing for Apache Phoenix
DataWorks Summit
 
PPTX
LLAP: Building Cloud First BI
DataWorks Summit
 
PPTX
LLAP: Sub-Second Analytical Queries in Hive
DataWorks Summit/Hadoop Summit
 
PDF
An Apache Hive Based Data Warehouse
DataWorks Summit
 
PPTX
Mool - Automated Log Analysis using Data Science and ML
DataWorks Summit/Hadoop Summit
 
PPTX
Enabling Apache Zeppelin and Spark for Data Science in the Enterprise
DataWorks Summit/Hadoop Summit
 
PPTX
Hive edw-dataworks summit-eu-april-2017
alanfgates
 
PPTX
Fine-Grained Security for Spark and Hive
DataWorks Summit/Hadoop Summit
 
PPTX
Cloud Operations with Streaming Analytics using Apache NiFi and Apache Flink
DataWorks Summit
 
PPTX
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
DataWorks Summit/Hadoop Summit
 
PPTX
Accelerating TensorFlow with RDMA for high-performance deep learning
DataWorks Summit
 
PPTX
Hadoop & Cloud Storage: Object Store Integration in Production
DataWorks Summit/Hadoop Summit
 
PPTX
Row/Column- Level Security in SQL for Apache Spark
DataWorks Summit/Hadoop Summit
 
PPTX
Mission to NARs with Apache NiFi
Hortonworks
 
PDF
Next Generation Execution for Apache Storm
DataWorks Summit
 
PPTX
Streamline Hadoop DevOps with Apache Ambari
DataWorks Summit/Hadoop Summit
 
PPTX
A Multi Colored YARN
DataWorks Summit/Hadoop Summit
 
PPTX
Omid: scalable and highly available transaction processing for Apache Phoenix
DataWorks Summit
 
PPTX
Building a Smarter Home with Apache NiFi and Spark
DataWorks Summit/Hadoop Summit
 
An Overview on Optimization in Apache Hive: Past, Present, Future
DataWorks Summit
 
Omid: scalable and highly available transaction processing for Apache Phoenix
DataWorks Summit
 
LLAP: Building Cloud First BI
DataWorks Summit
 
LLAP: Sub-Second Analytical Queries in Hive
DataWorks Summit/Hadoop Summit
 
An Apache Hive Based Data Warehouse
DataWorks Summit
 
Mool - Automated Log Analysis using Data Science and ML
DataWorks Summit/Hadoop Summit
 
Enabling Apache Zeppelin and Spark for Data Science in the Enterprise
DataWorks Summit/Hadoop Summit
 
Hive edw-dataworks summit-eu-april-2017
alanfgates
 
Fine-Grained Security for Spark and Hive
DataWorks Summit/Hadoop Summit
 
Cloud Operations with Streaming Analytics using Apache NiFi and Apache Flink
DataWorks Summit
 
Apache Phoenix and HBase: Past, Present and Future of SQL over HBase
DataWorks Summit/Hadoop Summit
 
Accelerating TensorFlow with RDMA for high-performance deep learning
DataWorks Summit
 
Hadoop & Cloud Storage: Object Store Integration in Production
DataWorks Summit/Hadoop Summit
 
Row/Column- Level Security in SQL for Apache Spark
DataWorks Summit/Hadoop Summit
 
Mission to NARs with Apache NiFi
Hortonworks
 
Next Generation Execution for Apache Storm
DataWorks Summit
 
Streamline Hadoop DevOps with Apache Ambari
DataWorks Summit/Hadoop Summit
 
A Multi Colored YARN
DataWorks Summit/Hadoop Summit
 
Omid: scalable and highly available transaction processing for Apache Phoenix
DataWorks Summit
 
Building a Smarter Home with Apache NiFi and Spark
DataWorks Summit/Hadoop Summit
 

Similar to Tracing your security telemetry with Apache Metron (20)

PPTX
Apache Metron: Community Driven Cyber Security
DataWorks Summit/Hadoop Summit
 
PDF
Apache Metron in the Real World
Dave Russell
 
PDF
Apache Metron in the Real World
DataWorks Summit
 
PPTX
Apache metron meetup presentation at capital one
gvetticaden
 
PPTX
Apache Metron Meetup May 4, 2016 - Big data cybersecurity
Hortonworks
 
PPSX
Apache metron - An Introduction
Baban Gaigole
 
PPTX
Just the sketch: advanced streaming analytics in Apache Metron
DataWorks Summit
 
PDF
Solving Cybersecurity at Scale
DataWorks Summit
 
PDF
Bringing it All Together: Apache Metron (Incubating) as a Case Study of a Mod...
DataWorks Summit
 
PDF
Open Security Operations Center - OpenSOC
Sheetal Dolas
 
PPTX
A streaming architecture for Cyber Security - Apache Metron
Simon Elliston Ball
 
PPTX
Future of Apache Storm
DataWorks Summit/Hadoop Summit
 
PPTX
Building a modern end-to-end open source Big Data reference application
DataWorks Summit
 
PDF
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Hortonworks
 
PPTX
An adaptive and eventually self healing framework for geo-distributed real-ti...
Angad Singh
 
PDF
Cybersecurity with Apache Metron and Apache Solr - Ward Bekker, Hortonworks &...
Lucidworks
 
PDF
Big Traffic, Big Trouble: Big Data - Tokyo
DataWorks Summit
 
PDF
Big Traffic, Big Trouble: Big Data Security Analytics
DataWorks Summit
 
PDF
Mhug apache storm
Joseph Niemiec
 
PDF
Apache Metron Profiler - Cyber Bootcamp 2017
Nick Allen
 
Apache Metron: Community Driven Cyber Security
DataWorks Summit/Hadoop Summit
 
Apache Metron in the Real World
Dave Russell
 
Apache Metron in the Real World
DataWorks Summit
 
Apache metron meetup presentation at capital one
gvetticaden
 
Apache Metron Meetup May 4, 2016 - Big data cybersecurity
Hortonworks
 
Apache metron - An Introduction
Baban Gaigole
 
Just the sketch: advanced streaming analytics in Apache Metron
DataWorks Summit
 
Solving Cybersecurity at Scale
DataWorks Summit
 
Bringing it All Together: Apache Metron (Incubating) as a Case Study of a Mod...
DataWorks Summit
 
Open Security Operations Center - OpenSOC
Sheetal Dolas
 
A streaming architecture for Cyber Security - Apache Metron
Simon Elliston Ball
 
Future of Apache Storm
DataWorks Summit/Hadoop Summit
 
Building a modern end-to-end open source Big Data reference application
DataWorks Summit
 
Catch a Hacker in Real-Time: Live Visuals of Bots and Bad Guys
Hortonworks
 
An adaptive and eventually self healing framework for geo-distributed real-ti...
Angad Singh
 
Cybersecurity with Apache Metron and Apache Solr - Ward Bekker, Hortonworks &...
Lucidworks
 
Big Traffic, Big Trouble: Big Data - Tokyo
DataWorks Summit
 
Big Traffic, Big Trouble: Big Data Security Analytics
DataWorks Summit
 
Mhug apache storm
Joseph Niemiec
 
Apache Metron Profiler - Cyber Bootcamp 2017
Nick Allen
 
Ad

More from DataWorks Summit/Hadoop Summit (20)

PPT
Running Apache Spark & Apache Zeppelin in Production
DataWorks Summit/Hadoop Summit
 
PPT
State of Security: Apache Spark & Apache Zeppelin
DataWorks Summit/Hadoop Summit
 
PDF
Unleashing the Power of Apache Atlas with Apache Ranger
DataWorks Summit/Hadoop Summit
 
PDF
Enabling Digital Diagnostics with a Data Science Platform
DataWorks Summit/Hadoop Summit
 
PDF
Revolutionize Text Mining with Spark and Zeppelin
DataWorks Summit/Hadoop Summit
 
PDF
Double Your Hadoop Performance with Hortonworks SmartSense
DataWorks Summit/Hadoop Summit
 
PDF
Hadoop Crash Course
DataWorks Summit/Hadoop Summit
 
PDF
Data Science Crash Course
DataWorks Summit/Hadoop Summit
 
PDF
Apache Spark Crash Course
DataWorks Summit/Hadoop Summit
 
PDF
Dataflow with Apache NiFi
DataWorks Summit/Hadoop Summit
 
PPTX
Schema Registry - Set you Data Free
DataWorks Summit/Hadoop Summit
 
PPTX
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
DataWorks Summit/Hadoop Summit
 
PDF
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
DataWorks Summit/Hadoop Summit
 
PPTX
How Hadoop Makes the Natixis Pack More Efficient
DataWorks Summit/Hadoop Summit
 
PPTX
HBase in Practice
DataWorks Summit/Hadoop Summit
 
PPTX
The Challenge of Driving Business Value from the Analytics of Things (AOT)
DataWorks Summit/Hadoop Summit
 
PDF
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
DataWorks Summit/Hadoop Summit
 
PPTX
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
DataWorks Summit/Hadoop Summit
 
PPTX
Backup and Disaster Recovery in Hadoop
DataWorks Summit/Hadoop Summit
 
PPTX
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
DataWorks Summit/Hadoop Summit
 
Running Apache Spark & Apache Zeppelin in Production
DataWorks Summit/Hadoop Summit
 
State of Security: Apache Spark & Apache Zeppelin
DataWorks Summit/Hadoop Summit
 
Unleashing the Power of Apache Atlas with Apache Ranger
DataWorks Summit/Hadoop Summit
 
Enabling Digital Diagnostics with a Data Science Platform
DataWorks Summit/Hadoop Summit
 
Revolutionize Text Mining with Spark and Zeppelin
DataWorks Summit/Hadoop Summit
 
Double Your Hadoop Performance with Hortonworks SmartSense
DataWorks Summit/Hadoop Summit
 
Hadoop Crash Course
DataWorks Summit/Hadoop Summit
 
Data Science Crash Course
DataWorks Summit/Hadoop Summit
 
Apache Spark Crash Course
DataWorks Summit/Hadoop Summit
 
Dataflow with Apache NiFi
DataWorks Summit/Hadoop Summit
 
Schema Registry - Set you Data Free
DataWorks Summit/Hadoop Summit
 
Building a Large-Scale, Adaptive Recommendation Engine with Apache Flink and ...
DataWorks Summit/Hadoop Summit
 
Real-Time Anomaly Detection using LSTM Auto-Encoders with Deep Learning4J on ...
DataWorks Summit/Hadoop Summit
 
How Hadoop Makes the Natixis Pack More Efficient
DataWorks Summit/Hadoop Summit
 
HBase in Practice
DataWorks Summit/Hadoop Summit
 
The Challenge of Driving Business Value from the Analytics of Things (AOT)
DataWorks Summit/Hadoop Summit
 
Breaking the 1 Million OPS/SEC Barrier in HOPS Hadoop
DataWorks Summit/Hadoop Summit
 
From Regulatory Process Verification to Predictive Maintenance and Beyond wit...
DataWorks Summit/Hadoop Summit
 
Backup and Disaster Recovery in Hadoop
DataWorks Summit/Hadoop Summit
 
Scaling HDFS to Manage Billions of Files with Distributed Storage Schemes
DataWorks Summit/Hadoop Summit
 
Ad

Recently uploaded (20)

PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PDF
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
PDF
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
PDF
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
PDF
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
PDF
SFWelly Summer 25 Release Highlights July 2025
Anna Loughnan Colquhoun
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PPTX
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PPTX
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
PDF
Python basic programing language for automation
DanialHabibi2
 
PDF
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
PDF
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
PDF
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
PDF
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
PDF
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
PPTX
MSP360 Backup Scheduling and Retention Best Practices.pptx
MSP360
 
PDF
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
CIFDAQ Weekly Market Wrap for 11th July 2025
CIFDAQ
 
Achieving Consistent and Reliable AI Code Generation - Medusa AI
medusaaico
 
Smart Trailers 2025 Update with History and Overview
Paul Menig
 
Reverse Engineering of Security Products: Developing an Advanced Microsoft De...
nwbxhhcyjv
 
SFWelly Summer 25 Release Highlights July 2025
Anna Loughnan Colquhoun
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
CIFDAQ Token Spotlight for 9th July 2025
CIFDAQ
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
✨Unleashing Collaboration: Salesforce Channels & Community Power in Patna!✨
SanjeetMishra29
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
Webinar: Introduction to LF Energy EVerest
DanBrown980551
 
Python basic programing language for automation
DanialHabibi2
 
NewMind AI - Journal 100 Insights After The 100th Issue
NewMind AI
 
Agentic AI lifecycle for Enterprise Hyper-Automation
Debmalya Biswas
 
How Startups Are Growing Faster with App Developers in Australia.pdf
India App Developer
 
[Newgen] NewgenONE Marvin Brochure 1.pdf
darshakparmar
 
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
MSP360 Backup Scheduling and Retention Best Practices.pptx
MSP360
 
Log-Based Anomaly Detection: Enhancing System Reliability with Machine Learning
Mohammed BEKKOUCHE
 

Tracing your security telemetry with Apache Metron

  • 1. Tracing Your Security Telemetry With Apache Metron Justin Leet Systems Architect June 29, 2016
  • 2. 2 © Hortonworks Inc. 2011 – 2016. All Rights Reserved What is Apache Metron?
  • 3. 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved What Apache Metron Does? “Apache Metron provides a scalable advanced security analytics framework built with the Hadoop Community evolving from the Cisco OpenSOC Project. A cyber security application framework that provides organizations the ability to detect cyber anomalies and enable organizations to rapidly respond to identified anomalies.”
  • 4. 4 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Apache Metron Timeline Sep 2014 •OpenSOC Beta June 2015 •OpenSOC Community Edition Dec 2015 •Metron enters Apache Incubator April 2016 •Apache Metron 0.1 Now •Working towards 0.2 release
  • 5. 5 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Who is Metron for?
  • 6. 6 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Core Capabilities
  • 7. 7 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Architecture
  • 8. 8 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Streaming Parsing and Enrichment
  • 9. 9 © Hortonworks Inc. 2011 – 2016. All Rights Reserved  Metron’s parsing bolt can be configured two ways – And outputs JSON  Grok Parser – Less work to implement – Regex-like syntax – Good for lower volumes of data  Java Parser – More work to implement – Good for higher volumes of data Parsing
  • 10. 10 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Enrichment / Threat Intel
  • 11. 11 © Hortonworks Inc. 2011 – 2016. All Rights Reserved  Add additional information to raw source during streaming  Adding it during streaming allows ML models to score in real time instead of batch  Primarily stored in HBase  Several enrichments – GeoIP – Host – Threat Intelligence Enrichment
  • 12. 12 © Hortonworks Inc. 2011 – 2016. All Rights Reserved  Occurs in the same Storm topology as enrichment  Very similar process and flow  Use a threat feed aggregator! – Soltra adapter is provided to read feed and stream into HBase – Flat File loader and Stix bulk loader available without threat feed aggregator Threat Intel
  • 13. 13 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Field Description ip_src_addr Octet source IP ip_dest_addr Octet destination IP ip_src_port Integer source port ip_dest_port Integer destination port protocol String protocol (e.g. TCP) timestamp Sensor epoch timestamp source.type yaf, snort, etc. start_time Metron epoch timestamp end_time Metron epoch timestamp Metron JSON
  • 14. 14 © Hortonworks Inc. 2011 – 2016. All Rights Reserved  Standalone Storm topology  Reads from Kafka  Writes packets to HDFS  Kibana panel forwards request to REST PCAP service – MR Job launched – Delivers results back to Kibana PCAP
  • 15. 15 © Hortonworks Inc. 2011 – 2016. All Rights Reserved PCAP
  • 16. 16 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Tracing a Source Through Metron
  • 17. 17 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Sensor to Parser
  • 18. 18 © Hortonworks Inc. 2011 – 2016. All Rights Reserved  Caching proxy – Mostly useful as a source of easy to get and easily readable logs Squid 1467125585.752 5288 127.0.0.1 TCP_MISS/200 32250 GET https://news.ycombinator.com/ - DIRECT/104.20.43.44 text/html Time Elapsed Remote Host Code/Statu s Bytes Metho d URL rfc931 Peer Status/ Peer Host Type 1467125585.752 5288 127.0.0.1 TCP_MISS/2 00 32250 GET https://news.ycombinator.com/ - DIRECT/104.20.43.44 text/html
  • 19. 19 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Squid - Grok Time Elapsed Remote Host Code/Statu s Bytes Metho d URL rfc931 Peer Status/ Peer Host Type 1467125585.752 5288 127.0.0.1 TCP_MISS/2 00 32250 GET https://news.ycombinator.com/ - DIRECT/104.20.43.44 text/html SQUID_DELIMITED %{NUMBER:timestamp}%{SPACE:UNWANTED} %{INT:elapsed}%{SPACE:UNWANTED}%{IPV4:ip_src_addr} %{WORD:action}/%{NUMBER:code} %{NUMBER:bytes} %{WORD:method} %{NOTSPACE:url} - %{WORD:UNWANTED}/%{IPV4:ip_dst_addr} %{WORD:UNWANTED}/%{WORD:UNWANTED}
  • 20. 20 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Squid – Topology Definition { "parserClassName": "org.apache.metron.parsers.GrokParser", "sensorTopic": "squid", "pars erConfig": { "grokPath": "/apps/metron/patterns/squid", "patternLabel": "SQUID_DELIMITED", "tim estampField": "timestamp" }, "fieldTransformations" : [ { "transformation" : "MTL" ,"output" : [ "full_hostname", "domain_without_subdomains" ] ,"config" : { "full_hostname" : "URL_TO_HOST(url)" ,"domain_without_subdomains" : "DOMAIN_REMOVE_SUBDOMAINS(full_hostname)" } } ] }
  • 21. 21 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Squid – Topology Result
  • 22. 22 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Enrichment Topology
  • 23. 23 © Hortonworks Inc. 2011 – 2016. All Rights Reserved  Loading some WHOIS derived data. – Not directly making WHOIS query, just using a CSV containing a few rows of data. Squid – Enrichment Definition { "zkQuorum" : ”localhost:2181" ,"sensorToFieldList" : { "squid" : { "type" : "ENRICHMENT" ,"fieldToEnrichmentTypes" : { "domain_without_subdomains" : [ "whois" ] } } } }
  • 24. 24 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Squid – Enrichment Result
  • 25. 25 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Enrichment Topology
  • 26. 26 © Hortonworks Inc. 2011 – 2016. All Rights Reserved  Loading a list of malicious domains – ZeuS tracker Squid – Enrichment Definition { "zkQuorum": "localhost:2181", "sensorToFieldList": { "squid": { "type": "THREAT_INTEL", "fieldToEnrichmentTypes": { "url": ["zeusList”] } } } }
  • 27. 27 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Squid – Threat Intel Result
  • 28. 28 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Questions? Justin Leet Systems Architect [email protected] [email protected]