SlideShare a Scribd company logo
Using the 

AWS Elasticsearch Service 

to provide
fast feedback 

for your business
Wellington AWS User Group
December 7, 2017
steven@claritycloudworks.com
+64 27 620 1237
claritycloudworks.com
Steven Ensslen
IT knows the value 

of fast feedback
The Second Way 

of DevOps is 

“shorten and amplify
feedback loops”.
https://itrevolution.com/the-three-ways-principles-underpinning-devops/
Feedback tools in software development
1. IDE feedback
2. Unit tests
3. Continuous Integration
4. SecOps tests in build pipeline
5. Application Performance Monitoring
IT has sub-second feedback
Most business leaders have the
same feedback cycle as they did
50 years ago.
“War is the realm of uncertainty;
three quarters of the factors on
which action in war is based are
wrapped in a fog of greater or
lesser uncertainty. A sensitive and
discriminating judgment is called
for; a skilled intelligence to scent
out the truth.”
–Carl von Clausewitz

19th Century Prussian General
The fog of business
Information is late, disconnected, and vague.
Totals Hide
Information
If your analysis uses fixed-
length periods you will miss
trends.
Weekly hours
Monday Tuesday Wednesday Thursday Friday Saturday Sunday Total
Constant 8 8 8 8 8 0 0 40
Spiky 2 5 3 12 11 5 2 40
Constant Spiky
Arbitrary Boundaries Hide Trends
0
20
40
60
80
2017-9-1 2017-9-10 2017-9-19 2017-9-28 2017-10-7 2017-10-16 2017-10-25
September Total
= 1865
October Total
= 1880
November 

<< 

October
“If it moves, graph it. 

If it doesn’t move, 

graph it anyway, 

just in case it does.”
–Etsy
Elasticsearch
1. Makes fast feedback easy,

both for IT and business people
2. Makes awesome graphs
3. Is super fast and massively
scalable
What is ELK?
Elasticsearch is a RESTful API and clustering software over
Apache Lucene, which is a document database optimised for
search.
Logstash is a data ingestion tool. It transforms and ships data
across networks. Beats are a lighter, less-capable agent for
Elasticsearch.
Kibana is a powerful ad hoc query tool that quickly creates
beautiful graphs.
AWS Elasticsearch Service
Demonstration
Business Intelligence Tips
1. Work closely with a champion customer
2. Start small, both in scope and audience
3. Reuse the language and labels of your audience
4. Reuse the time periods that are already part of your processes 

(i.e. financial quarters)
5. Transform data and index the things that your audience think about,
like sessions, products, and orders, especially if your raw data
doesn’t quite map to them
Test Driven Design
1. Use Kinesis Firehose to save all of your production stream to S3, 

then apply lifecycle policies
2. At the very beginning, play a static, fake data set. Replay feature of the
Logstash sleep plugin. Do not develop or test with a random generator!
3. Whenever you encounter undesirable behaviour, 

add the recording segment to your test suite.
4. Test Elasticsearch with xUnit in your code pipleline
5. Monitor Kibana and Elasticsearch with your APM
Elasticsearch tips
1. Predefine your index mapping
2. Only use one type per index (ES6.x removes support for many types
per index)
3. Partition your index by time, typically by day
4. There are no joins, use Lambda to enrich data before loading it into
Elasticsearch
5. Ideally an ES cluster has 3 small masters and < 10 workers, 

above 10 nodes scale-up before scale-out
AWS ES tips
1. The Elasticsearch port is 80, not 9200
2. Do NOT expose ES or Kibana to the public internet!
3. Start bigger, then shrink (IMHO, seven M4.large is
big)
4. Do not use ES as a data store; use RDS, or
DynamoDB, or Redshift, or S3 with Athena
Cloudwatch vs ES
1. Only fixed thresholds for alerts
2. Much easier to use
3. Much less admin
4. Scales elastically
Kinesis Analytics vs ES
1. Simpler for detection
2. Elastic scaling
3. No graphs
4. MillisBehindLatest can be minutes!
Athena & Quicksight vs ES
1. Massive, admin-free scaling
2. Need to add Lambda, even then runs
periodically not event driven
3. Worse latency
4. Conceivably could be more expensive 

(1440 scheduled queries * ?)
Photo: Micheal Filion, https://www.flickr.com/photos/mike9alive/
Situational Awareness
100% uptime on the GPS of this car isn’t going to help anything
Clarity Cloudworks
illuminating issues before they become problems

More Related Content

What's hot (17)

PDF
Reinventing enterprise defense with the Elastic Stack
Elasticsearch
 
PDF
Construção de uma plataforma de observabilidade centralizada
Elasticsearch
 
PDF
O monitoramento da infraestrutura facilitado, da ingestão ao insight
Elasticsearch
 
PPTX
Women Who Code
Everett Toews
 
PDF
Combining logs, metrics, and traces for unified observability
Elasticsearch
 
PPTX
Azure databricks ml
Vishwas N
 
PDF
Reinventing enterprise defense with the Elastic Stack
Elasticsearch
 
PDF
Combining Logs, Metrics, and Traces for Unified Observability
Elasticsearch
 
PPTX
Amazon Athena now supports new Lake Formation fine-grained security and relia...
Dhaval Soni
 
PPT
AWS Terms
LadderMinds
 
PDF
Overview of Blue Medora - New Relic Plugin for Cisco UCS
Blue Medora
 
PDF
Elastic Cloud and FedRAMP
Elasticsearch
 
PPT
Cloud storage providers comparative study
Dayanand Shanmugham
 
PPTX
Amazon s3 adds new s3 event notifications for s3 lifecycle, s3 intelligent ti...
Dhaval Soni
 
PDF
Keynote
Elasticsearch
 
PDF
Meetup ilm virtual emea
Daliya Spasova
 
PDF
Amazon cloud service
Suresh Mandava
 
Reinventing enterprise defense with the Elastic Stack
Elasticsearch
 
Construção de uma plataforma de observabilidade centralizada
Elasticsearch
 
O monitoramento da infraestrutura facilitado, da ingestão ao insight
Elasticsearch
 
Women Who Code
Everett Toews
 
Combining logs, metrics, and traces for unified observability
Elasticsearch
 
Azure databricks ml
Vishwas N
 
Reinventing enterprise defense with the Elastic Stack
Elasticsearch
 
Combining Logs, Metrics, and Traces for Unified Observability
Elasticsearch
 
Amazon Athena now supports new Lake Formation fine-grained security and relia...
Dhaval Soni
 
AWS Terms
LadderMinds
 
Overview of Blue Medora - New Relic Plugin for Cisco UCS
Blue Medora
 
Elastic Cloud and FedRAMP
Elasticsearch
 
Cloud storage providers comparative study
Dayanand Shanmugham
 
Amazon s3 adds new s3 event notifications for s3 lifecycle, s3 intelligent ti...
Dhaval Soni
 
Keynote
Elasticsearch
 
Meetup ilm virtual emea
Daliya Spasova
 
Amazon cloud service
Suresh Mandava
 

Similar to Using AWS Elasticsearch for fast feedback on business data (20)

PDF
Growing with elastic search
Devi A S L
 
PDF
Elasticsearch
Shagun Rathore
 
PDF
Log Analytics with AWS
AWS Germany
 
PDF
Explore Elasticsearch and Why It’s Worth Using
Inexture Solutions
 
PDF
Elastic search from the trenches
Vinícius Carvalho
 
PDF
Keynote: Making search better, faster, easier
Elasticsearch
 
PDF
Elasticsearch Introduction at BigData meetup
Eric Rodriguez (Hiring in Lex)
 
PDF
Architecture at Scale
Elasticsearch
 
PDF
Elastic Enterprise Search keynote
Elasticsearch
 
PPTX
Elastic Search Capability Presentation.pptx
Knoldus Inc.
 
PPTX
ELK Solutions Enablement Session - 17th March'2020
Ashnikbiz
 
PDF
[Virtual Meetup] Using Elasticsearch as a Time-Series Database in the Endpoin...
Anna Ossowski
 
PDF
Elasticsearch speed is key
Enterprise Search Warsaw Meetup
 
PDF
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
ALTER WAY
 
PDF
Using Elasticsearch for Analytics
Vaidik Kapoor
 
PDF
Roaring with elastic search sangam2018
Vinay Kumar
 
PPTX
GOTO Aarhus 2014: Making Enterprise Data Available in Real Time with elastics...
Yann Cluchey
 
PPTX
ElasticSearch Meetup 30 - 10 - 2014
Alberto Paro
 
PDF
Elastic.co's ELK Stack - Platform Agnostic Immutable Infrastructure & Analys...
AWS Chicago
 
PDF
Te damos la bienvenida a una nueva forma de realizar búsquedas
Imma Valls Bernaus
 
Growing with elastic search
Devi A S L
 
Elasticsearch
Shagun Rathore
 
Log Analytics with AWS
AWS Germany
 
Explore Elasticsearch and Why It’s Worth Using
Inexture Solutions
 
Elastic search from the trenches
Vinícius Carvalho
 
Keynote: Making search better, faster, easier
Elasticsearch
 
Elasticsearch Introduction at BigData meetup
Eric Rodriguez (Hiring in Lex)
 
Architecture at Scale
Elasticsearch
 
Elastic Enterprise Search keynote
Elasticsearch
 
Elastic Search Capability Presentation.pptx
Knoldus Inc.
 
ELK Solutions Enablement Session - 17th March'2020
Ashnikbiz
 
[Virtual Meetup] Using Elasticsearch as a Time-Series Database in the Endpoin...
Anna Ossowski
 
Elasticsearch speed is key
Enterprise Search Warsaw Meetup
 
Séminaire Big Data Alter Way - Elasticsearch - octobre 2014
ALTER WAY
 
Using Elasticsearch for Analytics
Vaidik Kapoor
 
Roaring with elastic search sangam2018
Vinay Kumar
 
GOTO Aarhus 2014: Making Enterprise Data Available in Real Time with elastics...
Yann Cluchey
 
ElasticSearch Meetup 30 - 10 - 2014
Alberto Paro
 
Elastic.co's ELK Stack - Platform Agnostic Immutable Infrastructure & Analys...
AWS Chicago
 
Te damos la bienvenida a una nueva forma de realizar búsquedas
Imma Valls Bernaus
 
Ad

Recently uploaded (20)

PPTX
一比一原版(LaTech毕业证)路易斯安那理工大学毕业证如何办理
Taqyea
 
PPT
Computer Securityyyyyyyy - Chapter 2.ppt
SolomonSB
 
PPTX
英国假毕业证诺森比亚大学成绩单GPA修改UNN学生卡网上可查学历成绩单
Taqyea
 
PPTX
L1A Season 1 ENGLISH made by A hegy fixed
toszolder91
 
PPTX
Orchestrating things in Angular application
Peter Abraham
 
PDF
Azure_DevOps introduction for CI/CD and Agile
henrymails
 
PPTX
ONLINE BIRTH CERTIFICATE APPLICATION SYSYTEM PPT.pptx
ShyamasreeDutta
 
PPTX
sajflsajfljsdfljslfjslfsdfas;fdsfksadfjlsdflkjslgfs;lfjlsajfl;sajfasfd.pptx
theknightme
 
PDF
𝐁𝐔𝐊𝐓𝐈 𝐊𝐄𝐌𝐄𝐍𝐀𝐍𝐆𝐀𝐍 𝐊𝐈𝐏𝐄𝐑𝟒𝐃 𝐇𝐀𝐑𝐈 𝐈𝐍𝐈 𝟐𝟎𝟐𝟓
hokimamad0
 
PPTX
PE introd.pptxfrgfgfdgfdgfgrtretrt44t444
nepmithibai2024
 
PDF
The-Hidden-Dangers-of-Skipping-Penetration-Testing.pdf.pdf
naksh4thra
 
PPTX
原版西班牙莱昂大学毕业证(León毕业证书)如何办理
Taqyea
 
PPT
introduction to networking with basics coverage
RamananMuthukrishnan
 
PDF
Apple_Environmental_Progress_Report_2025.pdf
yiukwong
 
PPTX
PM200.pptxghjgfhjghjghjghjghjghjghjghjghjghj
breadpaan921
 
PPTX
Optimization_Techniques_ML_Presentation.pptx
farispalayi
 
PPT
introductio to computers by arthur janry
RamananMuthukrishnan
 
PPTX
法国巴黎第二大学本科毕业证{Paris 2学费发票Paris 2成绩单}办理方法
Taqyea
 
PPTX
internet básico presentacion es una red global
70965857
 
PPTX
04 Output 1 Instruments & Tools (3).pptx
GEDYIONGebre
 
一比一原版(LaTech毕业证)路易斯安那理工大学毕业证如何办理
Taqyea
 
Computer Securityyyyyyyy - Chapter 2.ppt
SolomonSB
 
英国假毕业证诺森比亚大学成绩单GPA修改UNN学生卡网上可查学历成绩单
Taqyea
 
L1A Season 1 ENGLISH made by A hegy fixed
toszolder91
 
Orchestrating things in Angular application
Peter Abraham
 
Azure_DevOps introduction for CI/CD and Agile
henrymails
 
ONLINE BIRTH CERTIFICATE APPLICATION SYSYTEM PPT.pptx
ShyamasreeDutta
 
sajflsajfljsdfljslfjslfsdfas;fdsfksadfjlsdflkjslgfs;lfjlsajfl;sajfasfd.pptx
theknightme
 
𝐁𝐔𝐊𝐓𝐈 𝐊𝐄𝐌𝐄𝐍𝐀𝐍𝐆𝐀𝐍 𝐊𝐈𝐏𝐄𝐑𝟒𝐃 𝐇𝐀𝐑𝐈 𝐈𝐍𝐈 𝟐𝟎𝟐𝟓
hokimamad0
 
PE introd.pptxfrgfgfdgfdgfgrtretrt44t444
nepmithibai2024
 
The-Hidden-Dangers-of-Skipping-Penetration-Testing.pdf.pdf
naksh4thra
 
原版西班牙莱昂大学毕业证(León毕业证书)如何办理
Taqyea
 
introduction to networking with basics coverage
RamananMuthukrishnan
 
Apple_Environmental_Progress_Report_2025.pdf
yiukwong
 
PM200.pptxghjgfhjghjghjghjghjghjghjghjghjghj
breadpaan921
 
Optimization_Techniques_ML_Presentation.pptx
farispalayi
 
introductio to computers by arthur janry
RamananMuthukrishnan
 
法国巴黎第二大学本科毕业证{Paris 2学费发票Paris 2成绩单}办理方法
Taqyea
 
internet básico presentacion es una red global
70965857
 
04 Output 1 Instruments & Tools (3).pptx
GEDYIONGebre
 
Ad

Using AWS Elasticsearch for fast feedback on business data

  • 1. Using the 
 AWS Elasticsearch Service 
 to provide fast feedback 
 for your business Wellington AWS User Group December 7, 2017
  • 2. [email protected] +64 27 620 1237 claritycloudworks.com Steven Ensslen
  • 3. IT knows the value 
 of fast feedback The Second Way 
 of DevOps is 
 “shorten and amplify feedback loops”. https://itrevolution.com/the-three-ways-principles-underpinning-devops/
  • 4. Feedback tools in software development 1. IDE feedback 2. Unit tests 3. Continuous Integration 4. SecOps tests in build pipeline 5. Application Performance Monitoring
  • 5. IT has sub-second feedback Most business leaders have the same feedback cycle as they did 50 years ago.
  • 6. “War is the realm of uncertainty; three quarters of the factors on which action in war is based are wrapped in a fog of greater or lesser uncertainty. A sensitive and discriminating judgment is called for; a skilled intelligence to scent out the truth.” –Carl von Clausewitz
 19th Century Prussian General
  • 7. The fog of business Information is late, disconnected, and vague.
  • 8. Totals Hide Information If your analysis uses fixed- length periods you will miss trends. Weekly hours Monday Tuesday Wednesday Thursday Friday Saturday Sunday Total Constant 8 8 8 8 8 0 0 40 Spiky 2 5 3 12 11 5 2 40 Constant Spiky
  • 9. Arbitrary Boundaries Hide Trends 0 20 40 60 80 2017-9-1 2017-9-10 2017-9-19 2017-9-28 2017-10-7 2017-10-16 2017-10-25 September Total = 1865 October Total = 1880 November 
 << 
 October
  • 10. “If it moves, graph it. 
 If it doesn’t move, 
 graph it anyway, 
 just in case it does.” –Etsy
  • 11. Elasticsearch 1. Makes fast feedback easy,
 both for IT and business people 2. Makes awesome graphs 3. Is super fast and massively scalable
  • 12. What is ELK? Elasticsearch is a RESTful API and clustering software over Apache Lucene, which is a document database optimised for search. Logstash is a data ingestion tool. It transforms and ships data across networks. Beats are a lighter, less-capable agent for Elasticsearch. Kibana is a powerful ad hoc query tool that quickly creates beautiful graphs.
  • 15. Business Intelligence Tips 1. Work closely with a champion customer 2. Start small, both in scope and audience 3. Reuse the language and labels of your audience 4. Reuse the time periods that are already part of your processes 
 (i.e. financial quarters) 5. Transform data and index the things that your audience think about, like sessions, products, and orders, especially if your raw data doesn’t quite map to them
  • 16. Test Driven Design 1. Use Kinesis Firehose to save all of your production stream to S3, 
 then apply lifecycle policies 2. At the very beginning, play a static, fake data set. Replay feature of the Logstash sleep plugin. Do not develop or test with a random generator! 3. Whenever you encounter undesirable behaviour, 
 add the recording segment to your test suite. 4. Test Elasticsearch with xUnit in your code pipleline 5. Monitor Kibana and Elasticsearch with your APM
  • 17. Elasticsearch tips 1. Predefine your index mapping 2. Only use one type per index (ES6.x removes support for many types per index) 3. Partition your index by time, typically by day 4. There are no joins, use Lambda to enrich data before loading it into Elasticsearch 5. Ideally an ES cluster has 3 small masters and < 10 workers, 
 above 10 nodes scale-up before scale-out
  • 18. AWS ES tips 1. The Elasticsearch port is 80, not 9200 2. Do NOT expose ES or Kibana to the public internet! 3. Start bigger, then shrink (IMHO, seven M4.large is big) 4. Do not use ES as a data store; use RDS, or DynamoDB, or Redshift, or S3 with Athena
  • 19. Cloudwatch vs ES 1. Only fixed thresholds for alerts 2. Much easier to use 3. Much less admin 4. Scales elastically
  • 20. Kinesis Analytics vs ES 1. Simpler for detection 2. Elastic scaling 3. No graphs 4. MillisBehindLatest can be minutes!
  • 21. Athena & Quicksight vs ES 1. Massive, admin-free scaling 2. Need to add Lambda, even then runs periodically not event driven 3. Worse latency 4. Conceivably could be more expensive 
 (1440 scheduled queries * ?)
  • 22. Photo: Micheal Filion, https://www.flickr.com/photos/mike9alive/ Situational Awareness 100% uptime on the GPS of this car isn’t going to help anything
  • 23. Clarity Cloudworks illuminating issues before they become problems