SlideShare a Scribd company logo
Moving Quickly With Data Services In The Cloud
Matt Dimich
Cloud Architect
Thomson Reuters
@JobsWithUs
#WorkingAtTR
#HappyAtTR
What do you mean by Data Services?
Technologies
• Aurora MySQL
• Amazon S3
• Elastic Search
• Aurora PostgreSQL
• DMS/SCT
• DynamoDB
• ElastiCache
• Big Data
Areas of Concern
• Automate Creation/Provisioning
• Automate “Schema” Changes
• Data Security – Network
• Data Security – Access
• Data Security – Encryption
• High Availability
• Cost Monitoring & Reporting
• Logging, Monitoring & Alerting
• Connection Management
• Backup & Restore
• Scale & Limits
• Retry Logic, Error Handling &
Transactional Integrity
• Disaster Recovery
• Audit Infrastructure
• Internationalization
• Data Consistency
• Maintenance
• Support Team
Data Services Before Cloud…
• Provision huge, costly database servers
• Elastic Search would provision monster elastic search nodes so they could scale
• Lots of human hands would touch each change to the database
• Leads to high cost for experiments if you need a database
• Large effort to figure out the storage and compute appropriately
• (and a big time penalty if you get it wrong)
• Focused on High Availability
What’s changing…
• Provisioning takes minutes
• Scaling is easier than ever
• Start ups suddenly have the power to get an app out with relatively low cost and high scale
• Focus has shifted from high availability to time to market.
Relational Databases
Automated Provisioning
•How does it work?
• AWS CloudFormation creates our database
clusters for us.
• Consistent
• Repeatable
• Embedded Standards (i.e. default to encryption
at rest)
http://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/cfn-whatis-howdoesitwork.html
AWS CloudFormation can update our database clusters as well
Warning! Mindset Change Required!
•Don’t create it until you need it
•Turn off on weekends and nights as possible
•Start small and scale up later
Automated Schema Change Pipeline
• Reduce human error
• Repeatable
• Consistent
• Same tooling as App Deployment
• Lower time to deployment
• Build in safeguards
Multiple Iterations
1. Flyway with Jenkins on Prem
2. Flyway with Spinnaker and Jenkins
in AWS
3. Flyway with AWS Code Pipeline in
AWS
DB Change Pipeline with AWS Code Services
Object Storage
S3
• Simple Storage Service
• Object Storage
• Allows hierarchy structure
• Has powerful lifecycle rules to expire items or
move to items to cheaper storage classes etc.
HTTP
AWS Command Line
AWS Web Console
Simple Right?
It turns out this simple storage service is actually quite complex to set up…correctly.
How are other companies doing at this?
Moving Quickly with Data Services in the Cloud
Moving Quickly with Data Services in the Cloud
Moving Quickly with Data Services in the Cloud
Moving Quickly with Data Services in the Cloud
Moving Quickly with Data Services in the Cloud
Moving Quickly with Data Services in the Cloud
Moving Quickly with Data Services in the Cloud
Moving Quickly with Data Services in the Cloud
But those are already weeks and months old…
Moving Quickly with Data Services in the Cloud
Moving Quickly with Data Services in the Cloud
Why is this so hard?
• How do you lock down a bucket?
• IAM Policy
• AWS-managed policy attached to a user, group or role
• Self-managed policy attached to a user, group or role
• Inline policy for a user, group, or role
• Bucket Policy
• Bucket ACL
• Object ACL
Oh, and one more thing…
How to avoid the front page of the news
• Spend time creating secure CloudFormation then use that everywhere!
• At TR we often use predefined bucket types with built in standards/safeguards
• Resources – For your microservice and only your microservice
• Infrastructure – What it sounds like, stuff outside the app.
• Website – For public facing static content
AWS Responds
Introducing Amazon Macie, Launched August 14, 2017
• “AI/Machine Learning to look for Sensitive Data”
AWS – “We take care of the undifferentiated heavy lifting”
Building Blocks…
How do I restore to a point-in-time with S3?
• S3 has versioning for each object
• Everything is available on the API
• Enter the PIT restore tool
• Dry run available
• Estimate size
• Loops through a bucket, path or just one file and
reverts it to the version that was current during the
entered point-in-time
Elastic Search
Elastic Search
• It’s not as easy as saying AWS every time
• Platform Elastic Search has automated their Elastic Search distribution on AWS
and Azure.
• Gives us custom plugins and encryption at rest
• A step toward a managed service
But I Have Experience Running This
• Same ES distribution as our Data Centers
• Different hardware profile
• The team started smaller
• But not small enough
• Force our applications to ”tip over” the
resources before upgrading
Moving Quickly with Data Services in the Cloud
@JobsWithUs
#WorkingAtTR
#HappyAtTR
In Summary
1. Use Infrastructure as code
2. Craft your code securely and carefully
3. Re-use it everywhere
So you can…
1. Move Quickly
2. Stay off the front page of the news

More Related Content

What's hot (14)

PDF
Dos and don'ts in AWS
Marek Piątek
 
PPTX
Azure functions serverless
Udaiappa Ramachandran
 
PDF
ELB를 활용한 Socket.IO 멀티노드 구축사례
Anson Park
 
ODP
Scaling AWS With Scalr
Ram Viswanadha
 
KEY
Jclouds Intro
guesta31f61
 
PPTX
Azure vidyapeeth -Introduction to Azure Container Service & Registry Service
Ilyas F ☁☁☁
 
PPTX
Private cloud cloud-phoenix-april-2014
Miguel Zuniga
 
PDF
Serverless Architecture on AWS
Rajind Ruparathna
 
PPTX
Azure functions
EducationTamil
 
PDF
Architecture Battle: PaaS vs. FaaS
Estelle Auberix
 
PPTX
Azure PaaS (WebApp & SQL Database) workshop solution
Gelis Wu
 
PDF
How to copy multiple files from local to aws s3 bucket using aws cli
Katy Slemon
 
PPTX
Azure functions
vivek p s
 
PPTX
Azure Web Apps Advanced Security
Udaiappa Ramachandran
 
Dos and don'ts in AWS
Marek Piątek
 
Azure functions serverless
Udaiappa Ramachandran
 
ELB를 활용한 Socket.IO 멀티노드 구축사례
Anson Park
 
Scaling AWS With Scalr
Ram Viswanadha
 
Jclouds Intro
guesta31f61
 
Azure vidyapeeth -Introduction to Azure Container Service & Registry Service
Ilyas F ☁☁☁
 
Private cloud cloud-phoenix-april-2014
Miguel Zuniga
 
Serverless Architecture on AWS
Rajind Ruparathna
 
Azure functions
EducationTamil
 
Architecture Battle: PaaS vs. FaaS
Estelle Auberix
 
Azure PaaS (WebApp & SQL Database) workshop solution
Gelis Wu
 
How to copy multiple files from local to aws s3 bucket using aws cli
Katy Slemon
 
Azure functions
vivek p s
 
Azure Web Apps Advanced Security
Udaiappa Ramachandran
 

Similar to Moving Quickly with Data Services in the Cloud (20)

PDF
AWS architect certification course
wiTTyMinds1
 
PPTX
amazon web servics in the cloud aws and its categories compute cloud and stor...
soundharya59
 
PPTX
Aws Solution Architecture Associate - summary
onoffshake
 
PPTX
Migrating enterprise workloads to AWS
Tom Laszewski
 
PPTX
Architecting Cloud Apps
jineshvaria
 
PDF
Introduction to Amazon Web Services
Dayanand Shanmugham
 
PDF
AWS Summit Seoul 2015 - AWS 최신 서비스 살펴보기 - Aurora, Lambda, EFS, Machine Learn...
Amazon Web Services Korea
 
PPTX
CuriousMinds and Siemens in Brasov 2015 - Building and Developing for the Clo...
Vadim Zendejas
 
PPTX
CC-9b AWS Infra as a service part 3 (AWS).pptx
AdeelAsghar36
 
PDF
Building a Bigdata Architecture on AWS
Arun Sirimalla
 
PPTX
Architecting for AWS Cloud - let's do it right!
Misha Hanin
 
PPTX
AWS 101
Monica Rut Avellino
 
PPTX
Jump Start to Amazon Web Services
Gagan Sikri
 
PPTX
Pitt Immersion Day Module 4 - storage in AWS
EagleDream Technologies
 
PPTX
ABCs of AWS: S3
Mark Cohen
 
PPTX
Case Study Amazon AWS
VijaySingh790398
 
PPTX
cse40822-amazon.pptx
prathamgunj
 
PPT
Cloud Computing Workshop
Charlie Moad
 
PDF
AWS Summit - Atlanta
Sandy Carter
 
PPTX
AWS Amazon S3 Mastery Bootcamp
Matt Bohn
 
AWS architect certification course
wiTTyMinds1
 
amazon web servics in the cloud aws and its categories compute cloud and stor...
soundharya59
 
Aws Solution Architecture Associate - summary
onoffshake
 
Migrating enterprise workloads to AWS
Tom Laszewski
 
Architecting Cloud Apps
jineshvaria
 
Introduction to Amazon Web Services
Dayanand Shanmugham
 
AWS Summit Seoul 2015 - AWS 최신 서비스 살펴보기 - Aurora, Lambda, EFS, Machine Learn...
Amazon Web Services Korea
 
CuriousMinds and Siemens in Brasov 2015 - Building and Developing for the Clo...
Vadim Zendejas
 
CC-9b AWS Infra as a service part 3 (AWS).pptx
AdeelAsghar36
 
Building a Bigdata Architecture on AWS
Arun Sirimalla
 
Architecting for AWS Cloud - let's do it right!
Misha Hanin
 
Jump Start to Amazon Web Services
Gagan Sikri
 
Pitt Immersion Day Module 4 - storage in AWS
EagleDream Technologies
 
ABCs of AWS: S3
Mark Cohen
 
Case Study Amazon AWS
VijaySingh790398
 
cse40822-amazon.pptx
prathamgunj
 
Cloud Computing Workshop
Charlie Moad
 
AWS Summit - Atlanta
Sandy Carter
 
AWS Amazon S3 Mastery Bootcamp
Matt Bohn
 
Ad

Recently uploaded (20)

PPTX
Agentic Automation: Build & Deploy Your First UiPath Agent
klpathrudu
 
PPTX
ChiSquare Procedure in IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
PPTX
Home Care Tools: Benefits, features and more
Third Rock Techkno
 
PDF
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pdf
Varsha Nayak
 
PPTX
In From the Cold: Open Source as Part of Mainstream Software Asset Management
Shane Coughlan
 
PPTX
Empowering Asian Contributions: The Rise of Regional User Groups in Open Sour...
Shane Coughlan
 
PDF
AI + DevOps = Smart Automation with devseccops.ai.pdf
Devseccops.ai
 
PDF
Open Chain Q2 Steering Committee Meeting - 2025-06-25
Shane Coughlan
 
PDF
Empower Your Tech Vision- Why Businesses Prefer to Hire Remote Developers fro...
logixshapers59
 
PDF
유니티에서 Burst Compiler+ThreadedJobs+SIMD 적용사례
Seongdae Kim
 
PDF
SAP Firmaya İade ABAB Kodları - ABAB ile yazılmıl hazır kod örneği
Salih Küçük
 
PPTX
Tally_Basic_Operations_Presentation.pptx
AditiBansal54083
 
PPTX
Foundations of Marketo Engage - Powering Campaigns with Marketo Personalization
bbedford2
 
PPTX
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pptx
Varsha Nayak
 
PDF
Automate Cybersecurity Tasks with Python
VICTOR MAESTRE RAMIREZ
 
PDF
Driver Easy Pro 6.1.1 Crack Licensce key 2025 FREE
utfefguu
 
PPTX
Help for Correlations in IBM SPSS Statistics.pptx
Version 1 Analytics
 
PDF
TheFutureIsDynamic-BoxLang witch Luis Majano.pdf
Ortus Solutions, Corp
 
PDF
HiHelloHR – Simplify HR Operations for Modern Workplaces
HiHelloHR
 
PPTX
AEM User Group: India Chapter Kickoff Meeting
jennaf3
 
Agentic Automation: Build & Deploy Your First UiPath Agent
klpathrudu
 
ChiSquare Procedure in IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
Home Care Tools: Benefits, features and more
Third Rock Techkno
 
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pdf
Varsha Nayak
 
In From the Cold: Open Source as Part of Mainstream Software Asset Management
Shane Coughlan
 
Empowering Asian Contributions: The Rise of Regional User Groups in Open Sour...
Shane Coughlan
 
AI + DevOps = Smart Automation with devseccops.ai.pdf
Devseccops.ai
 
Open Chain Q2 Steering Committee Meeting - 2025-06-25
Shane Coughlan
 
Empower Your Tech Vision- Why Businesses Prefer to Hire Remote Developers fro...
logixshapers59
 
유니티에서 Burst Compiler+ThreadedJobs+SIMD 적용사례
Seongdae Kim
 
SAP Firmaya İade ABAB Kodları - ABAB ile yazılmıl hazır kod örneği
Salih Küçük
 
Tally_Basic_Operations_Presentation.pptx
AditiBansal54083
 
Foundations of Marketo Engage - Powering Campaigns with Marketo Personalization
bbedford2
 
Why Businesses Are Switching to Open Source Alternatives to Crystal Reports.pptx
Varsha Nayak
 
Automate Cybersecurity Tasks with Python
VICTOR MAESTRE RAMIREZ
 
Driver Easy Pro 6.1.1 Crack Licensce key 2025 FREE
utfefguu
 
Help for Correlations in IBM SPSS Statistics.pptx
Version 1 Analytics
 
TheFutureIsDynamic-BoxLang witch Luis Majano.pdf
Ortus Solutions, Corp
 
HiHelloHR – Simplify HR Operations for Modern Workplaces
HiHelloHR
 
AEM User Group: India Chapter Kickoff Meeting
jennaf3
 
Ad

Moving Quickly with Data Services in the Cloud

  • 1. Moving Quickly With Data Services In The Cloud Matt Dimich Cloud Architect Thomson Reuters @JobsWithUs #WorkingAtTR #HappyAtTR
  • 2. What do you mean by Data Services? Technologies • Aurora MySQL • Amazon S3 • Elastic Search • Aurora PostgreSQL • DMS/SCT • DynamoDB • ElastiCache • Big Data Areas of Concern • Automate Creation/Provisioning • Automate “Schema” Changes • Data Security – Network • Data Security – Access • Data Security – Encryption • High Availability • Cost Monitoring & Reporting • Logging, Monitoring & Alerting • Connection Management • Backup & Restore • Scale & Limits • Retry Logic, Error Handling & Transactional Integrity • Disaster Recovery • Audit Infrastructure • Internationalization • Data Consistency • Maintenance • Support Team
  • 3. Data Services Before Cloud… • Provision huge, costly database servers • Elastic Search would provision monster elastic search nodes so they could scale • Lots of human hands would touch each change to the database • Leads to high cost for experiments if you need a database • Large effort to figure out the storage and compute appropriately • (and a big time penalty if you get it wrong) • Focused on High Availability
  • 4. What’s changing… • Provisioning takes minutes • Scaling is easier than ever • Start ups suddenly have the power to get an app out with relatively low cost and high scale • Focus has shifted from high availability to time to market.
  • 6. Automated Provisioning •How does it work? • AWS CloudFormation creates our database clusters for us. • Consistent • Repeatable • Embedded Standards (i.e. default to encryption at rest) http://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/cfn-whatis-howdoesitwork.html AWS CloudFormation can update our database clusters as well
  • 8. •Don’t create it until you need it
  • 9. •Turn off on weekends and nights as possible
  • 10. •Start small and scale up later
  • 11. Automated Schema Change Pipeline • Reduce human error • Repeatable • Consistent • Same tooling as App Deployment • Lower time to deployment • Build in safeguards
  • 12. Multiple Iterations 1. Flyway with Jenkins on Prem 2. Flyway with Spinnaker and Jenkins in AWS 3. Flyway with AWS Code Pipeline in AWS
  • 13. DB Change Pipeline with AWS Code Services
  • 15. S3 • Simple Storage Service • Object Storage • Allows hierarchy structure • Has powerful lifecycle rules to expire items or move to items to cheaper storage classes etc. HTTP AWS Command Line AWS Web Console
  • 16. Simple Right? It turns out this simple storage service is actually quite complex to set up…correctly.
  • 17. How are other companies doing at this?
  • 26. But those are already weeks and months old…
  • 29. Why is this so hard? • How do you lock down a bucket? • IAM Policy • AWS-managed policy attached to a user, group or role • Self-managed policy attached to a user, group or role • Inline policy for a user, group, or role • Bucket Policy • Bucket ACL • Object ACL
  • 30. Oh, and one more thing…
  • 31. How to avoid the front page of the news • Spend time creating secure CloudFormation then use that everywhere! • At TR we often use predefined bucket types with built in standards/safeguards • Resources – For your microservice and only your microservice • Infrastructure – What it sounds like, stuff outside the app. • Website – For public facing static content
  • 32. AWS Responds Introducing Amazon Macie, Launched August 14, 2017 • “AI/Machine Learning to look for Sensitive Data”
  • 33. AWS – “We take care of the undifferentiated heavy lifting”
  • 35. How do I restore to a point-in-time with S3? • S3 has versioning for each object • Everything is available on the API • Enter the PIT restore tool • Dry run available • Estimate size • Loops through a bucket, path or just one file and reverts it to the version that was current during the entered point-in-time
  • 37. Elastic Search • It’s not as easy as saying AWS every time • Platform Elastic Search has automated their Elastic Search distribution on AWS and Azure. • Gives us custom plugins and encryption at rest • A step toward a managed service
  • 38. But I Have Experience Running This • Same ES distribution as our Data Centers • Different hardware profile • The team started smaller • But not small enough • Force our applications to ”tip over” the resources before upgrading
  • 40. @JobsWithUs #WorkingAtTR #HappyAtTR In Summary 1. Use Infrastructure as code 2. Craft your code securely and carefully 3. Re-use it everywhere So you can… 1. Move Quickly 2. Stay off the front page of the news