SlideShare a Scribd company logo
NOSQL: DOING IT WRONG BEFORE GETTING IT RIGHT
6-27-2015/Lawrence Trinh
Content
Goals
• Accommodate new products
– Content
– Marketing
– Schema changes
• Start using Node.js for services and web tier
• Move content out to AWS
– Improve abilities to scale up/down
Background
• Web App
– Host website (www.fandango.com)
• Backoffice tools
– Map data
– Modify existing content
• SQL Server
– Store data
Enter Couchbase
• CDC
– Monitor SQL Server changes
• Export
– Grabs changes and pushes to
• Import
– Contains rules to load data
• Custom query language
– Sit in front of couchbase to
pull/join data
Challenges
• Rules
– Large number /data type
– Import grows quickly
• Breaking points
– Lots of places things can
go wrong
• Second source of truth
– New tools write directly to
Couchbase
Enter Microservices
• Data store
– Simplified flow
– Centralized
• Flexibility
– (Re)building should be less
risky
• Service ownership
– Easily distributable
Q&A

More Related Content

What's hot (20)

PPTX
Hadoop Infrastructure @Uber Past, Present and Future
DataWorks Summit
 
PPTX
Hadoop World 2011: Data Ingestion, Egression, and Preparation for Hadoop - Sa...
Cloudera, Inc.
 
PPTX
Data Engineer's Lunch #55: Get Started in Data Engineering
Anant Corporation
 
PDF
Big Data Day LA 2016/ Use Case Driven track - Hydrator: Open Source, Code-Fre...
Data Con LA
 
PDF
03-NOV-1510-Ognjen-Antonic-Telemach-stream-1
Ognjen Antonic
 
PPTX
Embeddable data transformation for real time streams
Joey Echeverria
 
PPTX
Scaling Deep Learning on Hadoop at LinkedIn
DataWorks Summit
 
PDF
Big Telco - Yousun Jeong
Spark Summit
 
PDF
What database
Regunath B
 
PDF
What's new in SQL on Hadoop and Beyond
DataWorks Summit/Hadoop Summit
 
PDF
Presto Strata London 2019: Cost-Based Optimizer for interactive SQL on anything
Piotr Findeisen
 
PDF
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
AWS Chicago
 
PPTX
Real time fraud detection at 1+M scale on hadoop stack
DataWorks Summit/Hadoop Summit
 
PPTX
Built-In Security for the Cloud
DataWorks Summit
 
PPTX
Reshape Data Lake (as of 2020.07)
Eric Sun
 
PPTX
Qubole - Big data in cloud
Dmitry Tolpeko
 
PDF
Owning Your Own (Data) Lake House
Data Con LA
 
PDF
HBaseCon 2013: Real-Time Model Scoring in Recommender Systems
Cloudera, Inc.
 
PPTX
Drilling into Data with Apache Drill
DataWorks Summit
 
PPTX
A Non-Standard use Case of Hadoop: High Scale Image Processing and Analytics
DataWorks Summit
 
Hadoop Infrastructure @Uber Past, Present and Future
DataWorks Summit
 
Hadoop World 2011: Data Ingestion, Egression, and Preparation for Hadoop - Sa...
Cloudera, Inc.
 
Data Engineer's Lunch #55: Get Started in Data Engineering
Anant Corporation
 
Big Data Day LA 2016/ Use Case Driven track - Hydrator: Open Source, Code-Fre...
Data Con LA
 
03-NOV-1510-Ognjen-Antonic-Telemach-stream-1
Ognjen Antonic
 
Embeddable data transformation for real time streams
Joey Echeverria
 
Scaling Deep Learning on Hadoop at LinkedIn
DataWorks Summit
 
Big Telco - Yousun Jeong
Spark Summit
 
What database
Regunath B
 
What's new in SQL on Hadoop and Beyond
DataWorks Summit/Hadoop Summit
 
Presto Strata London 2019: Cost-Based Optimizer for interactive SQL on anything
Piotr Findeisen
 
Jeremy Engle's slides from Redshift / Big Data meetup on July 13, 2017
AWS Chicago
 
Real time fraud detection at 1+M scale on hadoop stack
DataWorks Summit/Hadoop Summit
 
Built-In Security for the Cloud
DataWorks Summit
 
Reshape Data Lake (as of 2020.07)
Eric Sun
 
Qubole - Big data in cloud
Dmitry Tolpeko
 
Owning Your Own (Data) Lake House
Data Con LA
 
HBaseCon 2013: Real-Time Model Scoring in Recommender Systems
Cloudera, Inc.
 
Drilling into Data with Apache Drill
DataWorks Summit
 
A Non-Standard use Case of Hadoop: High Scale Image Processing and Analytics
DataWorks Summit
 

Viewers also liked (20)

PPTX
La big datacamp2014_vikram_dixit
Data Con LA
 
PDF
Big Data Day LA 2015 - HBase at Factual: Real time and Batch Uses by Molly O'...
Data Con LA
 
PDF
Aziksa hadoop for buisness users2 santosh jha
Data Con LA
 
PDF
Kiji cassandra la june 2014 - v02 clint-kelly
Data Con LA
 
PDF
Yarn cloudera-kathleenting061414 kate-ting
Data Con LA
 
PPTX
2014 bigdatacamp asya_kamsky
Data Con LA
 
PPTX
Summit v4 dave wolcott
Data Con LA
 
PDF
20140614 introduction to spark-ben white
Data Con LA
 
PDF
140614 bigdatacamp-la-keynote-jon hsieh
Data Con LA
 
PPT
Big datacamp june14_alex_liu
Data Con LA
 
PDF
Big Data Day LA 2015 - Solr Search with Spark for Big Data Analytics in Actio...
Data Con LA
 
PDF
Ag big datacampla-06-14-2014-ajay_gopal
Data Con LA
 
PDF
Hadoop and NoSQL joining forces by Dale Kim of MapR
Data Con LA
 
PDF
Big Data Day LA 2015 - Lessons Learned from Designing Data Ingest Systems by ...
Data Con LA
 
PPTX
Hadoop Innovation Summit 2014
Data Con LA
 
PPTX
Big Data Day LA 2015 - Deep Learning Human Vocalized Animal Sounds by Sabri S...
Data Con LA
 
PPTX
Big Data Day LA 2016/ Data Science Track - Decision Making and Lambda Archite...
Data Con LA
 
PDF
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Introduction to Kafka - Je...
Data Con LA
 
PDF
Big Data Day LA 2016/ Big Data Track - Twitter Heron @ Scale - Karthik Ramasa...
Data Con LA
 
PDF
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Alluxio (formerly Tachyon)...
Data Con LA
 
La big datacamp2014_vikram_dixit
Data Con LA
 
Big Data Day LA 2015 - HBase at Factual: Real time and Batch Uses by Molly O'...
Data Con LA
 
Aziksa hadoop for buisness users2 santosh jha
Data Con LA
 
Kiji cassandra la june 2014 - v02 clint-kelly
Data Con LA
 
Yarn cloudera-kathleenting061414 kate-ting
Data Con LA
 
2014 bigdatacamp asya_kamsky
Data Con LA
 
Summit v4 dave wolcott
Data Con LA
 
20140614 introduction to spark-ben white
Data Con LA
 
140614 bigdatacamp-la-keynote-jon hsieh
Data Con LA
 
Big datacamp june14_alex_liu
Data Con LA
 
Big Data Day LA 2015 - Solr Search with Spark for Big Data Analytics in Actio...
Data Con LA
 
Ag big datacampla-06-14-2014-ajay_gopal
Data Con LA
 
Hadoop and NoSQL joining forces by Dale Kim of MapR
Data Con LA
 
Big Data Day LA 2015 - Lessons Learned from Designing Data Ingest Systems by ...
Data Con LA
 
Hadoop Innovation Summit 2014
Data Con LA
 
Big Data Day LA 2015 - Deep Learning Human Vocalized Animal Sounds by Sabri S...
Data Con LA
 
Big Data Day LA 2016/ Data Science Track - Decision Making and Lambda Archite...
Data Con LA
 
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Introduction to Kafka - Je...
Data Con LA
 
Big Data Day LA 2016/ Big Data Track - Twitter Heron @ Scale - Karthik Ramasa...
Data Con LA
 
Big Data Day LA 2016/ Hadoop/ Spark/ Kafka track - Alluxio (formerly Tachyon)...
Data Con LA
 
Ad

Similar to Big Data Day LA 2015 - NoSQL: Doing it wrong before getting it right by Lawrence Trinh of Fandango (20)

PPTX
A Scalable Data Transformation Framework using Hadoop Ecosystem
DataWorks Summit
 
PDF
David Max SATURN 2018 - Migrating from Oracle to Espresso
David Max
 
PPT
A Scalable Data Transformation Framework using the Hadoop Ecosystem
Serendio Inc.
 
PPTX
Apache Hadoop Hive
Some corner at the Laboratory
 
PDF
Couchbase Chennai meetup #3 What's new in Couchbase Server & Couchbase Mobile
Karthik Babu Sekar
 
PDF
Operations for databases – the agile/devops journey
Eduardo Piairo
 
PPTX
SPTechCon Austin - The Slippery Slope of SharePoint Migrations
Jill Hannemann
 
PPTX
The Path Through SharePoint Migrations
Brian Caauwe
 
PPTX
NoSql Brownbag
Sandeep Kumar
 
PPTX
Intro to Azure Data Factory v1
Eric Bragas
 
DOC
scopp_chris_resume
Chris Scopp
 
DOC
scopp_chris_resume
Chris Scopp
 
PPTX
Using Amazon RDS to power enterprise applications (Peoplesoft)
Tom Laszewski
 
PPTX
Case Study: Implementing Hadoop and Elastic Map Reduce on Scale-out Object S...
Cloudian
 
PPTX
Plan, prepare & overall process of upgrade and migrate to SharePoint 2013
Kashish Sukhija
 
PPTX
The Future of Data Engineering - 2019 InfoQ QConSF
Chris Riccomini
 
PDF
Cloud-native Data
cornelia davis
 
PDF
Cloud-Native-Data with Cornelia Davis
VMware Tanzu
 
DOC
Tah 03302015 withendclient
Terry Hendrickson
 
PPTX
Introduction to Big Data Technologies: Hadoop/EMR/Map Reduce & Redshift
DataKitchen
 
A Scalable Data Transformation Framework using Hadoop Ecosystem
DataWorks Summit
 
David Max SATURN 2018 - Migrating from Oracle to Espresso
David Max
 
A Scalable Data Transformation Framework using the Hadoop Ecosystem
Serendio Inc.
 
Apache Hadoop Hive
Some corner at the Laboratory
 
Couchbase Chennai meetup #3 What's new in Couchbase Server & Couchbase Mobile
Karthik Babu Sekar
 
Operations for databases – the agile/devops journey
Eduardo Piairo
 
SPTechCon Austin - The Slippery Slope of SharePoint Migrations
Jill Hannemann
 
The Path Through SharePoint Migrations
Brian Caauwe
 
NoSql Brownbag
Sandeep Kumar
 
Intro to Azure Data Factory v1
Eric Bragas
 
scopp_chris_resume
Chris Scopp
 
scopp_chris_resume
Chris Scopp
 
Using Amazon RDS to power enterprise applications (Peoplesoft)
Tom Laszewski
 
Case Study: Implementing Hadoop and Elastic Map Reduce on Scale-out Object S...
Cloudian
 
Plan, prepare & overall process of upgrade and migrate to SharePoint 2013
Kashish Sukhija
 
The Future of Data Engineering - 2019 InfoQ QConSF
Chris Riccomini
 
Cloud-native Data
cornelia davis
 
Cloud-Native-Data with Cornelia Davis
VMware Tanzu
 
Tah 03302015 withendclient
Terry Hendrickson
 
Introduction to Big Data Technologies: Hadoop/EMR/Map Reduce & Redshift
DataKitchen
 
Ad

More from Data Con LA (20)

PPTX
Data Con LA 2022 Keynotes
Data Con LA
 
PPTX
Data Con LA 2022 Keynotes
Data Con LA
 
PDF
Data Con LA 2022 Keynote
Data Con LA
 
PPTX
Data Con LA 2022 - Startup Showcase
Data Con LA
 
PPTX
Data Con LA 2022 Keynote
Data Con LA
 
PDF
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA
 
PPTX
Data Con LA 2022 - AI Ethics
Data Con LA
 
PDF
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA
 
PDF
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA
 
PDF
Data Con LA 2022 - Real world consumer segmentation
Data Con LA
 
PPTX
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA
 
PPTX
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA
 
PDF
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA
 
PDF
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA
 
PDF
Data Con LA 2022 - Intro to Data Science
Data Con LA
 
PDF
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA
 
PPTX
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA
 
PPTX
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA
 
PPTX
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA
 
PPTX
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA
 
Data Con LA 2022 Keynotes
Data Con LA
 
Data Con LA 2022 Keynotes
Data Con LA
 
Data Con LA 2022 Keynote
Data Con LA
 
Data Con LA 2022 - Startup Showcase
Data Con LA
 
Data Con LA 2022 Keynote
Data Con LA
 
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA
 
Data Con LA 2022 - AI Ethics
Data Con LA
 
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA
 
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA
 
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA
 
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA
 
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA
 
Data Con LA 2022 - Intro to Data Science
Data Con LA
 
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA
 
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA
 
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA
 

Recently uploaded (20)

PDF
Market Insight : ETH Dominance Returns
CIFDAQ
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
PDF
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
Ivan Ruchkin
 
PDF
Researching The Best Chat SDK Providers in 2025
Ray Fields
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PDF
TrustArc Webinar - Navigating Data Privacy in LATAM: Laws, Trends, and Compli...
TrustArc
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PDF
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PDF
NewMind AI Weekly Chronicles – July’25, Week III
NewMind AI
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PPTX
The Future of AI & Machine Learning.pptx
pritsen4700
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PDF
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
PDF
The Future of Artificial Intelligence (AI)
Mukul
 
PPTX
Agile Chennai 18-19 July 2025 | Workshop - Enhancing Agile Collaboration with...
AgileNetwork
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
Market Insight : ETH Dominance Returns
CIFDAQ
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
State-Dependent Conformal Perception Bounds for Neuro-Symbolic Verification
Ivan Ruchkin
 
Researching The Best Chat SDK Providers in 2025
Ray Fields
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
TrustArc Webinar - Navigating Data Privacy in LATAM: Laws, Trends, and Compli...
TrustArc
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
NewMind AI Weekly Chronicles – July’25, Week III
NewMind AI
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
The Future of AI & Machine Learning.pptx
pritsen4700
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
The Future of Artificial Intelligence (AI)
Mukul
 
Agile Chennai 18-19 July 2025 | Workshop - Enhancing Agile Collaboration with...
AgileNetwork
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 

Big Data Day LA 2015 - NoSQL: Doing it wrong before getting it right by Lawrence Trinh of Fandango

  • 1. NOSQL: DOING IT WRONG BEFORE GETTING IT RIGHT 6-27-2015/Lawrence Trinh
  • 3. Goals • Accommodate new products – Content – Marketing – Schema changes • Start using Node.js for services and web tier • Move content out to AWS – Improve abilities to scale up/down
  • 4. Background • Web App – Host website (www.fandango.com) • Backoffice tools – Map data – Modify existing content • SQL Server – Store data
  • 5. Enter Couchbase • CDC – Monitor SQL Server changes • Export – Grabs changes and pushes to • Import – Contains rules to load data • Custom query language – Sit in front of couchbase to pull/join data
  • 6. Challenges • Rules – Large number /data type – Import grows quickly • Breaking points – Lots of places things can go wrong • Second source of truth – New tools write directly to Couchbase
  • 7. Enter Microservices • Data store – Simplified flow – Centralized • Flexibility – (Re)building should be less risky • Service ownership – Easily distributable
  • 8. Q&A