SlideShare a Scribd company logo
Gaining a Competitive Edge in FS 
with MongoDB and Pentaho 
Matt Kalan 
Business Architect, Financial Services at MongoDB 
matt.kalan@mongodb.com 
@matthewkalan 
Bo Borland 
Vice President, Field Technical Sales at Pentaho 
bborland@pentaho.com 
@boborland
• Financial Services Industry Drivers 
• Traditional and Desired User Scenarios 
• Data Management Requirements 
• MongoDB Capabilities 
• Pentaho Capabilities 
• Pentaho BA Demo - Analyzing MongoDB data 
• Pentaho DI Demo - Blending Disparate Data 
• Questions 
2 
Agenda
FS Industry Transformation 
Drivers of change 
• Lost revenue (fees, prop 
3 
trading) 
• Better risk management 
• Regulatory change and 
uncertainty 
• New competitors 
• Emerging markets 
opportunities 
• Proliferation of channels 
• Globally distributed 
operations 
• Faster market 
movements 
Requirements 
• New products and new 
markets 
• Increase wallet share 
• Agility to respond to 
competitors & regulators 
• Firm-wide, cross-silo 
reporting 
• Cost savings 
• Cross-channel and 
global integration 
• Intraday decision 
support 
• Operational efficiencies
How to Respond to Transformation 
4 
How to Address 
• Maximize customer engagement 
• Enable cross-silo regulatory and operational 
reporting 
• Leverage automation and tools for 
analytics and notifications 
• Based on agile, comprehensive, and 
timely data management 
Requirements 
• New products and new 
markets 
• Increase wallet share 
• Agility to respond to 
competitors & regulators 
• Firm-wide, cross-silo 
reporting 
• Cost savings 
• Cross-channel and 
global integration 
• Intraday decision 
support 
• Operational efficiencies
Traditional Interactions 
5 
Customer 
Customer 
has 
rela>vely 
limited 
value 
from 
advisor 
Phone/email/IM 
3. 
Calls 
client 
4. 
Client 
having 
a 
baby 
and 
looking 
for 
a 
new 
house 
5. 
Already 
got 
mortgage 
and 
set 
up 
ESA 
6. 
Wants 
to 
talk 
again 
in 
3 
months 
Investment 
Advisor 
Investment 
Analysis 
Has 
minimal 
customer 
intelligence 
Mostly 
just 
publishing 
out 
informa>on 
Fundamentals 
Pricing 
News 
1. 
Decide 
to 
check-­‐in 
aDer 
a 
quarter 
2. 
Review 
porKolio, 
research, 
and 
past 
correspondence
Desired Interactions 
6 
Customer 
Phone/email/IM 
Investment 
Advisor 
Investment 
Analysis 
1. Uses 
online 
savings 
guide 
w/ 
1 
child 
Much 
greater 
and 
frequent 
customer 
intel 
Customers 
benefit 
from 
high 
value 
app(s) 
and 
more 
relevant 
advice 
Richer 
and 
relevant 
informa>on 
and 
engagement 
Data 
Analysis 
Fundamentals 
Pricing 
News 
Twi8er 
Blogs/RSS 
Facebook 
Central 
Bank 
info 
Pa8ern 
analysis 
2. 
No>fica>on 
5. 
Timely 
call 
to 
check-­‐in 
3. 
Uses 
mobile 
app 
6. 
Client 
is 
having 
a 
to 
research 
Tesla 
baby 
and 
wants 
a 
mortgage 
4. 
No>fica>on 
7. 
You 
suggest 
an 
ESA 
and 
mortgage 
proposal 
8. 
You 
also 
discuss 
Tesla 
and 
ba[ery 
technology 
Single 
view 
of 
Customer
How to Manage All This Data
RDBMSs not engineered for these 
modern applications 
Data Types 
• Unstructured data 
• JSON & Digital 
• Polymorphic data 
8 
Volume of Data 
• Petabytes of data 
• Trillions of records 
• Millions of queries per 
second 
Agile Development 
• Iterative 
• Short development 
cycles 
• New workloads 
New Architectures 
• Horizontal scaling 
• Commodity servers 
• Cloud computing 
Single Views 
• Disparate data 
• Intraday 
• Cross-channel/silo 
• Global
Many shapes of investment and 
customer data 
symbol: “TSLA”, 
type: ”news”, 
headline: “Tesla…”, 
url: ”http://...”, } 
9 
{symbol: “TSLA”, 
eps: -1.11} 
tweet: “Nice car…”, 
type: ”tweet”} 
{symbol: “TSLA”, 
type: “fundamental”, 
mktCap: 34.93, 
eps: -1.11} 
{symbol: “TSLA”, 
type: “price”, 
bid: 280.31, 
offer: 280.51, 
date: 2014-08-23, 
bidQty: 300, 
offerQty: 100} 
Investment and 
Market Data 
custID: 1000, 
type: ”mResearch”, 
symbol: “TSLA”, 
sector: ”Auto”, ...} 
{custID: 1000, 
type: ”call”, ....} 
custID: 1000, 
type: ”researchPaper”, 
doc: ”AutoOverview”, 
...} 
{custID: 1000, 
type: “email”, 
date: 2014-09-14, 
subject: “Tesla”} 
{custID: 1000, 
type: “savingApp”, 
income: 200000, 
mthSvngs: 10000, 
date: 2014-08-15, 
numChild: 1, 
offerQty: 100} 
Customer 
Activity Data
Differently shaped data are spread 
across many systems 
… Bank 
mobile 
app 
10 
Website 
app 
Wealth 
Mgmt 
App 
Banking 
CRM 
app 
Investment 
Banking 
CRM 
app 
ONE COMMON MODEL 
CustID | Activity ID | Date | Type | 100s or 1000s fields mostly agreed up front
Need to aggregate it in one dynamic 
database 
… Bank 
mobile 
app 
11 
Website 
app 
Wealth 
Mgmt 
App 
Banking 
CRM 
app 
Investment 
Banking 
CRM 
app 
COMMON FIELDS 
CustomerID | Activity ID | Type… 
Single 
View 
DYNAMIC FIELDS 
Can vary from record to record
Easy Horizontal Scaling Required 
ApplicaGon 
One 
Logical 
Database 
ParGGon 
2 
12 
ParGGon 
1 
• No impact to application 
• Minimal impact to 
operations 
ParGGon 
N 
• Elastic capacity as you 
need it 
• Automatic balancing 
Primary 
Primary 
Primary 
…
Rich Querying & Indexes Required 
13 
Objects + Rich Querying 
Multiple Fields • Select John’s holdings 
• Select everyone holding MSFT 
Geospatial • Find the nearest branch right now 
Text Search • Find all customers that mention China 
in their call activity (for a new product) 
Aggregation • Calculate the value of John’s portfolio 
• Show holdings by customer 
Map Reduce 
• For those that hold greater than 50,000 
shares of each sector, what is the next 
largest sector they hold? 
{ ! 
customer_id: 100,! 
customer_name: ‘John Smith’ ! 
as_of_date: 2014-06-11,! 
last_updated_location: ! 
[45.123, 47.232],! 
phone: [‘212-555-1212’, ! 
‘917-111-2222, …]! 
holdings: [ ! 
{ symbol: “MSFT”,! 
quantity: 10000, … },! 
{ symbol: “IBM”,! 
quantity: 20000, … }, …! 
]! 
}!
MongoDB capabilities 
ApplicaGon 
Driver 
Mongos 
Shard 
2 
14 
Shard 
1 
Primary 
Secondary 
Secondary 
Primary 
Secondary 
Secondary 
… 
Shard 
N 
Primary 
Secondary 
Secondary 
db.customer.insert({…})! 
db.customer.find({ ! 
name: ”John Smith”})! 
1. Dynamic 
Document 
Schema 
{ name: “John Smith”,! 
date: “2013-08-01”),! 
address: “10 3rd St.”,! 
phone: [! 
{ home: 1234567890},! 
{ mobile: 1234568138} ]! 
}! 
2. 
Na>ve 
language 
drivers 
4. 
High 
performance 
- Data 
locality 
- Indexes 
- RAM 
3. 
High 
availability 
- Replica 
sets 
- Strong 
Consistency 
5. 
Horizontal 
scalability 
- Sharding

More Related Content

PDF
Up Your Analytics Game with Pentaho and Vertica
Pentaho
 
PDF
Big Data Predictions for 2015
Pentaho
 
PDF
Improving the Business of Healthcare through Better Analytics
Pentaho
 
PPTX
Data Mashups for Analytics
Katharine Bierce
 
PDF
Embedded Analytics in Customer Success
Pentaho
 
PPTX
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
MongoDB
 
PDF
Data Is Your Next Product Opportunity
Pentaho
 
PDF
Big Data for Product Managers
Pentaho
 
Up Your Analytics Game with Pentaho and Vertica
Pentaho
 
Big Data Predictions for 2015
Pentaho
 
Improving the Business of Healthcare through Better Analytics
Pentaho
 
Data Mashups for Analytics
Katharine Bierce
 
Embedded Analytics in Customer Success
Pentaho
 
Data Integration and Advanced Analytics for MongoDB: Blend, Enrich and Analyz...
MongoDB
 
Data Is Your Next Product Opportunity
Pentaho
 
Big Data for Product Managers
Pentaho
 

What's hot (20)

PDF
Open Analytics 2014 - Pedro Alves - Innovation though Open Source
OpenAnalytics Spain
 
PDF
Pentaho Healthcare Solutions
Pentaho
 
PPTX
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
Pentaho
 
PDF
Embedded Analytics in Human Capital Management
Pentaho
 
PDF
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Pentaho
 
PDF
Why Your Product Needs an Analytic Strategy
Pentaho
 
PDF
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BICC Thomas More
 
PPT
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
MongoDB
 
PPT
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
MongoDB
 
PPT
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
MongoDB
 
PDF
Embedded Analytics in CRM and Marketing
Pentaho
 
PPTX
Moving from data to insights: How to effectively drive business decisions & g...
Cloudera, Inc.
 
PDF
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
Pentaho
 
PPTX
Cox Automotive: data sells cars
Cloudera, Inc.
 
PDF
Accelerating Fast Data Strategy with Data Virtualization
Denodo
 
PDF
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Denodo
 
PDF
Understanding Big Data
Capgemini
 
PDF
Big Data LDN 2018: THE NEXT WAVE: DATA, AI AND ANALYTICS IN 2019 AND BEYOND
Matt Stubbs
 
PDF
Infographic: The Road to Data-Driven Decision Making
Jeannette Browning
 
PPTX
Big Data Integration Webinar: Getting Started With Hadoop Big Data
Pentaho
 
Open Analytics 2014 - Pedro Alves - Innovation though Open Source
OpenAnalytics Spain
 
Pentaho Healthcare Solutions
Pentaho
 
Pentaho Analytics for MongoDB - presentation from MongoDB World 2014
Pentaho
 
Embedded Analytics in Human Capital Management
Pentaho
 
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Pentaho
 
Why Your Product Needs an Analytic Strategy
Pentaho
 
BI congres 2016-4: Hoe groei je als organisatie in analytische maturiteit? - ...
BICC Thomas More
 
MongoDB IoT City Tour EINDHOVEN: Analysing the Internet of Things: Davy Nys, ...
MongoDB
 
MongoDB IoT City Tour LONDON: Analysing the Internet of Things: Davy Nys, Pen...
MongoDB
 
MongoDB IoT City Tour STUTTGART: Analysing the Internet of Things. By, Pentaho
MongoDB
 
Embedded Analytics in CRM and Marketing
Pentaho
 
Moving from data to insights: How to effectively drive business decisions & g...
Cloudera, Inc.
 
Exclusive Verizon Employee Webinar: Getting More From Your CDR Data
Pentaho
 
Cox Automotive: data sells cars
Cloudera, Inc.
 
Accelerating Fast Data Strategy with Data Virtualization
Denodo
 
Product Keynote: Denodo 8.0 - A Logical Data Fabric for the Intelligent Enter...
Denodo
 
Understanding Big Data
Capgemini
 
Big Data LDN 2018: THE NEXT WAVE: DATA, AI AND ANALYTICS IN 2019 AND BEYOND
Matt Stubbs
 
Infographic: The Road to Data-Driven Decision Making
Jeannette Browning
 
Big Data Integration Webinar: Getting Started With Hadoop Big Data
Pentaho
 
Ad

Similar to Competitive edgewithmongod bandpentaho_2014sep_v3[1] (20)

PPTX
Webinar: How Financial Services Organizations Use MongoDB
MongoDB
 
PDF
How Financial Services Organizations Use MongoDB
MongoDB
 
PPTX
Webinar: How to Drive Business Value in Financial Services with MongoDB
MongoDB
 
PPT
How Retail Banks Use MongoDB
MongoDB
 
PPTX
Webinar: How to Drive Business Value in Financial Services with MongoDB
MongoDB
 
PDF
Single View of the Customer
MongoDB
 
PPTX
Webinar: Achieving Customer Centricity and High Margins in Financial Services...
MongoDB
 
PPT
Webinar: Making A Single View of the Customer Real with MongoDB
MongoDB
 
PPTX
Webinar: An Enterprise Architect’s View of MongoDB
MongoDB
 
PPTX
An Enterprise Architect's View of MongoDB
MongoDB
 
PPTX
Enterprise architectsview 2015-apr
MongoDB
 
PPTX
Webinar: How Financial Firms Create a Single Customer View with MongoDB
MongoDB
 
PDF
Big Data Paris - A Modern Enterprise Architecture
MongoDB
 
PDF
Creating a Modern Data Architecture for Digital Transformation
MongoDB
 
PDF
MongoDB Breakfast Milan - Mainframe Offloading Strategies
MongoDB
 
PPTX
Webinar: Analytics with NoSQL: Why, for What, and When?
MongoDB
 
PPTX
MongoDB & Hadoop - Understanding Your Big Data
MongoDB
 
PPTX
How to deliver a Single View in Financial Services
MongoDB
 
PPTX
MongoDB in a Mainframe World
MongoDB
 
PPTX
Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB
MongoDB
 
Webinar: How Financial Services Organizations Use MongoDB
MongoDB
 
How Financial Services Organizations Use MongoDB
MongoDB
 
Webinar: How to Drive Business Value in Financial Services with MongoDB
MongoDB
 
How Retail Banks Use MongoDB
MongoDB
 
Webinar: How to Drive Business Value in Financial Services with MongoDB
MongoDB
 
Single View of the Customer
MongoDB
 
Webinar: Achieving Customer Centricity and High Margins in Financial Services...
MongoDB
 
Webinar: Making A Single View of the Customer Real with MongoDB
MongoDB
 
Webinar: An Enterprise Architect’s View of MongoDB
MongoDB
 
An Enterprise Architect's View of MongoDB
MongoDB
 
Enterprise architectsview 2015-apr
MongoDB
 
Webinar: How Financial Firms Create a Single Customer View with MongoDB
MongoDB
 
Big Data Paris - A Modern Enterprise Architecture
MongoDB
 
Creating a Modern Data Architecture for Digital Transformation
MongoDB
 
MongoDB Breakfast Milan - Mainframe Offloading Strategies
MongoDB
 
Webinar: Analytics with NoSQL: Why, for What, and When?
MongoDB
 
MongoDB & Hadoop - Understanding Your Big Data
MongoDB
 
How to deliver a Single View in Financial Services
MongoDB
 
MongoDB in a Mainframe World
MongoDB
 
Webinar: “ditch Oracle NOW”: Best Practices for Migrating to MongoDB
MongoDB
 
Ad

More from Pentaho (7)

PPTX
Data Mashups for Analytics
Pentaho
 
PPTX
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Pentaho
 
PDF
The Next Big Thing in Big Data
Pentaho
 
PDF
30 for 30: Quick Start Your Pentaho Evaluation
Pentaho
 
PDF
Predictive Analytics with Pentaho Data Mining - Análisis Predictivo con Penta...
Pentaho
 
PDF
Pentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho
 
PPTX
Bay Area Hadoop User Group
Pentaho
 
Data Mashups for Analytics
Pentaho
 
Filling the Data Lake - Strata + HadoopWorld San Jose 2016 Preview Presentation
Pentaho
 
The Next Big Thing in Big Data
Pentaho
 
30 for 30: Quick Start Your Pentaho Evaluation
Pentaho
 
Predictive Analytics with Pentaho Data Mining - Análisis Predictivo con Penta...
Pentaho
 
Pentaho Business Analytics for ISVs and SaaS providers in healthcare
Pentaho
 
Bay Area Hadoop User Group
Pentaho
 

Recently uploaded (20)

PPTX
E-commerce and its impact on business.
pandeyranjan5483
 
PDF
New Royals Distribution Plan Presentation
ksherwin
 
PPTX
Final PPT on DAJGUA, EV Charging, Meter Devoloution, CGRF, Annual Accounts & ...
directord
 
DOCX
unit 1 BC.docx - INTRODUCTION TO BUSINESS COMMUICATION
MANJU N
 
PPTX
E-Way Bill under GST – Transport & Logistics.pptx
Keerthana Chinnathambi
 
PDF
Withum Webinar - OBBBA: Tax Insights for Food and Consumer Brands
Withum
 
PPTX
Memorandum and articles of association explained.pptx
Keerthana Chinnathambi
 
PDF
NewBase 26 July 2025 Energy News issue - 1806 by Khaled Al Awadi_compressed.pdf
Khaled Al Awadi
 
PDF
Equinox Gold - Corporate Presentation.pdf
Equinox Gold Corp.
 
PDF
Keppel Ltd. 1H 2025 Results Presentation Slides
KeppelCorporation
 
PPTX
Appreciations - July 25.pptxffsdjjjjjjjjjjjj
anushavnayak
 
PPTX
Brain Based Enterprises - Harmonising Man, Woman and Machine
Peter Cook
 
PDF
Top 10 Corporates in India Investing in Sustainable Energy.pdf
Essar Group
 
PPTX
Integrative Negotiation: Expanding the Pie
badranomar1990
 
PDF
bain-temasek-sea-green-economy-2022-report-investing-behind-the-new-realities...
YudiSaputra43
 
PPTX
Virbyze_Our company profile_Preview.pptx
myckwabs
 
PPTX
Business Plan Presentation: Vision, Strategy, Services, Growth Goals & Future...
neelsoni2108
 
PDF
Using Innovative Solar Manufacturing to Drive India's Renewable Energy Revolu...
Insolation Energy
 
PPTX
Appreciations - July 25.pptxsdsdsddddddsssss
anushavnayak
 
PPTX
Pakistan’s Leading Manpower Export Agencies for Qatar
Glassrooms Dubai
 
E-commerce and its impact on business.
pandeyranjan5483
 
New Royals Distribution Plan Presentation
ksherwin
 
Final PPT on DAJGUA, EV Charging, Meter Devoloution, CGRF, Annual Accounts & ...
directord
 
unit 1 BC.docx - INTRODUCTION TO BUSINESS COMMUICATION
MANJU N
 
E-Way Bill under GST – Transport & Logistics.pptx
Keerthana Chinnathambi
 
Withum Webinar - OBBBA: Tax Insights for Food and Consumer Brands
Withum
 
Memorandum and articles of association explained.pptx
Keerthana Chinnathambi
 
NewBase 26 July 2025 Energy News issue - 1806 by Khaled Al Awadi_compressed.pdf
Khaled Al Awadi
 
Equinox Gold - Corporate Presentation.pdf
Equinox Gold Corp.
 
Keppel Ltd. 1H 2025 Results Presentation Slides
KeppelCorporation
 
Appreciations - July 25.pptxffsdjjjjjjjjjjjj
anushavnayak
 
Brain Based Enterprises - Harmonising Man, Woman and Machine
Peter Cook
 
Top 10 Corporates in India Investing in Sustainable Energy.pdf
Essar Group
 
Integrative Negotiation: Expanding the Pie
badranomar1990
 
bain-temasek-sea-green-economy-2022-report-investing-behind-the-new-realities...
YudiSaputra43
 
Virbyze_Our company profile_Preview.pptx
myckwabs
 
Business Plan Presentation: Vision, Strategy, Services, Growth Goals & Future...
neelsoni2108
 
Using Innovative Solar Manufacturing to Drive India's Renewable Energy Revolu...
Insolation Energy
 
Appreciations - July 25.pptxsdsdsddddddsssss
anushavnayak
 
Pakistan’s Leading Manpower Export Agencies for Qatar
Glassrooms Dubai
 

Competitive edgewithmongod bandpentaho_2014sep_v3[1]

  • 1. Gaining a Competitive Edge in FS with MongoDB and Pentaho Matt Kalan Business Architect, Financial Services at MongoDB [email protected] @matthewkalan Bo Borland Vice President, Field Technical Sales at Pentaho [email protected] @boborland
  • 2. • Financial Services Industry Drivers • Traditional and Desired User Scenarios • Data Management Requirements • MongoDB Capabilities • Pentaho Capabilities • Pentaho BA Demo - Analyzing MongoDB data • Pentaho DI Demo - Blending Disparate Data • Questions 2 Agenda
  • 3. FS Industry Transformation Drivers of change • Lost revenue (fees, prop 3 trading) • Better risk management • Regulatory change and uncertainty • New competitors • Emerging markets opportunities • Proliferation of channels • Globally distributed operations • Faster market movements Requirements • New products and new markets • Increase wallet share • Agility to respond to competitors & regulators • Firm-wide, cross-silo reporting • Cost savings • Cross-channel and global integration • Intraday decision support • Operational efficiencies
  • 4. How to Respond to Transformation 4 How to Address • Maximize customer engagement • Enable cross-silo regulatory and operational reporting • Leverage automation and tools for analytics and notifications • Based on agile, comprehensive, and timely data management Requirements • New products and new markets • Increase wallet share • Agility to respond to competitors & regulators • Firm-wide, cross-silo reporting • Cost savings • Cross-channel and global integration • Intraday decision support • Operational efficiencies
  • 5. Traditional Interactions 5 Customer Customer has rela>vely limited value from advisor Phone/email/IM 3. Calls client 4. Client having a baby and looking for a new house 5. Already got mortgage and set up ESA 6. Wants to talk again in 3 months Investment Advisor Investment Analysis Has minimal customer intelligence Mostly just publishing out informa>on Fundamentals Pricing News 1. Decide to check-­‐in aDer a quarter 2. Review porKolio, research, and past correspondence
  • 6. Desired Interactions 6 Customer Phone/email/IM Investment Advisor Investment Analysis 1. Uses online savings guide w/ 1 child Much greater and frequent customer intel Customers benefit from high value app(s) and more relevant advice Richer and relevant informa>on and engagement Data Analysis Fundamentals Pricing News Twi8er Blogs/RSS Facebook Central Bank info Pa8ern analysis 2. No>fica>on 5. Timely call to check-­‐in 3. Uses mobile app 6. Client is having a to research Tesla baby and wants a mortgage 4. No>fica>on 7. You suggest an ESA and mortgage proposal 8. You also discuss Tesla and ba[ery technology Single view of Customer
  • 7. How to Manage All This Data
  • 8. RDBMSs not engineered for these modern applications Data Types • Unstructured data • JSON & Digital • Polymorphic data 8 Volume of Data • Petabytes of data • Trillions of records • Millions of queries per second Agile Development • Iterative • Short development cycles • New workloads New Architectures • Horizontal scaling • Commodity servers • Cloud computing Single Views • Disparate data • Intraday • Cross-channel/silo • Global
  • 9. Many shapes of investment and customer data symbol: “TSLA”, type: ”news”, headline: “Tesla…”, url: ”http://...”, } 9 {symbol: “TSLA”, eps: -1.11} tweet: “Nice car…”, type: ”tweet”} {symbol: “TSLA”, type: “fundamental”, mktCap: 34.93, eps: -1.11} {symbol: “TSLA”, type: “price”, bid: 280.31, offer: 280.51, date: 2014-08-23, bidQty: 300, offerQty: 100} Investment and Market Data custID: 1000, type: ”mResearch”, symbol: “TSLA”, sector: ”Auto”, ...} {custID: 1000, type: ”call”, ....} custID: 1000, type: ”researchPaper”, doc: ”AutoOverview”, ...} {custID: 1000, type: “email”, date: 2014-09-14, subject: “Tesla”} {custID: 1000, type: “savingApp”, income: 200000, mthSvngs: 10000, date: 2014-08-15, numChild: 1, offerQty: 100} Customer Activity Data
  • 10. Differently shaped data are spread across many systems … Bank mobile app 10 Website app Wealth Mgmt App Banking CRM app Investment Banking CRM app ONE COMMON MODEL CustID | Activity ID | Date | Type | 100s or 1000s fields mostly agreed up front
  • 11. Need to aggregate it in one dynamic database … Bank mobile app 11 Website app Wealth Mgmt App Banking CRM app Investment Banking CRM app COMMON FIELDS CustomerID | Activity ID | Type… Single View DYNAMIC FIELDS Can vary from record to record
  • 12. Easy Horizontal Scaling Required ApplicaGon One Logical Database ParGGon 2 12 ParGGon 1 • No impact to application • Minimal impact to operations ParGGon N • Elastic capacity as you need it • Automatic balancing Primary Primary Primary …
  • 13. Rich Querying & Indexes Required 13 Objects + Rich Querying Multiple Fields • Select John’s holdings • Select everyone holding MSFT Geospatial • Find the nearest branch right now Text Search • Find all customers that mention China in their call activity (for a new product) Aggregation • Calculate the value of John’s portfolio • Show holdings by customer Map Reduce • For those that hold greater than 50,000 shares of each sector, what is the next largest sector they hold? { ! customer_id: 100,! customer_name: ‘John Smith’ ! as_of_date: 2014-06-11,! last_updated_location: ! [45.123, 47.232],! phone: [‘212-555-1212’, ! ‘917-111-2222, …]! holdings: [ ! { symbol: “MSFT”,! quantity: 10000, … },! { symbol: “IBM”,! quantity: 20000, … }, …! ]! }!
  • 14. MongoDB capabilities ApplicaGon Driver Mongos Shard 2 14 Shard 1 Primary Secondary Secondary Primary Secondary Secondary … Shard N Primary Secondary Secondary db.customer.insert({…})! db.customer.find({ ! name: ”John Smith”})! 1. Dynamic Document Schema { name: “John Smith”,! date: “2013-08-01”),! address: “10 3rd St.”,! phone: [! { home: 1234567890},! { mobile: 1234568138} ]! }! 2. Na>ve language drivers 4. High performance - Data locality - Indexes - RAM 3. High availability - Replica sets - Strong Consistency 5. Horizontal scalability - Sharding