Data Analytics with MongoDB
for fun and profit
Michael Gordon
CFO, MongoDB
Dr. Grigori Melnik
VP, Products, MongoDB
@gmelnik
Safe Harbor Statement
The development, release, and timing of any features or functionality
described for our products remains at our sole discretion. This
information is merely intended to outline our general product direction
and it should not be relied on in making a purchasing decision nor is
this a commitment, promise or legal obligation to deliver any material,
code, or functionality.
data is a new oil
data is a new soil
MongoDB World 2018: Data Analytics with MongoDB
Options for Visualizing MongoDB Data
• Custom Code + Charting Libraries
• ETL + 3rd Party BI Tools
• MongoDB BI Connector + 3rd Party BI Tools
• MongoDB Charts
• MongoDB Compass
Things to Think About
• Use the correct architecture
• Determine what your needs are
• Multiple heterogeneous data sources?
• Huge amounts of complex data?
• Sophisticated aggregations?
• Quick self-service?
• Need to interact? share? collaborate?
• Choose the right tools for you
Architectural Considerations
• Hidden secondaries maintain a copy of the primary data set
• Hidden secondaries are used for workloads with different access patterns
• Cannot become primary
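
As a rough illustration of the points above, the sketch below marks one member of a replica set configuration as hidden. It only manipulates a plain dictionary; the replica set name and host names are hypothetical placeholders, and `hide_member` is a helper invented for this example.

```python
import copy


def hide_member(rs_config, host):
    """Return a copy of a replica set config with `host` marked hidden.

    A hidden member must have priority 0, which is also why it can never
    be elected primary.
    """
    new_config = copy.deepcopy(rs_config)
    for member in new_config["members"]:
        if member["host"] == host:
            member["hidden"] = True
            member["priority"] = 0
            break
    else:
        raise ValueError(f"no member with host {host!r}")
    # Assumption: a reconfiguration should carry a higher version number
    # than the configuration it replaces.
    new_config["version"] += 1
    return new_config


# Hypothetical three-member replica set; all names are placeholders.
current = {
    "_id": "rs0",
    "version": 1,
    "members": [
        {"_id": 0, "host": "db0.example.net:27017", "priority": 1},
        {"_id": 1, "host": "db1.example.net:27017", "priority": 1},
        {"_id": 2, "host": "analytics.example.net:27017", "priority": 1},
    ],
}

proposed = hide_member(current, "analytics.example.net:27017")
print(proposed["members"][2])
```

With a driver such as PyMongo, a configuration like `proposed` would typically be applied through the `replSetReconfig` admin command; analytics clients would then connect to the hidden member directly, since it is not advertised to regular clients.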
Build Your Own
Custom Code with Some Reuse
• Powerful Aggregation Framework
• Rich collection of drivers (incl. Python and R)
• 3rd party charting libraries (D3, Vega, Vega-Lite, etc.)
• Pros:
• Custom tailored solution: fits exactly as required!
• Idiomatic to your development language
• High degree of expressiveness
• Cons:
• High investment
• Maintenance
• Deep understanding of the underlying tech and its language(s)
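
A minimal sketch of the build-your-own route, assuming a hypothetical orders collection with `status`, `region`, and `amount` fields: an aggregation pipeline expressed as Python data, and a helper (`to_vega_lite_bar`, invented here) that wraps the result rows in a Vega-Lite bar chart specification. The sample rows stand in for real query output.

```python
import json

# Aggregation pipeline: total order value per region, largest first.
# The field names are hypothetical.
pipeline = [
    {"$match": {"status": "complete"}},
    {"$group": {"_id": "$region", "total": {"$sum": "$amount"}}},
    {"$sort": {"total": -1}},
]


def to_vega_lite_bar(rows, category_field, value_field):
    """Wrap aggregation output in a minimal Vega-Lite bar chart spec."""
    return {
        "$schema": "https://vega.github.io/schema/vega-lite/v5.json",
        "data": {"values": rows},
        "mark": "bar",
        "encoding": {
            "x": {"field": category_field, "type": "nominal"},
            "y": {"field": value_field, "type": "quantitative"},
        },
    }


# Stand-in for what the pipeline might return; with PyMongo this would be
# list(collection.aggregate(pipeline)).
sample_rows = [
    {"_id": "EMEA", "total": 1200},
    {"_id": "APAC", "total": 950},
]

spec = to_vega_lite_bar(sample_rows, "_id", "total")
print(json.dumps(spec, indent=2))
```

Even this small example hints at the cons listed above: the pipeline, the chart specification, and the glue between them all have to be written and maintained by hand.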
Exploratory Analysis with MongoDB Compass
MongoDB Compass
• Developer tool
• Data management and manipulation
• document view
• table view
• Visual schema analyzer
• with query builder
• export to language
• Aggregation pipeline builder
• A good first place to start
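
To give a feel for what a schema analyzer reports, here is a small sketch that walks a sample of documents and counts which value types appear under each field path. This is only an illustration of the idea, not how Compass is implemented; `summarize_schema` and the sample documents are made up for this example.

```python
from collections import Counter, defaultdict


def summarize_schema(docs):
    """Count, per dotted field path, how often each Python type appears."""
    summary = defaultdict(Counter)

    def walk(value, path):
        if isinstance(value, dict):
            # Descend into sub-documents, extending the dotted path.
            for key, child in value.items():
                walk(child, f"{path}.{key}" if path else key)
        else:
            summary[path][type(value).__name__] += 1

    for doc in docs:
        walk(doc, "")
    return {path: dict(counts) for path, counts in summary.items()}


# Hypothetical sample with a nested field and a polymorphic "age" field.
sample = [
    {"name": "Ada", "age": 36, "address": {"city": "London"}},
    {"name": "Lin", "age": "unknown", "tags": ["a", "b"]},
]

schema = summarize_schema(sample)
for path, types in schema.items():
    print(path, types)
```

The mixed types reported for `age` are the kind of finding that usually prompts a closer look before building queries or charts on a field.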
MongoDB Compass - When to Use
• Exploratory data analysis
• Data preparation & basic manipulation
• Data ingestion via JSON or CSV import
• Day-to-day development/operations
• Adding and understanding indexes
• Adding validation rules
• Authoring & troubleshooting aggregation pipelines
• Viewing real-time server stats
• 10,000 ft → 1 ft view of data
Demo: Exploratory Analysis with Compass
MongoDB BI Connector
• Visualize and explore MongoDB data in SQL-based BI tools:
• Automatically discovers the schema
• Translates complex SQL statements issued by the BI tool into MongoDB aggregation queries
• Converts the results into a tabular format for rendering inside the BI tool
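
The last step above, turning documents into a table, can be sketched roughly as follows. This is a simplified illustration and not the connector's actual mapping: nested sub-documents become dotted column names, missing values become `None`, and arrays are not handled at all. The helper names and sample documents are hypothetical.

```python
def flatten(doc, prefix=""):
    """Flatten nested sub-documents into dotted column names."""
    row = {}
    for key, value in doc.items():
        column = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            row.update(flatten(value, column))
        else:
            row[column] = value
    return row


def to_table(docs):
    """Return (columns, rows), with None where a document lacks a column."""
    flat = [flatten(doc) for doc in docs]
    columns = []
    for row in flat:
        for column in row:
            if column not in columns:
                columns.append(column)
    rows = [[row.get(column) for column in columns] for row in flat]
    return columns, rows


# Hypothetical documents; the second one has no "zip" field.
docs = [
    {"_id": 1, "name": "Ada", "address": {"city": "London", "zip": "N1"}},
    {"_id": 2, "name": "Lin", "address": {"city": "Oslo"}},
]

columns, rows = to_table(docs)
print(columns)
for row in rows:
    print(row)
```

The `None` cells show where the tabular view has to paper over documents that do not share a shape.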
BI Connector & ODBC Driver
[Diagram: ODBC Driver, mongosqld, ...]
MongoDB BI Connector - When to Use
• Want to speak SQL to MongoDB
• Multiple data sources (not just MongoDB)
• Business analysts
• Reporting only
• Powerful but you lose the benefits of the Document Model
Demo: Data Analysis with Microsoft Excel and BI Connector
MongoDB Charts
Wouldn’t it be lovely if...
You could visualize your MongoDB data natively…
• without needing to write custom code
• without needing to move your data into a different repository
• without needing to wrangle with flaky ETL pipelines
• without needing to purchase and configure third-party tools
• without losing the richness of the Document Model
MongoDB Charts
• Lightweight
• Intuitive
• Build visualizations on MongoDB data (nested, polymorphic)
• Share content easily
MongoDB Charts - When to Use
• The fastest way to build visualizations over your MongoDB data
• Ad hoc analyses
• Benefit from the Document Model
• No need to flatten/ETL your MongoDB data
• Type handling
• Polymorphic collections
• Nested documents
• Array reductions
• Prebuilt dashboards for collaboration
• Self-service
• Intuitive enough for domain experts and non-developers to use!
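
As an illustration of what an array reduction means, the sketch below collapses an array field into a single chartable value, both as an `$addFields` aggregation stage and as a local Python equivalent. It shows the general idea only and makes no claim about how Charts performs reductions internally; the helper names, the reduction labels, and the sample document are assumptions made for this example.

```python
# Map a reduction label to (aggregation operator, local Python equivalent).
REDUCTIONS = {
    "length": ("$size", len),
    "sum": ("$sum", sum),
    "min": ("$min", min),
    "max": ("$max", max),
}


def reduction_stage(array_field, how, output_field):
    """Build an $addFields stage that reduces an array field to a scalar."""
    operator, _ = REDUCTIONS[how]
    return {"$addFields": {output_field: {operator: f"${array_field}"}}}


def reduce_locally(doc, array_field, how):
    """Apply the same reduction to one in-memory document."""
    _, function = REDUCTIONS[how]
    return function(doc[array_field])


# Hypothetical document with an array of numbers.
doc = {"student": "Ada", "scores": [82, 91, 77]}

print(reduction_stage("scores", "max", "best_score"))
print(reduce_locally(doc, "scores", "max"))
```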
Demo: Data Viz with Charts
Lifecycle
1. Ingest
2. Explore/Prepare
‒ Calcs
‒ Groups
‒ Data types
3. Visualize
‒ Chart types
‒ Binning
‒ Limiting
‒ Multi-series
4. Analyze/Use
‒ Dashboards
‒ Key views on data
‒ Patterns
‒ Drilldowns
‒ Pivots
5. Share
‒ Export
‒ Collaborate
‒ Embed
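
Binning, from the Visualize step above, can be sketched as follows. The local function mirrors the half-open bins used by the aggregation framework's `$bucket` stage (lower bound included, upper bound excluded), and `bucket_stage` builds the corresponding stage. The field name, boundaries, and sample values are hypothetical.

```python
from bisect import bisect_right
from collections import Counter


def bucket_counts(values, boundaries, default="other"):
    """Count values per bin; each bin is [lower, upper), keyed by lower bound."""
    counts = Counter()
    for value in values:
        index = bisect_right(boundaries, value) - 1
        if 0 <= index < len(boundaries) - 1:
            counts[boundaries[index]] += 1
        else:
            # Values outside every bin fall into the default bucket.
            counts[default] += 1
    return dict(counts)


def bucket_stage(field, boundaries, default="other"):
    """Build the equivalent $bucket aggregation stage."""
    return {
        "$bucket": {
            "groupBy": f"${field}",
            "boundaries": boundaries,
            "default": default,
            "output": {"count": {"$sum": 1}},
        }
    }


# Hypothetical prices, binned into [0, 50) and [50, 100).
prices = [5, 49, 50, 99, 100, 250]
binned = bucket_counts(prices, [0, 50, 100])
print(binned)
print(bucket_stage("price", [0, 50, 100]))
```

Note that 100 lands in the default bucket rather than the second bin, because upper bounds are exclusive.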
Final Thoughts
To think clearly about the relationship between evidence and conclusion,
the relevant question is How do I know that? Answering this question
requires self-awareness about the quality and integrity of information,
and particularly how that information arrived to one’s own world.
Similarly, to ask others, How do you know that? How do they know that?
These questions are among the best you can ask analytically.
~ Edward Tufte
Other Recommended Sessions
Thank you