SlideShare a Scribd company logo
Administration
        Michael DelNegro
 Principal Database Administrator
               AOL



                1
Presentation Overview
• Introduction
• My Applications
• Tips
• Tools
• Resources
• Upcoming
                    2
About Me
• DBA at AOL (Dulles) for six years
• Background in Sybase
• Now MySQL, PostgreSQL and NoSQL
• Was: Blogsmith, Uncut Video, Travel, Autos,
  Journals, Real Estate, Ficlets, Shopping
• Currently: Patch, MapQuest, HSS,
  Datalayer, Demand
• I Heart Big Data
                       3
About MongoDB
• “Scalable, high-performance, open source,
  document-oriented database”
• Databases (Databases)
 • Collections (Tables)
    • Documents (Rows)
     • Fields (Columns) - K/V Pairs
• Indexes
• No Joins
 • Favors Embedding Data instead of FKs
                     4
More About MongoDB
• JSON-style Documents
• Javascript Shell
• Capped Collections
• Flexible Schemas
• Replication w/ Autofailover
• Sharding
• GridFS: File Storage
• Map-Reduce         5
MongoDB Support
• Operating Systems
 • Linux, Windows, Mac OS X, Solaris
 • 32bit, 64bit
• Drivers
 • Java, Javascript, Perl, Ruby, Scala, Erlang, C,
     C#, C++, Haskell, PHP, Python
  • R, Smalltalk, node.js, ColdFusion
                        6
MongoDB Use Cases
• Website Data Store
• Caching Tier
• Document and Content Mgmt Systems
• Event Logging
• Real-time Stats/Analytics
• Archiving
• High Volume Problems
                  7
MongoDB Misuse

• Complex Transactional Systems
• Traditional Business Intelligence
• SQL is a Must

                      8
MongoDB at AOL

• In use since Summer 2010
• I currently administer two applications for
  MapQuest and Patch
• There are other MongoDB applications in
  use throughout the company and more on
  the way


                      9
MapQuest App
• Tracks User Profile Information
• V. 1.6.5.latest (just upgraded from 1.6.3)
• 26 Total Hosts, CentOS 5, 16GB RAM
• 300 million users, 130 million saved maps
• Replica Sets
• 3 Sharded Collections
 • lookup, east coast, west coast
• Java                10
Patch App
• Tracks User Activity
• Master, 2 Slaves
• V. 1.6.3
• About 100GB of data
• Throwaway Data (for now)
• Ruby on Rails
                   11
About Patch
• “HyperLocal” news sites across the
  country
• Fills gap in coverage left by local
  newspapers
• Currently 800 sites are live
• 1000+ by end of 2011
                       12
Nearby Patch Sites
• Vienna (ex. vienna.patch.com)
• Ashburn
• Reston
• McLean
• CollegePark
• GreaterAnnapolis
• 50+ in DC Area
                      13
Upcoming Ops Plans

• Upgrade to 1.8
• Migrate Patch to Replica Sets
• Move MapQuest to bigger hardware (16GB
  -> 64GB memory)
• Add additional slaves

                     14
Admin Tips
• Slaves are a MUST pre1.8
• Use 64 bit version
 • 32 bit version has 2.5 GB storage limit
• Use xfs or ext4
• Keep eye on oplog size
• Turn off atime & dtime
• Consider using getLastError()
                     15
More Admin Tips
• Increase File Descriptor Limits
• Do not use kill -9 (pre-1.8)
• Consider having a slave on replication delay
 • -- slavedelay <seconds>
• db.runCommand(“logRotate”)
• Keep db.<collection>.totalIndexSize() less
  than RAM
• Linux dirty_background_ratio and
  dirty_ratio (pre 2.6.22)
                     16
Even More Admin Tips
• Omit parenthesis to see command details
• 5 Primitives of Mongo
 • insert, remove, update, find, getMore
• Replication is a slave polling master process
• Master and slaves each have their own
  oplog
• Choose shard key carefully (ex. timestamp)
                      17
Admin Tools
• mongodump-mongorestore
 • use fsync and lock database to ensure
    consistent backup
• fsync and lock are a must for file system
  backups (ex LVM)
• http://localhost:28017 (server port + 1000)
• db.currentOp()
                     18
More Admin Tools
• mongostat
• db.printReplicationInfo()
• db.serverStatus()
• db.<collection>.stats()
• Database Profiler
• Explain
                     19
Admin Resources
• mongodb.org
 • Events
 • Forums
• Wordnik Mongo Admin Tools (Github)
• Mongo Snippets (Github)
• IRC (freenode #mongodb)
• Little MongoDB Book
                   20
More Admin Resources
• slideshare (Use Time-Based Search)
• GUI Admin Tools
 • MongoVUE
 • Others
• Kristina Chodorow's Blog
• Boxed Ice
                    21
Even More Resources
• Follow @MongoQuestion (StackOverflow)
• MongoDB on Quora (@q_mongodb)
• 10gen Deployment Strategies Slides
• Books
• Training
• 10gen Support
• Office Hours in NYC and Redwood City
                  22
New MongoDB Release
• 1.8 (Released March 16)
 • Single Server Durability (Journaling)
 • Enhancements to Sharding & Replica Sets
 • Covered and Sparse Indexes
 • Tab Completion
 • Maximum BSON Document: 16MB
 • 1.8 Features Presentation
                    23
Future Releases
• 2.0 (May/June?)
 • Better Map-Reduce and Aggregation
 • Improved Concurrency
 • Online Compaction
 • TTL Time-Out Collections
• Beyond
 • Full-Text Search?
                   24
Thank You!

• www.slideshare.net/radiocats
• @radiocats on Twitter
• www.linkedin.com/in/mdelnegro

                   25

More Related Content

What's hot (20)

PDF
Building Google-in-a-box: using Apache SolrCloud and Bigtop to index your big...
rhatr
 
PPTX
MongoDB
Albin John
 
PDF
Cassandra at Vast
Cassandra Austin
 
PPT
Wmware NoSQL
Murat Çakal
 
PDF
CosmosDB for DBAs & Developers
Niko Neugebauer
 
PDF
20141206 4 q14_dataconference_i_am_your_db
hyeongchae lee
 
PPTX
The Hive Think Tank: Rocking the Database World with RocksDB
The Hive
 
PDF
Ndb cluster 80_ycsb_disk
mikaelronstrom
 
PDF
Presto Meetup @ Facebook (3/22/2016)
Martin Traverso
 
PDF
Beyond Postgres: Interesting Projects, Tools and forks
Sameer Kumar
 
PPTX
The Hive Think Tank: Rocking the Database World with RocksDB
The Hive
 
PDF
2016 jan-pugs-meetup-v9.5-features
Sameer Kumar
 
PDF
NOSQL Overview
Tobias Lindaaker
 
PDF
RocksDB meetup
Javier González
 
PDF
PGDay.Seoul 2016 lightingtalk
hyeongchae lee
 
PPTX
How to ensure Presto scalability 
in multi use case
Kai Sasaki
 
PDF
Orchestrating MySQL
Ivan Zoratti
 
PPTX
Webinar 2017. Supercharge your analytics with ClickHouse. Vadim Tkachenko
Altinity Ltd
 
PDF
Иван Глушков (Echo)
Ontico
 
PPTX
Introduction to CosmosDB - Azure Bootcamp 2018
Josh Carlisle
 
Building Google-in-a-box: using Apache SolrCloud and Bigtop to index your big...
rhatr
 
MongoDB
Albin John
 
Cassandra at Vast
Cassandra Austin
 
Wmware NoSQL
Murat Çakal
 
CosmosDB for DBAs & Developers
Niko Neugebauer
 
20141206 4 q14_dataconference_i_am_your_db
hyeongchae lee
 
The Hive Think Tank: Rocking the Database World with RocksDB
The Hive
 
Ndb cluster 80_ycsb_disk
mikaelronstrom
 
Presto Meetup @ Facebook (3/22/2016)
Martin Traverso
 
Beyond Postgres: Interesting Projects, Tools and forks
Sameer Kumar
 
The Hive Think Tank: Rocking the Database World with RocksDB
The Hive
 
2016 jan-pugs-meetup-v9.5-features
Sameer Kumar
 
NOSQL Overview
Tobias Lindaaker
 
RocksDB meetup
Javier González
 
PGDay.Seoul 2016 lightingtalk
hyeongchae lee
 
How to ensure Presto scalability 
in multi use case
Kai Sasaki
 
Orchestrating MySQL
Ivan Zoratti
 
Webinar 2017. Supercharge your analytics with ClickHouse. Vadim Tkachenko
Altinity Ltd
 
Иван Глушков (Echo)
Ontico
 
Introduction to CosmosDB - Azure Bootcamp 2018
Josh Carlisle
 

Similar to Mongo db admin_20110329 (20)

PPTX
Operationalizing MongoDB at AOL
radiocats
 
PPTX
MongoDC 2012: "Operationalizing" MongoDB@AOL
MongoDB
 
PDF
Austin Cassandra Users 6/19: Apache Cassandra at Vast
DataStax Academy
 
PDF
The MySQL Server ecosystem in 2016
sys army
 
PDF
The MySQL Server ecosystem in 2016
Colin Charles
 
PPTX
Hadoop ppt on the basics and architecture
saipriyacoool
 
PPTX
Drop acid
Mike Feltman
 
PDF
Your backend architecture is what matters slideshare
Colin Charles
 
PDF
MySQL in the Hosted Cloud
Colin Charles
 
PPTX
Agility and Scalability with MongoDB
MongoDB
 
PDF
Pldc2012 monitoring-and-trending-with-mysql
radiocats
 
PPTX
How does Apache Pegasus (incubating) community develop at SensorsData
acelyc1112009
 
PPTX
Meetup#2: Building responsive Symbology & Suggest WebService
Minsk MongoDB User Group
 
PPTX
Monitoring MongoDB’s Engines in the Wild
Tim Vaillancourt
 
PDF
City of Atlanta Oracle Application Footprint
Danny Bryant
 
PDF
Intro to big data choco devday - 23-01-2014
Hassan Islamov
 
PDF
Webinar - DreamObjects/Ceph Case Study
Ceph Community
 
PDF
MySQL in the Cloud
Colin Charles
 
PDF
OSDC 2017 | Lessons from database failures by Colin Charles
NETWAYS
 
PDF
The MySQL Server Ecosystem in 2016
Colin Charles
 
Operationalizing MongoDB at AOL
radiocats
 
MongoDC 2012: "Operationalizing" MongoDB@AOL
MongoDB
 
Austin Cassandra Users 6/19: Apache Cassandra at Vast
DataStax Academy
 
The MySQL Server ecosystem in 2016
sys army
 
The MySQL Server ecosystem in 2016
Colin Charles
 
Hadoop ppt on the basics and architecture
saipriyacoool
 
Drop acid
Mike Feltman
 
Your backend architecture is what matters slideshare
Colin Charles
 
MySQL in the Hosted Cloud
Colin Charles
 
Agility and Scalability with MongoDB
MongoDB
 
Pldc2012 monitoring-and-trending-with-mysql
radiocats
 
How does Apache Pegasus (incubating) community develop at SensorsData
acelyc1112009
 
Meetup#2: Building responsive Symbology & Suggest WebService
Minsk MongoDB User Group
 
Monitoring MongoDB’s Engines in the Wild
Tim Vaillancourt
 
City of Atlanta Oracle Application Footprint
Danny Bryant
 
Intro to big data choco devday - 23-01-2014
Hassan Islamov
 
Webinar - DreamObjects/Ceph Case Study
Ceph Community
 
MySQL in the Cloud
Colin Charles
 
OSDC 2017 | Lessons from database failures by Colin Charles
NETWAYS
 
The MySQL Server Ecosystem in 2016
Colin Charles
 
Ad

Recently uploaded (20)

PPTX
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
PDF
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 
PDF
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
PPTX
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
PDF
Blockchain Transactions Explained For Everyone
CIFDAQ
 
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
PDF
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
PDF
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
PDF
Fl Studio 24.2.2 Build 4597 Crack for Windows Free Download 2025
faizk77g
 
PDF
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
PPTX
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
PDF
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
PDF
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
PDF
July Patch Tuesday
Ivanti
 
PDF
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
PDF
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
PDF
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
PDF
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
PDF
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
PPTX
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
From Sci-Fi to Reality: Exploring AI Evolution
Svetlana Meissner
 
LLMs.txt: Easily Control How AI Crawls Your Site
Keploy
 
DevBcn - Building 10x Organizations Using Modern Productivity Metrics
Justin Reock
 
"Autonomy of LLM Agents: Current State and Future Prospects", Oles` Petriv
Fwdays
 
Blockchain Transactions Explained For Everyone
CIFDAQ
 
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Safe Software
 
What Makes Contify’s News API Stand Out: Key Features at a Glance
Contify
 
Empower Inclusion Through Accessible Java Applications
Ana-Maria Mihalceanu
 
Fl Studio 24.2.2 Build 4597 Crack for Windows Free Download 2025
faizk77g
 
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Safe Software
 
OpenID AuthZEN - Analyst Briefing July 2025
David Brossard
 
Transcript: New from BookNet Canada for 2025: BNC BiblioShare - Tech Forum 2025
BookNet Canada
 
Newgen Beyond Frankenstein_Build vs Buy_Digital_version.pdf
darshakparmar
 
July Patch Tuesday
Ivanti
 
POV_ Why Enterprises Need to Find Value in ZERO.pdf
darshakparmar
 
Presentation - Vibe Coding The Future of Tech
yanuarsinggih1
 
Bitcoin for Millennials podcast with Bram, Power Laws of Bitcoin
Stephen Perrenod
 
Newgen 2022-Forrester Newgen TEI_13 05 2022-The-Total-Economic-Impact-Newgen-...
darshakparmar
 
HubSpot Main Hub: A Unified Growth Platform
Jaswinder Singh
 
WooCommerce Workshop: Bring Your Laptop
Laura Hartwig
 
Ad

Mongo db admin_20110329

  • 1. Administration Michael DelNegro Principal Database Administrator AOL 1
  • 2. Presentation Overview • Introduction • My Applications • Tips • Tools • Resources • Upcoming 2
  • 3. About Me • DBA at AOL (Dulles) for six years • Background in Sybase • Now MySQL, PostgreSQL and NoSQL • Was: Blogsmith, Uncut Video, Travel, Autos, Journals, Real Estate, Ficlets, Shopping • Currently: Patch, MapQuest, HSS, Datalayer, Demand • I Heart Big Data 3
  • 4. About MongoDB • “Scalable, high-performance, open source, document-oriented database” • Databases (Databases) • Collections (Tables) • Documents (Rows) • Fields (Columns) - K/V Pairs • Indexes • No Joins • Favors Embedding Data instead of FKs 4
  • 5. More About MongoDB • JSON-style Documents • Javascript Shell • Capped Collections • Flexible Schemas • Replication w/ Autofailover • Sharding • GridFS: File Storage • Map-Reduce 5
  • 6. MongoDB Support • Operating Systems • Linux, Windows, Mac OS X, Solaris • 32bit, 64bit • Drivers • Java, Javascript, Perl, Ruby, Scala, Erlang, C, C#, C++, Haskell, PHP, Python • R, Smalltalk, node.js, ColdFusion 6
  • 7. MongoDB Use Cases • Website Data Store • Caching Tier • Document and Content Mgmt Systems • Event Logging • Real-time Stats/Analytics • Archiving • High Volume Problems 7
  • 8. MongoDB Misuse • Complex Transactional Systems • Traditional Business Intelligence • SQL is a Must 8
  • 9. MongoDB at AOL • In use since Summer 2010 • I currently administer two applications for MapQuest and Patch • There are other MongoDB applications in use throughout the company and more on the way 9
  • 10. MapQuest App • Tracks User Profile Information • V. 1.6.5.latest (just upgraded from 1.6.3) • 26 Total Hosts, CentOS 5, 16GB RAM • 300 million users, 130 million saved maps • Replica Sets • 3 Sharded Collections • lookup, east coast, west coast • Java 10
  • 11. Patch App • Tracks User Activity • Master, 2 Slaves • V. 1.6.3 • About 100GB of data • Throwaway Data (for now) • Ruby on Rails 11
  • 12. About Patch • “HyperLocal” news sites across the country • Fills gap in coverage left by local newspapers • Currently 800 sites are live • 1000+ by end of 2011 12
  • 13. Nearby Patch Sites • Vienna (ex. vienna.patch.com) • Ashburn • Reston • McLean • CollegePark • GreaterAnnapolis • 50+ in DC Area 13
  • 14. Upcoming Ops Plans • Upgrade to 1.8 • Migrate Patch to Replica Sets • Move MapQuest to bigger hardware (16GB -> 64GB memory) • Add additional slaves 14
  • 15. Admin Tips • Slaves are a MUST pre1.8 • Use 64 bit version • 32 bit version has 2.5 GB storage limit • Use xfs or ext4 • Keep eye on oplog size • Turn off atime & dtime • Consider using getLastError() 15
  • 16. More Admin Tips • Increase File Descriptor Limits • Do not use kill -9 (pre-1.8) • Consider having a slave on replication delay • -- slavedelay <seconds> • db.runCommand(“logRotate”) • Keep db.<collection>.totalIndexSize() less than RAM • Linux dirty_background_ratio and dirty_ratio (pre 2.6.22) 16
  • 17. Even More Admin Tips • Omit parenthesis to see command details • 5 Primitives of Mongo • insert, remove, update, find, getMore • Replication is a slave polling master process • Master and slaves each have their own oplog • Choose shard key carefully (ex. timestamp) 17
  • 18. Admin Tools • mongodump-mongorestore • use fsync and lock database to ensure consistent backup • fsync and lock are a must for file system backups (ex LVM) • http://localhost:28017 (server port + 1000) • db.currentOp() 18
  • 19. More Admin Tools • mongostat • db.printReplicationInfo() • db.serverStatus() • db.<collection>.stats() • Database Profiler • Explain 19
  • 20. Admin Resources • mongodb.org • Events • Forums • Wordnik Mongo Admin Tools (Github) • Mongo Snippets (Github) • IRC (freenode #mongodb) • Little MongoDB Book 20
  • 21. More Admin Resources • slideshare (Use Time-Based Search) • GUI Admin Tools • MongoVUE • Others • Kristina Chodorow's Blog • Boxed Ice 21
  • 22. Even More Resources • Follow @MongoQuestion (StackOverflow) • MongoDB on Quora (@q_mongodb) • 10gen Deployment Strategies Slides • Books • Training • 10gen Support • Office Hours in NYC and Redwood City 22
  • 23. New MongoDB Release • 1.8 (Released March 16) • Single Server Durability (Journaling) • Enhancements to Sharding & Replica Sets • Covered and Sparse Indexes • Tab Completion • Maximum BSON Document: 16MB • 1.8 Features Presentation 23
  • 24. Future Releases • 2.0 (May/June?) • Better Map-Reduce and Aggregation • Improved Concurrency • Online Compaction • TTL Time-Out Collections • Beyond • Full-Text Search? 24
  • 25. Thank You! • www.slideshare.net/radiocats • @radiocats on Twitter • www.linkedin.com/in/mdelnegro 25

Editor's Notes