SlideShare a Scribd company logo
XNAT Tuning & Monitoring
                             John Paulett
                      jpaulett@wustl.edu
                          June 29, 2010
Overview
Share NRG’s experiences running a large XNAT
  installation, including methods for tuning,
   testing, and monitoring the application.
Plan
1.   Sample XNAT Architecture
2.   Hardware “Recommendations”
3.   Monitoring XNAT
4.   Performance Testing Tools
5.   Tuning XNAT
XNAT SCALES!
XNAT Scales
From a single study with dozens of scans

To hundreds of studies, including large, multi-
  site studies
Central Neuroimaging Data Archive
(CNDA)
Flagship XNAT installation at Washington
  University (WUSTL)

As of June 2010:
  – 500 studies
  – 8000 subjects
  – 11k imaging sessions
  – 9 TB of data
CNDA Architecture
CNDA Architecture
• 1x Kemp load balancer
  – SSL acceleration
• 1x Quad-core Xeon, 16GB RAM: PostgreSQL
  8.3
• 2x Dual-core Xeon, 4GB RAM (one in
  standby): Tomcat 5.5 & DicomServer
• BlueArc NAS
• Multiple Sun Grid Engine nodes
FUTURE ARCHITECTURE
Future Architecture
Reduce single points of failure
  – Standby Kemp load balancer
  – PostgreSQL Warm Standby
  – Actively load balance Tomcat
  – Archival storage SAN

Use new super computer at WUSTL

Improve ability to upgrade without downtime
Future Architecture
“RECOMMENDED” HARDWARE
Grow into your Architecture
Get single good server
  – Multicore with 4-16GB RAM (better than
    central.xnat.org)
  – Consider your archive’s future size & location


When you outgrow:
  – Buy a more powerful machine for PostgreSQL
  – Leave Tomcat on first server
MONITORING XNAT
XNAT Tuning & Monitoring
Pingdom
• World-wide tests for site availability &
  response time
• SMS & Email alerts when sites are
  unavailable
Munin
PostgreSQL, Tomcat, & Linux metrics over time
  – Memory, CPU, queries, requests, etc.
XNAT Tuning & Monitoring
XNAT Tuning & Monitoring
Monit
Active process monitoring & management

Define criteria for emailing alerts & restarting
 processes
  – CPU, memory thresholds
  – Connection failures (check web services)
PERFORMANCE TESTING TOOLS
JMeter
Generate load & analyze throughput

Complex HTTP transactions
JMeter




         https://svn.apache.org/repos/asf/jakarta/jmeter/
YourKit Profiling
Lower level debugging tool

Memory & CPU profiling

Hunt down memory leaks & code hot spots

Can instrument in production server
http://jira.atlassian.com/browse/JRA-12524
TUNING XNAT
On Tuning
Tuning results dependent on many variables,
  what worked in one case may not work
  universally

XNAT is a complex system! Some parts are CPU
  bound, others are memory bound, and others
  are bandwidth bound

General rule:
 faster CPUs + more RAM + bigger network pipe = faster XNAT
On Tuning
1.   Find something that is “slow”
2.   Quantify slowness
3.   Tune
4.   Quantify improvement
5.   Go to #1
PostgreSQL Tuning
PostgreSQL 8.3 has serious performance
  improvements

Put PostgreSQL and Tomcat on separate
  machines
  – Get powerful database machine
  – PostgreSQL can take advantage of multiple
    cores & lots of RAM
postgresql.conf
Default settings designed for legacy machines

Increase available memory. Allows query planner
  to do more work in RAM and less on disk

Increase max connections

Tweak kernel settings to allow access to more
  memory
Tomcat Tuning
Increase available memory & use “server VM”

catalina.sh:
JAVA_OPTS="$JAVA_OPTS "-XX:MaxPermSize=256m" "-
  XX:PermSize=256m" "-mx1512m" "-server"
Tomcat Tuning
Increase connections & threads in server.xml

  – At this point, consider load balancing between
    multiple Tomcat servers
XNAT Tuning
Upgrade to XNAT 1.4!

Increase MaxConnections to database in
 WEB-INF/conf/InstanceSettings.xml
  – Set in line with PostgreSQL’s max_connections
Tools Mentioned
Google Analytics: Free
Pingdom: Monthly subscription. One site free
Munin: Open Source
Monit: Open Source
JMeter: Open Source
YourKit: Commercial. Time-limited trial & free
  for open source
Questions & Your Experiences?
http://www.xnat.org/XNAT+2010+Workshop+-
   +Tuning,+Optimization,+Monitoring
XNAT Tuning & Monitoring

More Related Content

PDF
Bruno Silva - eMedLab: Merging HPC and Cloud for Biomedical Research
Danny Abukalam
 
PDF
MEETUP - Unboxing Apache Cassandra 3.10
Erick Ramirez
 
PDF
CASSANDRA MEETUP - Choosing the right cloud instances for success
Erick Ramirez
 
PDF
OpenNebula TechDay Boston 2015 - HA HPC with OpenNebula
OpenNebula Project
 
PDF
Integration of Glusterfs in to commvault simpana
Gluster.org
 
PDF
Exploring the Performance Impact of Virtualization on an HPC Cloud
Ryousei Takano
 
PDF
Sharding: Past, Present and Future with Krutika Dhananjay
Gluster.org
 
PDF
User-space Network Processing
Ryousei Takano
 
Bruno Silva - eMedLab: Merging HPC and Cloud for Biomedical Research
Danny Abukalam
 
MEETUP - Unboxing Apache Cassandra 3.10
Erick Ramirez
 
CASSANDRA MEETUP - Choosing the right cloud instances for success
Erick Ramirez
 
OpenNebula TechDay Boston 2015 - HA HPC with OpenNebula
OpenNebula Project
 
Integration of Glusterfs in to commvault simpana
Gluster.org
 
Exploring the Performance Impact of Virtualization on an HPC Cloud
Ryousei Takano
 
Sharding: Past, Present and Future with Krutika Dhananjay
Gluster.org
 
User-space Network Processing
Ryousei Takano
 

What's hot (20)

PDF
Mosix Cluster
Abhay Pai
 
PDF
USENIX NSDI 2016 (Session: Resource Sharing)
Ryousei Takano
 
PDF
State of Gluster Performance
Gluster.org
 
PDF
NSCC Training Introductory Class
National Supercomputing Centre Singapore
 
PDF
クラウド時代の半導体メモリー技術
Ryousei Takano
 
PDF
NSCC Training - Introductory Class
National Supercomputing Centre Singapore
 
PPTX
NSCC Training Introductory Class
National Supercomputing Centre Singapore
 
PPTX
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
In-Memory Computing Summit
 
PPTX
ДЕНИС КЛЕПIКОВ «Long Term storage for Prometheus» Lviv DevOps Conference 2019
UA DevOps Conference
 
PDF
Hyperconverged Cloud, Not just a toy anymore - Andrew Hatfield, Red Hat
OpenStack
 
PDF
Deploying pNFS over Distributed File Storage w/ Jiffin Tony Thottan and Niels...
Gluster.org
 
PPTX
4 use cases for C* to Scylla
◄ ★ Jack Pavlov ★ ►
 
PDF
OpenNebula TechDay Boston 2015 - Bringing Private Cloud Computing to HPC and ...
OpenNebula Project
 
PDF
CPU Optimizations in the CERN Cloud - February 2016
Belmiro Moreira
 
PDF
Programming Trends in High Performance Computing
Juris Vencels
 
PDF
OpenNebula out in the Open, Ander Astudillo, SURFsara
OpenNebula Project
 
PDF
OpenNebulaConf2015 2.05 OpenNebula at the Leibniz Supercomputing Centre - Mat...
OpenNebula Project
 
PDF
OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure -...
OpenNebula Project
 
PDF
OpenNebulaConf2015 1.09.02 Installgems Add-on - Alvaro Simon Garcia
OpenNebula Project
 
PDF
Propelling IoT Innovation with Predictive Analytics
SingleStore
 
Mosix Cluster
Abhay Pai
 
USENIX NSDI 2016 (Session: Resource Sharing)
Ryousei Takano
 
State of Gluster Performance
Gluster.org
 
NSCC Training Introductory Class
National Supercomputing Centre Singapore
 
クラウド時代の半導体メモリー技術
Ryousei Takano
 
NSCC Training - Introductory Class
National Supercomputing Centre Singapore
 
NSCC Training Introductory Class
National Supercomputing Centre Singapore
 
IMC Summit 2016 Breakout - Andy Pavlo - What Non-Volatile Memory Means for th...
In-Memory Computing Summit
 
ДЕНИС КЛЕПIКОВ «Long Term storage for Prometheus» Lviv DevOps Conference 2019
UA DevOps Conference
 
Hyperconverged Cloud, Not just a toy anymore - Andrew Hatfield, Red Hat
OpenStack
 
Deploying pNFS over Distributed File Storage w/ Jiffin Tony Thottan and Niels...
Gluster.org
 
4 use cases for C* to Scylla
◄ ★ Jack Pavlov ★ ►
 
OpenNebula TechDay Boston 2015 - Bringing Private Cloud Computing to HPC and ...
OpenNebula Project
 
CPU Optimizations in the CERN Cloud - February 2016
Belmiro Moreira
 
Programming Trends in High Performance Computing
Juris Vencels
 
OpenNebula out in the Open, Ander Astudillo, SURFsara
OpenNebula Project
 
OpenNebulaConf2015 2.05 OpenNebula at the Leibniz Supercomputing Centre - Mat...
OpenNebula Project
 
OpenNebulaConf2015 2.14 Cloud Service Experience in TeideHPC Infrastructure -...
OpenNebula Project
 
OpenNebulaConf2015 1.09.02 Installgems Add-on - Alvaro Simon Garcia
OpenNebula Project
 
Propelling IoT Innovation with Predictive Analytics
SingleStore
 
Ad

Similar to XNAT Tuning & Monitoring (20)

PPTX
CPN302 your-linux-ami-optimization-and-performance
Coburn Watson
 
PDF
Sanger OpenStack presentation March 2017
Dave Holland
 
PDF
NAVGEM on the Cloud: Computational Evaluation of Cloud HPC with a Global Atmo...
inside-BigData.com
 
PDF
Ceph Day Beijing - Ceph all-flash array design based on NUMA architecture
Ceph Community
 
PDF
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Danielle Womboldt
 
PDF
Toward 10,000 Containers on OpenStack
Ton Ngo
 
PDF
Leveraging chaos mesh in Astra Serverless testing
Pierre Laporte
 
PDF
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Databricks
 
PPTX
Building big data pipelines with Kafka and Kubernetes
Venu Ryali
 
PDF
VMworld 2013: How SRP Delivers More Than Power to Their Customers
VMworld
 
PPT
PFQ@ PAM12
Nicola Bonelli
 
PPTX
OpenStack HA
tcp cloud
 
PPTX
OpenStack High Availability
Jakub Pavlik
 
PDF
Tackling Network Bottlenecks with Hardware Accelerations: Cloud vs. On-Premise
Databricks
 
PDF
Tips Tricks and Tactics with Cells and Scaling OpenStack - May, 2015
Belmiro Moreira
 
PDF
OSMC 2019 | Monitoring Alerts and Metrics on Large Power Systems Clusters by ...
NETWAYS
 
PPTX
Lessons learned from shifting real data around: An ad hoc data challenge from...
Jisc
 
PDF
S016827 pendulum-swings-nola-v1710d
Tony Pearson
 
PDF
IBM Power9 Features and Specifications
inside-BigData.com
 
PPTX
High performace network of Cloud Native Taiwan User Group
HungWei Chiu
 
CPN302 your-linux-ami-optimization-and-performance
Coburn Watson
 
Sanger OpenStack presentation March 2017
Dave Holland
 
NAVGEM on the Cloud: Computational Evaluation of Cloud HPC with a Global Atmo...
inside-BigData.com
 
Ceph Day Beijing - Ceph all-flash array design based on NUMA architecture
Ceph Community
 
Ceph Day Beijing - Ceph All-Flash Array Design Based on NUMA Architecture
Danielle Womboldt
 
Toward 10,000 Containers on OpenStack
Ton Ngo
 
Leveraging chaos mesh in Astra Serverless testing
Pierre Laporte
 
Experience of Running Spark on Kubernetes on OpenStack for High Energy Physic...
Databricks
 
Building big data pipelines with Kafka and Kubernetes
Venu Ryali
 
VMworld 2013: How SRP Delivers More Than Power to Their Customers
VMworld
 
PFQ@ PAM12
Nicola Bonelli
 
OpenStack HA
tcp cloud
 
OpenStack High Availability
Jakub Pavlik
 
Tackling Network Bottlenecks with Hardware Accelerations: Cloud vs. On-Premise
Databricks
 
Tips Tricks and Tactics with Cells and Scaling OpenStack - May, 2015
Belmiro Moreira
 
OSMC 2019 | Monitoring Alerts and Metrics on Large Power Systems Clusters by ...
NETWAYS
 
Lessons learned from shifting real data around: An ad hoc data challenge from...
Jisc
 
S016827 pendulum-swings-nola-v1710d
Tony Pearson
 
IBM Power9 Features and Specifications
inside-BigData.com
 
High performace network of Cloud Native Taiwan User Group
HungWei Chiu
 
Ad

More from John Paulett (8)

PDF
GeoDjango & HTML5 Geolocation
John Paulett
 
PDF
XNAT Case Study: DIAN QC Uploader
John Paulett
 
PDF
XNAT Open Source Development
John Paulett
 
PDF
Mercurial: Modern Source Control Management
John Paulett
 
PDF
PostgreSQL Scaling And Failover
John Paulett
 
PDF
Drools
John Paulett
 
PDF
Automated Testing
John Paulett
 
PDF
Getting Started with (Distributed) Version Control
John Paulett
 
GeoDjango & HTML5 Geolocation
John Paulett
 
XNAT Case Study: DIAN QC Uploader
John Paulett
 
XNAT Open Source Development
John Paulett
 
Mercurial: Modern Source Control Management
John Paulett
 
PostgreSQL Scaling And Failover
John Paulett
 
Drools
John Paulett
 
Automated Testing
John Paulett
 
Getting Started with (Distributed) Version Control
John Paulett
 

Recently uploaded (20)

PDF
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
PPTX
The Future of AI & Machine Learning.pptx
pritsen4700
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PDF
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PPTX
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
PDF
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
PDF
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PPTX
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PDF
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
Tea4chat - another LLM Project by Kerem Atam
a0m0rajab1
 
The Future of AI & Machine Learning.pptx
pritsen4700
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
IT Runs Better with ThousandEyes AI-driven Assurance
ThousandEyes
 
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
MASTERDECK GRAPHSUMMIT SYDNEY (Public).pdf
Neo4j
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
Orbitly Pitch Deck|A Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 

XNAT Tuning & Monitoring