SlideShare a Scribd company logo
New Zealand eScience Infrastructure
New Zealand eScience Infrastructure
First steps with Globus Compute
multi-user endpoint
Chris Scott (NeSI)
chris.scott@nesi.org.nz
New Zealand eScience Infrastructure
New Zealand eScience Infrastructure
● Background about NeSI
● Details about our use case for Globus Compute
● Some challenges encountered
● Early experience with multi-user endpoint
Outline
New Zealand eScience Infrastructure
Andrew Chen (Engineering)
Using NeSI supercomputers for advancing image
processing capabilities using computer vision
Dr Olaf Morgenstern
and Dr Erik Behrens
(Earth Science)
Deep South Challenge
project using NeSI
supercomputers
for climate modelling.
Yoshihiro Kaneko
(Seismology)
GNS Science using NeSI
supercomputers to
recreate earthquake
events to better
understand their
processes and
aftermath effects.
Dr Kim Handley
(Biological
Sciences)
Genomics Aotearoa
project using NeSI
supercomputers to
better understand
environmental
processes on a
microbial level
Dr Sarah Masters,
Dr Deborah Crittenden,
Nathaniel Gunby (Chemistry)
Using NeSI supercomputers to
develop new analysis tools for
studying molecules’ properties.
Dr Richie Poulton
(Psychology)
Using NeSI Data
Transfer platform to
send MRI scan images
from Dunedin
Multidisciplinary Health
& Development Study
Research Unit to a
partner laboratory in
the United States for
analysis.
NeSI is a national
collaboration of:
New Zealand eScience Infrastructure
New Zealand eScience Infrastructure
NeSI
Services
High performance computing (HPC) and data analytics
• Fit-for-purpose national HPC platform (CPUs, GPUs) including specialised
software and tools for data analytics
• Flexible environments for research DevOps / research software engineers
• Hybrid eResearch platforms for institutions and communities
Data services
• High speed, secure data transfer with end-to-end integration
• Hosting of large actively used research datasets, repositories, and archives
Training and researcher skill development
• In-person and online training to grow capabilities in NZ research sector
• Partnership with The Carpentries (global programme to teach foundational
coding and data science skills to researchers)
Consultancy
• Embed Research Software Engineers into research teams to lift their
computational capabilities, optimise tools & workflows, develop code and more
New Zealand eScience Infrastructure
Mahuika and Māui are housed inside a purpose-built
High Performance Computing Facility in Wellington.
New Zealand eScience Infrastructure
New Zealand eScience Infrastructure
● Pharmacology group at the University of Auckland
○ Improving understanding of medicines in humans
and improving dosing
○ Want to manage their research from familiar
Windows environment
○ Parts of their workflow require significant resources
■ Non-linear mixed effects modelling (NONMEM)
■ Fortran MPI code
■ Bootstrapping
Use case
New Zealand eScience Infrastructure
New Zealand eScience Infrastructure
● Implemented in Python using Globus SDKs (transfer
and compute)
● Provides an API that Wings for NONMEM (wfn)
interfaces with (compatible with previous version):
○ Upload files and submit Slurm jobs
○ Wait for Slurm jobs to complete and download files
● Globus HTTPS transfer to Guest collection
● Globus compute running on login node (user managed)
● Globus authentication - token based
https://github.com/chrisdjscott/RemoteJobManager
Remote Job
Manager tool
New Zealand eScience Infrastructure
New Zealand eScience Infrastructure
● Each researcher has to manage their own endpoint
○ Start it, restart it if it gets into a bad state (e.g.
caused by instabilities on our platform), login node
gets rebooted, etc.
○ Each user has a scron job that checks state of the
endpoint every hour and restarts it if necessary
○ Lot of extra complexity and places for things to go
wrong
Notes about
current use of
Globus
Compute
New Zealand eScience Infrastructure
New Zealand eScience Infrastructure
● Using LocalProvider instead of SlurmProvider
○ Resources vary between NONMEM simulations
(time, cores, memory) but endpoint resources are
hardcoded when you create the endpoint
○ WFN writes a template Slurm script that gets
uploaded and submitted
○ Poll for Slurm job completion
○ Extra code/complexity that we wouldn’t need if
could use the SlurmProvider
Notes about
current use of
Globus
Compute
New Zealand eScience Infrastructure
New Zealand eScience Infrastructure
● Early stages of testing the multi-user endpoint
● Two changes that make things much better for us
○ Centrally managed - we run the endpoint as a service
and give the researchers the endpoint id
○ Endpoint resources are no longer hard-coded
■ Can use templating in the endpoint config to
allow researchers to pass in certain parameters
when they call the function
Multi-user
endpoint
New Zealand eScience Infrastructure
New Zealand eScience Infrastructure
● Multi-user endpoint looks like it will work really well
for this group
○ Centrally managed service (users don’t have to
run anything themselves)
○ Can switch to SlurmProvider, will simplify things
● Next steps are to continue testing with the wider
team and update RJM to use multi-user endpoint so
the researchers can benefit from it
Summary

More Related Content

Similar to First Steps with Globus Compute Multi-User Endpoints (20)

PPTX
Synergy 2014 - Syn122 Moving Australian National Research into the Cloud
Citrix
 
PPTX
Thoughts on Cybersecurity
Frank Wuerthwein
 
PDF
CloudLightning and the OPM-based Use Case
CloudLightning
 
PDF
Augmented and Virtual Reality Presentation by university of Michigan
vkgtharet
 
PPTX
Dp2 ppt by_bikramjit_chowdhury_final
Bikramjit Chowdhury
 
PDF
SciNet -- Pushing scientific boundaries
Lenovo Data Center
 
PPTX
Rack Cluster Deployment for SDSC Supercomputer
Rebekah Rodriguez
 
PDF
01-06 OCRE Test Suite - Fernandes.pdf
OCRE | Open Clouds for Research Environments
 
PDF
ApacheCon NA 2013
LucaCinquini
 
PPTX
Research in the Cloud
David Wallom
 
PDF
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
Ilkay Altintas, Ph.D.
 
PPTX
Continuous modeling - automating model building on high-performance e-Infrast...
Ola Spjuth
 
PPTX
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Larry Smarr
 
PDF
Accelerating Research and Enterprise Solutions by Bridging HPC and AI
inside-BigData.com
 
PDF
Reliable, Remote Computation at All Scales
Globus
 
PPTX
Last Conference 2017: Big Data in a Production Environment: Lessons Learnt
Mark Grebler
 
PDF
Advanced Research Computing at York
Ming Li
 
PDF
Monitoring Exascale Supercomputers With Tim Osborne | Current 2022
HostedbyConfluent
 
PDF
NASA Advanced Computing Environment for Science & Engineering
inside-BigData.com
 
PDF
Frank Würthwein - NRP and the Path forward
Larry Smarr
 
Synergy 2014 - Syn122 Moving Australian National Research into the Cloud
Citrix
 
Thoughts on Cybersecurity
Frank Wuerthwein
 
CloudLightning and the OPM-based Use Case
CloudLightning
 
Augmented and Virtual Reality Presentation by university of Michigan
vkgtharet
 
Dp2 ppt by_bikramjit_chowdhury_final
Bikramjit Chowdhury
 
SciNet -- Pushing scientific boundaries
Lenovo Data Center
 
Rack Cluster Deployment for SDSC Supercomputer
Rebekah Rodriguez
 
01-06 OCRE Test Suite - Fernandes.pdf
OCRE | Open Clouds for Research Environments
 
ApacheCon NA 2013
LucaCinquini
 
Research in the Cloud
David Wallom
 
A Maturing Role of Workflows in the Presence of Heterogenous Computing Archit...
Ilkay Altintas, Ph.D.
 
Continuous modeling - automating model building on high-performance e-Infrast...
Ola Spjuth
 
Panel: Building the NRP Ecosystem with the Regional Networks on their Campuses;
Larry Smarr
 
Accelerating Research and Enterprise Solutions by Bridging HPC and AI
inside-BigData.com
 
Reliable, Remote Computation at All Scales
Globus
 
Last Conference 2017: Big Data in a Production Environment: Lessons Learnt
Mark Grebler
 
Advanced Research Computing at York
Ming Li
 
Monitoring Exascale Supercomputers With Tim Osborne | Current 2022
HostedbyConfluent
 
NASA Advanced Computing Environment for Science & Engineering
inside-BigData.com
 
Frank Würthwein - NRP and the Path forward
Larry Smarr
 

More from Globus (20)

PDF
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus
 
PDF
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Globus
 
PDF
Globus Compute Introduction - GlobusWorld 2024
Globus
 
PDF
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus
 
PDF
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Globus
 
PDF
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Globus
 
PDF
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
PDF
Understanding Globus Data Transfers with NetSage
Globus
 
PDF
How to Position Your Globus Data Portal for Success Ten Good Practices
Globus
 
PDF
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 
PDF
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Globus
 
PDF
The Department of Energy's Integrated Research Infrastructure (IRI)
Globus
 
PDF
GlobusWorld 2024 Opening Keynote session
Globus
 
PDF
Enhancing Performance with Globus and the Science DMZ
Globus
 
PDF
Extending Globus into a Site-wide Automated Data Infrastructure.pdf
Globus
 
PDF
Globus at the United States Geological Survey
Globus
 
PDF
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Globus
 
PDF
Globus Compute with Integrated Research Infrastructure (IRI) workflows
Globus
 
PDF
Reactive Documents and Computational Pipelines - Bridging the Gap
Globus
 
PDF
Innovating Inference at Exascale - Remote Triggering of Large Language Models...
Globus
 
Globus Compute wth IRI Workflows - GlobusWorld 2024
Globus
 
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Globus
 
Globus Compute Introduction - GlobusWorld 2024
Globus
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus
 
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Globus
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Globus
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
Understanding Globus Data Transfers with NetSage
Globus
 
How to Position Your Globus Data Portal for Success Ten Good Practices
Globus
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Globus
 
The Department of Energy's Integrated Research Infrastructure (IRI)
Globus
 
GlobusWorld 2024 Opening Keynote session
Globus
 
Enhancing Performance with Globus and the Science DMZ
Globus
 
Extending Globus into a Site-wide Automated Data Infrastructure.pdf
Globus
 
Globus at the United States Geological Survey
Globus
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Globus
 
Globus Compute with Integrated Research Infrastructure (IRI) workflows
Globus
 
Reactive Documents and Computational Pipelines - Bridging the Gap
Globus
 
Innovating Inference at Exascale - Remote Triggering of Large Language Models...
Globus
 
Ad

Recently uploaded (20)

PPTX
Comprehensive Risk Assessment Module for Smarter Risk Management
EHA Soft Solutions
 
PPTX
Agentic Automation Journey Session 1/5: Context Grounding and Autopilot for E...
klpathrudu
 
PPTX
Change Common Properties in IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
PPTX
OpenChain @ OSS NA - In From the Cold: Open Source as Part of Mainstream Soft...
Shane Coughlan
 
PDF
Build It, Buy It, or Already Got It? Make Smarter Martech Decisions
bbedford2
 
PDF
Wondershare PDFelement Pro Crack for MacOS New Version Latest 2025
bashirkhan333g
 
PPTX
Agentic Automation Journey Series Day 2 – Prompt Engineering for UiPath Agents
klpathrudu
 
PDF
Generic or Specific? Making sensible software design decisions
Bert Jan Schrijver
 
PPTX
Customise Your Correlation Table in IBM SPSS Statistics.pptx
Version 1 Analytics
 
PPTX
Help for Correlations in IBM SPSS Statistics.pptx
Version 1 Analytics
 
PDF
MiniTool Partition Wizard 12.8 Crack License Key LATEST
hashhshs786
 
PDF
SAP Firmaya İade ABAB Kodları - ABAB ile yazılmıl hazır kod örneği
Salih Küçük
 
PDF
TheFutureIsDynamic-BoxLang witch Luis Majano.pdf
Ortus Solutions, Corp
 
PDF
4K Video Downloader Plus Pro Crack for MacOS New Download 2025
bashirkhan333g
 
PDF
MiniTool Power Data Recovery 8.8 With Crack New Latest 2025
bashirkhan333g
 
PPTX
Foundations of Marketo Engage - Powering Campaigns with Marketo Personalization
bbedford2
 
PDF
[Solution] Why Choose the VeryPDF DRM Protector Custom-Built Solution for You...
Lingwen1998
 
PDF
MiniTool Partition Wizard Free Crack + Full Free Download 2025
bashirkhan333g
 
PPTX
Agentic Automation: Build & Deploy Your First UiPath Agent
klpathrudu
 
PDF
AOMEI Partition Assistant Crack 10.8.2 + WinPE Free Downlaod New Version 2025
bashirkhan333g
 
Comprehensive Risk Assessment Module for Smarter Risk Management
EHA Soft Solutions
 
Agentic Automation Journey Session 1/5: Context Grounding and Autopilot for E...
klpathrudu
 
Change Common Properties in IBM SPSS Statistics Version 31.pptx
Version 1 Analytics
 
OpenChain @ OSS NA - In From the Cold: Open Source as Part of Mainstream Soft...
Shane Coughlan
 
Build It, Buy It, or Already Got It? Make Smarter Martech Decisions
bbedford2
 
Wondershare PDFelement Pro Crack for MacOS New Version Latest 2025
bashirkhan333g
 
Agentic Automation Journey Series Day 2 – Prompt Engineering for UiPath Agents
klpathrudu
 
Generic or Specific? Making sensible software design decisions
Bert Jan Schrijver
 
Customise Your Correlation Table in IBM SPSS Statistics.pptx
Version 1 Analytics
 
Help for Correlations in IBM SPSS Statistics.pptx
Version 1 Analytics
 
MiniTool Partition Wizard 12.8 Crack License Key LATEST
hashhshs786
 
SAP Firmaya İade ABAB Kodları - ABAB ile yazılmıl hazır kod örneği
Salih Küçük
 
TheFutureIsDynamic-BoxLang witch Luis Majano.pdf
Ortus Solutions, Corp
 
4K Video Downloader Plus Pro Crack for MacOS New Download 2025
bashirkhan333g
 
MiniTool Power Data Recovery 8.8 With Crack New Latest 2025
bashirkhan333g
 
Foundations of Marketo Engage - Powering Campaigns with Marketo Personalization
bbedford2
 
[Solution] Why Choose the VeryPDF DRM Protector Custom-Built Solution for You...
Lingwen1998
 
MiniTool Partition Wizard Free Crack + Full Free Download 2025
bashirkhan333g
 
Agentic Automation: Build & Deploy Your First UiPath Agent
klpathrudu
 
AOMEI Partition Assistant Crack 10.8.2 + WinPE Free Downlaod New Version 2025
bashirkhan333g
 
Ad

First Steps with Globus Compute Multi-User Endpoints

  • 1. New Zealand eScience Infrastructure New Zealand eScience Infrastructure First steps with Globus Compute multi-user endpoint Chris Scott (NeSI) [email protected]
  • 2. New Zealand eScience Infrastructure New Zealand eScience Infrastructure ● Background about NeSI ● Details about our use case for Globus Compute ● Some challenges encountered ● Early experience with multi-user endpoint Outline
  • 3. New Zealand eScience Infrastructure Andrew Chen (Engineering) Using NeSI supercomputers for advancing image processing capabilities using computer vision Dr Olaf Morgenstern and Dr Erik Behrens (Earth Science) Deep South Challenge project using NeSI supercomputers for climate modelling. Yoshihiro Kaneko (Seismology) GNS Science using NeSI supercomputers to recreate earthquake events to better understand their processes and aftermath effects. Dr Kim Handley (Biological Sciences) Genomics Aotearoa project using NeSI supercomputers to better understand environmental processes on a microbial level Dr Sarah Masters, Dr Deborah Crittenden, Nathaniel Gunby (Chemistry) Using NeSI supercomputers to develop new analysis tools for studying molecules’ properties. Dr Richie Poulton (Psychology) Using NeSI Data Transfer platform to send MRI scan images from Dunedin Multidisciplinary Health & Development Study Research Unit to a partner laboratory in the United States for analysis. NeSI is a national collaboration of:
  • 4. New Zealand eScience Infrastructure New Zealand eScience Infrastructure NeSI Services High performance computing (HPC) and data analytics • Fit-for-purpose national HPC platform (CPUs, GPUs) including specialised software and tools for data analytics • Flexible environments for research DevOps / research software engineers • Hybrid eResearch platforms for institutions and communities Data services • High speed, secure data transfer with end-to-end integration • Hosting of large actively used research datasets, repositories, and archives Training and researcher skill development • In-person and online training to grow capabilities in NZ research sector • Partnership with The Carpentries (global programme to teach foundational coding and data science skills to researchers) Consultancy • Embed Research Software Engineers into research teams to lift their computational capabilities, optimise tools & workflows, develop code and more
  • 5. New Zealand eScience Infrastructure Mahuika and Māui are housed inside a purpose-built High Performance Computing Facility in Wellington.
  • 6. New Zealand eScience Infrastructure New Zealand eScience Infrastructure ● Pharmacology group at the University of Auckland ○ Improving understanding of medicines in humans and improving dosing ○ Want to manage their research from familiar Windows environment ○ Parts of their workflow require significant resources ■ Non-linear mixed effects modelling (NONMEM) ■ Fortran MPI code ■ Bootstrapping Use case
  • 7. New Zealand eScience Infrastructure New Zealand eScience Infrastructure ● Implemented in Python using Globus SDKs (transfer and compute) ● Provides an API that Wings for NONMEM (wfn) interfaces with (compatible with previous version): ○ Upload files and submit Slurm jobs ○ Wait for Slurm jobs to complete and download files ● Globus HTTPS transfer to Guest collection ● Globus compute running on login node (user managed) ● Globus authentication - token based https://github.com/chrisdjscott/RemoteJobManager Remote Job Manager tool
  • 8. New Zealand eScience Infrastructure New Zealand eScience Infrastructure ● Each researcher has to manage their own endpoint ○ Start it, restart it if it gets into a bad state (e.g. caused by instabilities on our platform), login node gets rebooted, etc. ○ Each user has a scron job that checks state of the endpoint every hour and restarts it if necessary ○ Lot of extra complexity and places for things to go wrong Notes about current use of Globus Compute
  • 9. New Zealand eScience Infrastructure New Zealand eScience Infrastructure ● Using LocalProvider instead of SlurmProvider ○ Resources vary between NONMEM simulations (time, cores, memory) but endpoint resources are hardcoded when you create the endpoint ○ WFN writes a template Slurm script that gets uploaded and submitted ○ Poll for Slurm job completion ○ Extra code/complexity that we wouldn’t need if could use the SlurmProvider Notes about current use of Globus Compute
  • 10. New Zealand eScience Infrastructure New Zealand eScience Infrastructure ● Early stages of testing the multi-user endpoint ● Two changes that make things much better for us ○ Centrally managed - we run the endpoint as a service and give the researchers the endpoint id ○ Endpoint resources are no longer hard-coded ■ Can use templating in the endpoint config to allow researchers to pass in certain parameters when they call the function Multi-user endpoint
  • 11. New Zealand eScience Infrastructure New Zealand eScience Infrastructure ● Multi-user endpoint looks like it will work really well for this group ○ Centrally managed service (users don’t have to run anything themselves) ○ Can switch to SlurmProvider, will simplify things ● Next steps are to continue testing with the wider team and update RJM to use multi-user endpoint so the researchers can benefit from it Summary