SlideShare a Scribd company logo
Conf-DDDD-IN
The State of HDF
Summer ESIP 2023
This work was supported by NASA/GSFC under Raytheon Technologies contract number 80GSFC21CA001.
This document does not contain technology or Technical Data controlled under either the U.S. International Traffic
in Arms Regulations or the U.S. Export Administration Regulations.
Dana Robinson
Director of Software Engineering
NASA EED-3 / The HDF Group
derobins@hdfgroup.org
Conf-DDDD-IN
2
• About us
• Software status
• Current focus
• How we can help each other
Outline
Conf-DDDD-IN
3
About us
Conf-DDDD-IN
4
• Located in Champaign, IL
• Spun off from NCSA in 2006
• Non-profit 501(c)(3)
• ~25 employees
The HDF Group
Conf-DDDD-IN
5
• Mission-driven
– Sustainable development of HDF
technologies
– Guarantee continual accessibility of HDF
data
• Services
– Maintain and develop HDF products
– Consulting and support contracts
– Training
The HDF Group
Conf-DDDD-IN
6
• HDF5 1.0.0 was released in 1998
• What will the next 25 years bring?
https://forum.hdfgroup.org/t/what-do-you-want-to-see-in-hdf5-2-0/10003
25 years of HDF5!
🎂 🎉
Conf-DDDD-IN
7
Software Status
Conf-DDDD-IN
8
• HDF5
• HDF4
• HDFView
Release schedules
Conf-DDDD-IN
9
• Latest: 1.14.1(May 2023)
• 1.8 branch retired early this year
• 1.10 and 1.12 retiring this year
• Have not scheduled 2024 yet
HDF5 Schedule
Conf-DDDD-IN
10
• 1.14.0
– Multi-dataset I/O
– Selection I/O
– Subfiling
– Onion VFD
• 1.14.1
– Maintenance release
– bugfixes, minor features
HDF5 New Features
https://www.hdfgroup.org/2023/05/release-of-hdf5-1-14-1-newsletter-194/
Conf-DDDD-IN
11
• 1.14.2
– Read-only S3 VFD improvements
• Better logging
• Temporary security credential support
– CVE-free
HDF5 New Features
Conf-DDDD-IN
12
• Latest: 4.2.16-2 (June 2023)
– Patch release (fixes a Java issue w/ HDFView)
– Bugfixes
– Build system improvements
HDF4 Schedule
https://www.hdfgroup.org/2023/07/release-of-hdf-4-2-16-2-a-patch-release-newsletter-195/
Conf-DDDD-IN
13
HDF4 Changes
https://github.com/HDFGroup/hdf4/discussions
Conf-DDDD-IN
14
As always, we:
• Strive to maintain API compatibility
– HDF5's compatibility macro scheme, etc.
• Are committed to file format forward and
backward compatibility
– Should always be able to create files in
earlier formats
– Should always be able to read earlier file
formats
HDF Product Change Policy
Conf-DDDD-IN
15
Before implementing a breaking change, we
will:
• Create a discussion post on GitHub
• Create a post in the HDF Forum
• Make an announcement in the newsletter
• Attempt to contact people who we know
might be affected
• Allow at least 30 days for comments
HDF Product Change Policy
https://forum.hdfgroup.org/t/hdf4-change-procedure/11240
Conf-DDDD-IN
16
• Latest: HDFView 3.3.0 (April 2023)
• Based on:
– HDF5 1.14.1
– HDF4 4.2.16 (4.2.16-2 coming soon)
• Releases based on older versions of
HDF5 (e.g., 1.10) have been retired
• Looking to modernize or replace this!
HDFView
https://www.hdfgroup.org/2023/04/release-of-hdfview-3-3-0-newsletter-193/
Conf-DDDD-IN
17
Current Focus
Conf-DDDD-IN
18
• Improve software quality
• Increase transparency
• Strengthen our community
• Modernize HDF Products
What are we working on?
Conf-DDDD-IN
19
HDF Product Development
HDF5
HDF4
Conf-DDDD-IN
20
HDF Product Development
HDFView
HSDS
Conf-DDDD-IN
21
• All CVE issues fixed
– https://github.com/HDFGroup/cve_hdf5
– 3 currently unfixed
• Testing improvements
– Integration testing w/ key products (h5py, etc.)
– More transparency w/ CDash
• Resolve unfixed GitHub issues
• Code cleanup
Software quality
Conf-DDDD-IN
22
HDF5, although open source, grew up in a
walled garden and this mindset persists
I'm working to fix this!
The goal is to do all product-specific (e.g.,
HDF5) work with the community
Transparency
Conf-DDDD-IN
23
• HDF5 Working Group meeting
– Every Thursday at 10:05 central time
– Email me (derobins@hdfgroup.org) for an
invite
– Covers pull requests, issues, tech discussion
• Most project planning moving to GitHub
• More testing via GitHub actions
– Will use public CDash for non-GitHub tests
Transparency
Conf-DDDD-IN
24
• Aforementioned HDF5 WG meeting
• Everything in the transparency slides,
really
• Adding external people as code owners
• Spending more time connecting with our
users, both new and old
Community
Conf-DDDD-IN
25
• Some of our key software has been
around for a long time!
– HDF(4): 35 years (1988)
– HDF5: 25 years (1998)
– HDFView: 16 years (2007)
• Although we strive to keep our software
up to date, the code is still old and could
use an overhaul
Modernization
Conf-DDDD-IN
26
• Better support for new compilers
– Especially Intel's oneAPI
• Better support for AI/ML workflows
• Better cloud integration
• Better support for heterogeneous
computing
• Windows Unicode support
Modernization
Conf-DDDD-IN
27
• Better variable-length support
• Multithreading
• Support for sparse data
• Improve performance
• Internal cleanup
– Easier to modify & debug
– Refactor code that made sense in 1998 but
not so much now
Modernization
Conf-DDDD-IN
28
How can we help each other?
Conf-DDDD-IN
29
Support our non-profit mission
Contact: info@hdfgroup.org
https://www.hdfgroup.org/donate
Conf-DDDD-IN
30
• US HUG
– August 16-18, 2023
– The Ohio State University - Columbus, OH
– https://www.hdfgroup.org/hug/hug23/
• European HUG (focus on
compression)
– September 19-21, 2023
– DESY - Hamburg, Germany
– https://indico.desy.de/event/39343/
HDF User Group Meetings
Conf-DDDD-IN
31
Thanks for your time!
Conf-DDDD-IN
32
This work was supported by NASA/GSFC under
Raytheon Technologies contract number
80GSFC21CA001.

More Related Content

Similar to The State of HDF (20)

PPTX
Hierarchical Data Formats (HDF) Update
The HDF-EOS Tools and Information Center
 
PPTX
HDF5 Roadmap 2019-2020
The HDF-EOS Tools and Information Center
 
PPT
HDF Status and Development
The HDF-EOS Tools and Information Center
 
PPT
HDF Software Process - Lessons Learned & Success Factors
The HDF-EOS Tools and Information Center
 
PPTX
HDF5 and Ecosystem: What Is New?
The HDF-EOS Tools and Information Center
 
PPTX
HDF5 and The HDF Group
The HDF-EOS Tools and Information Center
 
PPTX
HDF Tools Updates and Discussions
The HDF-EOS Tools and Information Center
 
PPT
HDF OPeNDAP project update and demo
The HDF-EOS Tools and Information Center
 
PPTX
HDF - Current status and Future Directions
The HDF-EOS Tools and Information Center
 
PDF
Transition from HDF4 to HDF5
The HDF-EOS Tools and Information Center
 
Ad

More from The HDF-EOS Tools and Information Center (20)

PDF
HDF5 2.0: Cloud Optimized from the Start
The HDF-EOS Tools and Information Center
 
PDF
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
The HDF-EOS Tools and Information Center
 
PDF
Cloud-Optimized HDF5 Files - Current Status
The HDF-EOS Tools and Information Center
 
PDF
Cloud Optimized HDF5 for the ICESat-2 mission
The HDF-EOS Tools and Information Center
 
PPTX
Access HDF Data in the Cloud via OPeNDAP Web Service
The HDF-EOS Tools and Information Center
 
PPTX
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
The HDF-EOS Tools and Information Center
 
PDF
Cloud-Optimized HDF5 Files
The HDF-EOS Tools and Information Center
 
PDF
Accessing HDF5 data in the cloud with HSDS
The HDF-EOS Tools and Information Center
 
PPTX
Highly Scalable Data Service (HSDS) Performance Features
The HDF-EOS Tools and Information Center
 
PDF
Creating Cloud-Optimized HDF5 Files
The HDF-EOS Tools and Information Center
 
PPTX
HDF5 OPeNDAP Handler Updates, and Performance Discussion
The HDF-EOS Tools and Information Center
 
PPTX
Hyrax: Serving Data from S3
The HDF-EOS Tools and Information Center
 
PPSX
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
The HDF-EOS Tools and Information Center
 
PDF
HDF - Current status and Future Directions
The HDF-EOS Tools and Information Center
 
PPSX
HDFEOS.org User Analsys, Updates, and Future
The HDF-EOS Tools and Information Center
 
PDF
H5Coro: The Cloud-Optimized Read-Only Library
The HDF-EOS Tools and Information Center
 
PPTX
MATLAB Modernization on HDF5 1.10
The HDF-EOS Tools and Information Center
 
PPTX
HDF for the Cloud - Serverless HDF
The HDF-EOS Tools and Information Center
 
PPTX
HDF for the Cloud - New HDF Server Features
The HDF-EOS Tools and Information Center
 
HDF5 2.0: Cloud Optimized from the Start
The HDF-EOS Tools and Information Center
 
Using a Hierarchical Data Format v5 file as Zarr v3 Shard
The HDF-EOS Tools and Information Center
 
Cloud-Optimized HDF5 Files - Current Status
The HDF-EOS Tools and Information Center
 
Cloud Optimized HDF5 for the ICESat-2 mission
The HDF-EOS Tools and Information Center
 
Access HDF Data in the Cloud via OPeNDAP Web Service
The HDF-EOS Tools and Information Center
 
Upcoming New HDF5 Features: Multi-threading, sparse data storage, and encrypt...
The HDF-EOS Tools and Information Center
 
Cloud-Optimized HDF5 Files
The HDF-EOS Tools and Information Center
 
Accessing HDF5 data in the cloud with HSDS
The HDF-EOS Tools and Information Center
 
Highly Scalable Data Service (HSDS) Performance Features
The HDF-EOS Tools and Information Center
 
Creating Cloud-Optimized HDF5 Files
The HDF-EOS Tools and Information Center
 
HDF5 OPeNDAP Handler Updates, and Performance Discussion
The HDF-EOS Tools and Information Center
 
Hyrax: Serving Data from S3
The HDF-EOS Tools and Information Center
 
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
The HDF-EOS Tools and Information Center
 
HDF - Current status and Future Directions
The HDF-EOS Tools and Information Center
 
HDFEOS.org User Analsys, Updates, and Future
The HDF-EOS Tools and Information Center
 
H5Coro: The Cloud-Optimized Read-Only Library
The HDF-EOS Tools and Information Center
 
MATLAB Modernization on HDF5 1.10
The HDF-EOS Tools and Information Center
 
HDF for the Cloud - Serverless HDF
The HDF-EOS Tools and Information Center
 
HDF for the Cloud - New HDF Server Features
The HDF-EOS Tools and Information Center
 
Ad

Recently uploaded (20)

PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PDF
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PDF
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PDF
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
PDF
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PPTX
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
PPTX
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
PDF
Brief History of Internet - Early Days of Internet
sutharharshit158
 
PDF
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
PDF
The Future of Artificial Intelligence (AI)
Mukul
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PPTX
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
PDF
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
PDF
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
PDF
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
PDF
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
PDF
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Safe Software
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
The Future of Mobile Is Context-Aware—Are You Ready?
iProgrammer Solutions Private Limited
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
CIFDAQ's Market Wrap : Bears Back in Control?
CIFDAQ
 
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Safe Software
 
Data_Analytics_vs_Data_Science_vs_BI_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Agile Chennai 18-19 July 2025 | Emerging patterns in Agentic AI by Bharani Su...
AgileNetwork
 
Introduction to Flutter by Ayush Desai.pptx
ayushdesai204
 
Brief History of Internet - Early Days of Internet
sutharharshit158
 
SparkLabs Primer on Artificial Intelligence 2025
SparkLabs Group
 
The Future of Artificial Intelligence (AI)
Mukul
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
AI and Robotics for Human Well-being.pptx
JAYMIN SUTHAR
 
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
A Strategic Analysis of the MVNO Wave in Emerging Markets.pdf
IPLOOK Networks
 
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
How Open Source Changed My Career by abdelrahman ismail
a0m0rajab1
 

The State of HDF

  • 1. Conf-DDDD-IN The State of HDF Summer ESIP 2023 This work was supported by NASA/GSFC under Raytheon Technologies contract number 80GSFC21CA001. This document does not contain technology or Technical Data controlled under either the U.S. International Traffic in Arms Regulations or the U.S. Export Administration Regulations. Dana Robinson Director of Software Engineering NASA EED-3 / The HDF Group [email protected]
  • 2. Conf-DDDD-IN 2 • About us • Software status • Current focus • How we can help each other Outline
  • 4. Conf-DDDD-IN 4 • Located in Champaign, IL • Spun off from NCSA in 2006 • Non-profit 501(c)(3) • ~25 employees The HDF Group
  • 5. Conf-DDDD-IN 5 • Mission-driven – Sustainable development of HDF technologies – Guarantee continual accessibility of HDF data • Services – Maintain and develop HDF products – Consulting and support contracts – Training The HDF Group
  • 6. Conf-DDDD-IN 6 • HDF5 1.0.0 was released in 1998 • What will the next 25 years bring? https://forum.hdfgroup.org/t/what-do-you-want-to-see-in-hdf5-2-0/10003 25 years of HDF5! 🎂 🎉
  • 8. Conf-DDDD-IN 8 • HDF5 • HDF4 • HDFView Release schedules
  • 9. Conf-DDDD-IN 9 • Latest: 1.14.1(May 2023) • 1.8 branch retired early this year • 1.10 and 1.12 retiring this year • Have not scheduled 2024 yet HDF5 Schedule
  • 10. Conf-DDDD-IN 10 • 1.14.0 – Multi-dataset I/O – Selection I/O – Subfiling – Onion VFD • 1.14.1 – Maintenance release – bugfixes, minor features HDF5 New Features https://www.hdfgroup.org/2023/05/release-of-hdf5-1-14-1-newsletter-194/
  • 11. Conf-DDDD-IN 11 • 1.14.2 – Read-only S3 VFD improvements • Better logging • Temporary security credential support – CVE-free HDF5 New Features
  • 12. Conf-DDDD-IN 12 • Latest: 4.2.16-2 (June 2023) – Patch release (fixes a Java issue w/ HDFView) – Bugfixes – Build system improvements HDF4 Schedule https://www.hdfgroup.org/2023/07/release-of-hdf-4-2-16-2-a-patch-release-newsletter-195/
  • 14. Conf-DDDD-IN 14 As always, we: • Strive to maintain API compatibility – HDF5's compatibility macro scheme, etc. • Are committed to file format forward and backward compatibility – Should always be able to create files in earlier formats – Should always be able to read earlier file formats HDF Product Change Policy
  • 15. Conf-DDDD-IN 15 Before implementing a breaking change, we will: • Create a discussion post on GitHub • Create a post in the HDF Forum • Make an announcement in the newsletter • Attempt to contact people who we know might be affected • Allow at least 30 days for comments HDF Product Change Policy https://forum.hdfgroup.org/t/hdf4-change-procedure/11240
  • 16. Conf-DDDD-IN 16 • Latest: HDFView 3.3.0 (April 2023) • Based on: – HDF5 1.14.1 – HDF4 4.2.16 (4.2.16-2 coming soon) • Releases based on older versions of HDF5 (e.g., 1.10) have been retired • Looking to modernize or replace this! HDFView https://www.hdfgroup.org/2023/04/release-of-hdfview-3-3-0-newsletter-193/
  • 18. Conf-DDDD-IN 18 • Improve software quality • Increase transparency • Strengthen our community • Modernize HDF Products What are we working on?
  • 21. Conf-DDDD-IN 21 • All CVE issues fixed – https://github.com/HDFGroup/cve_hdf5 – 3 currently unfixed • Testing improvements – Integration testing w/ key products (h5py, etc.) – More transparency w/ CDash • Resolve unfixed GitHub issues • Code cleanup Software quality
  • 22. Conf-DDDD-IN 22 HDF5, although open source, grew up in a walled garden and this mindset persists I'm working to fix this! The goal is to do all product-specific (e.g., HDF5) work with the community Transparency
  • 23. Conf-DDDD-IN 23 • HDF5 Working Group meeting – Every Thursday at 10:05 central time – Email me ([email protected]) for an invite – Covers pull requests, issues, tech discussion • Most project planning moving to GitHub • More testing via GitHub actions – Will use public CDash for non-GitHub tests Transparency
  • 24. Conf-DDDD-IN 24 • Aforementioned HDF5 WG meeting • Everything in the transparency slides, really • Adding external people as code owners • Spending more time connecting with our users, both new and old Community
  • 25. Conf-DDDD-IN 25 • Some of our key software has been around for a long time! – HDF(4): 35 years (1988) – HDF5: 25 years (1998) – HDFView: 16 years (2007) • Although we strive to keep our software up to date, the code is still old and could use an overhaul Modernization
  • 26. Conf-DDDD-IN 26 • Better support for new compilers – Especially Intel's oneAPI • Better support for AI/ML workflows • Better cloud integration • Better support for heterogeneous computing • Windows Unicode support Modernization
  • 27. Conf-DDDD-IN 27 • Better variable-length support • Multithreading • Support for sparse data • Improve performance • Internal cleanup – Easier to modify & debug – Refactor code that made sense in 1998 but not so much now Modernization
  • 28. Conf-DDDD-IN 28 How can we help each other?
  • 29. Conf-DDDD-IN 29 Support our non-profit mission Contact: [email protected] https://www.hdfgroup.org/donate
  • 30. Conf-DDDD-IN 30 • US HUG – August 16-18, 2023 – The Ohio State University - Columbus, OH – https://www.hdfgroup.org/hug/hug23/ • European HUG (focus on compression) – September 19-21, 2023 – DESY - Hamburg, Germany – https://indico.desy.de/event/39343/ HDF User Group Meetings
  • 32. Conf-DDDD-IN 32 This work was supported by NASA/GSFC under Raytheon Technologies contract number 80GSFC21CA001.