This document discusses bringing high-performance computing (HPC) algorithms to big data platforms. It describes using Spark, an integrated big data platform, for experimental facilities like synchrotrons. A key application discussed is ptychography image reconstruction, which can involve large datasets. The document proposes a Spark-MPI approach to leverage both Spark and MPI for distributed computing. It provides examples of benchmarking a ptychography algorithm on Spark versus MPI and discusses a path towards Spark-MPI applications.