1) The Group Steiner Problem involves finding a minimum-cost tree connecting at least one node from each of k disjoint groups in a graph. It has applications in VLSI circuit routing.
2) The authors propose a CUDA-aware MPI-based approach to solve large instances of the Group Steiner Problem using a heuristic that constructs 2-star trees in parallel.
3) Experimental results on a supercomputer show the parallel implementation achieves up to 302x speedup over sequential algorithms and finds accurate solutions to industry benchmark instances.