This document discusses using a heterogeneous CPU-GPU approach for particle-based simulations with particles of varying sizes. Large particles are handled by the CPU due to irregular workloads, while small particles are handled by the GPU due to uniform workloads. Optimizations include multi-threading large-small collisions on the CPU, spatially sorting small particles on the GPU to improve cache utilization, and load balancing work between the CPU and GPU. The approach leverages the strengths of the CPU and GPU on AMD's fusion architecture to efficiently simulate mixed particle systems.