Graspgen: A diffusion-based framework for 6-dof grasping with on-generator training

A Murali, B Sundaralingam, YW Chao, W Yuan… - arXiv preprint arXiv …, 2025 - arxiv.org
arXiv preprint arXiv:2507.13097, 2025arxiv.org
Grasping is a fundamental robot skill, yet despite significant research advancements,
learning-based 6-DOF grasping approaches are still not turnkey and struggle to generalize
across different embodiments and in-the-wild settings. We build upon the recent success on
modeling the object-centric grasp generation process as an iterative diffusion process. Our
proposed framework, GraspGen, consists of a DiffusionTransformer architecture that
enhances grasp generation, paired with an efficient discriminator to score and filter sampled …
Grasping is a fundamental robot skill, yet despite significant research advancements, learning-based 6-DOF grasping approaches are still not turnkey and struggle to generalize across different embodiments and in-the-wild settings. We build upon the recent success on modeling the object-centric grasp generation process as an iterative diffusion process. Our proposed framework, GraspGen, consists of a DiffusionTransformer architecture that enhances grasp generation, paired with an efficient discriminator to score and filter sampled grasps. We introduce a novel and performant on-generator training recipe for the discriminator. To scale GraspGen to both objects and grippers, we release a new simulated dataset consisting of over 53 million grasps. We demonstrate that GraspGen outperforms prior methods in simulations with singulated objects across different grippers, achieves state-of-the-art performance on the FetchBench grasping benchmark, and performs well on a real robot with noisy visual observations.
arxiv.org
Showing the best result for this search. See all results