Global deduplication for Ceph storage clusters can save up to 40% of total storage space by eliminating redundant data blocks. The document discusses two designs for implementing global deduplication in Ceph without adding a centralized metadata server: 1) using a double distribution hash to map data chunks to objects without redirection, and 2) storing deduplication metadata within self-contained objects to avoid complex linking between systems. It also describes the implementation of an extensible tier in Ceph for deduplication using object manifests and ongoing work to contribute these features to the upstream Ceph project. Remaining tasks are listed along with challenges around small chunk sizes and minimizing performance impacts.