This document summarizes a survey on using data mining techniques for cancer prediction. It discusses several semi-supervised clustering approaches that have been used for tasks like image processing, multimedia, and bioinformatics. Specifically, it proposes a transitive closure based constraint propagation approach and a random subspace based semi-supervised clustering ensemble framework to address limitations in traditional constrained clustering approaches. It evaluates these proposed approaches on 20 cancer gene expression datasets and compares them to other state-of-the-art semi-supervised clustering ensemble algorithms on 10 additional datasets from repositories. The experimental results show that the proposed approaches outperform other methods on most datasets.