This document discusses using Apache Spark to perform privacy preservation on big data through faster data anonymization techniques. It proposes implementing common anonymization algorithms like k-anonymity, l-diversity, and t-closeness using Spark's in-memory processing capabilities to overcome limitations of previous Hadoop-based approaches. The key advantages of Spark are its faster processing speed due to avoiding disk I/O and ability to support streaming data and real-time processing. The implementation section describes how k-anonymity, l-diversity, and t-closeness algorithms can be applied to anonymize medical datasets in Spark to preserve patient privacy while minimizing information loss.