This document discusses techniques for feature extraction in big data using distance covariance based principal component analysis (PCA). It provides background on big data and dimensionality reduction. It then explains distance covariance and how it can be used to calculate principal components for feature extraction in big data, which can help reduce computation time compared to traditional PCA. Some modifications of distance-PCA are proposed to eliminate the need for normalization of the data. Potential drawbacks and areas for future work are also outlined.
Related topics: