This paper addresses the challenge of topic extraction from large document collections using a correlated topic model (CTM) implemented in a MapReduce framework. It highlights the scalability issues faced by conventional topic modeling techniques, presenting a solution that employs a variational expectation-maximization algorithm to analyze crawled full-text documents. The proposed methodology includes data gathering, pre-processing, and topic extraction, demonstrating comparable performance with traditional LDA methods while enhancing the overall accuracy of topic modeling.