The document describes how to build a big data processing platform on commodity machines using open-source technologies. It outlines the challenges of big data, namely the volume, velocity, and variety of the data being created. It then presents the Hadoop ecosystem as a solution, covering MapReduce and the Apache projects that handle storage, data transfer, search, messaging, logging, stream processing, and machine learning.
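
As a rough illustration of the MapReduce model mentioned above, the following is a minimal single-process sketch in plain Python. It is not Hadoop code and does not use any Hadoop API: the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical, and the word-count task is an assumed example rather than one taken from the document. It only shows the three conceptual steps: map emits key-value pairs, the framework groups values by key, and reduce aggregates each group.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Emit a (word, 1) pair for every whitespace-separated word in one line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Group emitted values by key (done by the framework in a real cluster)."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Collapse all counts for one word into a single total."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence inside one process."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["big data", "big clusters"]))
```

In a real deployment the map and reduce calls would run in parallel on many commodity machines, with the shuffle step moving data between them over the network; this sketch keeps everything in memory to make the data flow visible.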