This document summarizes UC Berkeley's AMPLab and its work on big data analytics. It discusses how AMPLab is developing machine learning algorithms and tools like Spark, Shark, and MLlib to handle massive and diverse datasets in real-time. These tools use techniques like distributed computing on clusters and human computation through crowdsourcing. AMPLab's goal is to help researchers and organizations gain better insights from large amounts of data through scalable analytics that balance cost, time, and answer quality.