This document provides an overview of Hadoop and distributed systems concepts. It discusses Hadoop's origins in addressing the need to store and process large volumes of data across clusters. The key components of Hadoop covered are HDFS for storage, MapReduce for processing, and YARN for resource management. The ecosystem of tools that work with Hadoop is also outlined. Examples of distributed system architectures like Cassandra are briefly described.