R is a programming language for statistical analysis and data science. While powerful, R has limitations when dealing with large datasets due to being single-threaded and RAM-limited. Hadoop is a framework for distributed processing of large datasets across clusters of machines using MapReduce. R can be used on Hadoop through approaches like streaming, RHadoop, and Spark. Streaming requires writing mapper and reducer scripts in R. RHadoop provides R APIs for Hadoop. The demo showed linear regression in R and on Hadoop for predictive analysis. Lessons were to do maximum preprocessing before R and consider performance for production versus exploratory analysis.