This document discusses how Lyft uses big data and machine learning pipelines at scale. It provides examples of how Lyft uses data to improve maps, calculate optimal pickup locations, detect inaccurate destinations, estimate routes and times, and forecast demand and supply. It describes Lyft's data ecosystem including 50PB of data in S3, 650k ETL pipeline runs processing 24M tasks monthly. It also summarizes the key differences between Airflow and Flyte orchestration tools, when each is best suited, and provides a live demo of Flyte.