This document discusses strategies for scaling APIs as traffic grows, covering throughput from 1 transaction per second up to 100,000 transactions per second. Lower volumes can typically be handled by tuning the database and the application. Higher volumes require caching, database sharding or a NoSQL database, and scaling out application servers and other infrastructure. API and client design also affect scalability, because they can reduce unnecessary calls and payload sizes. At high volumes, latency and network bandwidth become important concerns as well.
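To make the throughput range concrete, the following is a rough back-of-the-envelope sketch, not a sizing method taken from this document. It uses Little's law (average requests in flight = arrival rate × average latency) and a simple payload multiplication to estimate concurrency and bandwidth at several volumes. The function name `estimate_load`, the 50 ms latency, and the 2,000-byte payload are illustrative assumptions; substitute measured values for a real system.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoadEstimate:
    tps: int               # transactions per second
    in_flight: float       # average concurrent requests (Little's law)
    bandwidth_mbps: float  # payload traffic in megabits per second


def estimate_load(tps: int,
                  latency_s: float = 0.05,      # assumed average latency (50 ms)
                  payload_bytes: int = 2_000    # assumed average payload size
                  ) -> LoadEstimate:
    """Estimate concurrency and bandwidth for a given transaction rate."""
    # Little's law: L = lambda * W
    in_flight = tps * latency_s
    # bytes/s -> bits/s -> megabits/s (payload only, ignoring protocol overhead)
    bandwidth_mbps = tps * payload_bytes * 8 / 1_000_000
    return LoadEstimate(tps, in_flight, bandwidth_mbps)


if __name__ == "__main__":
    for tps in (1, 100, 10_000, 100_000):
        e = estimate_load(tps)
        print(f"{e.tps:>7} TPS: ~{e.in_flight:,.2f} requests in flight, "
              f"~{e.bandwidth_mbps:,.2f} Mbit/s")
```

Under these assumed values, 100,000 transactions per second corresponds to roughly 5,000 requests in flight and about 1.6 Gbit/s of payload traffic, which illustrates why trimming payloads and avoiding unnecessary calls matters more as volume grows.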