This document discusses common anti-patterns when using Spark with Cassandra. It begins by introducing the authors and their experience. The main section describes several common issues like out of memory errors, RPC failures, and slow performance. It then discusses the most common performance pitfall of collecting and re-parallelizing data. Alternative approaches are provided. Other topics covered include predicate pushdowns, serialization, and understanding how Catalyst optimizes queries.