This document discusses matrix storage and data serialization techniques for scientific computing with Hadoop and Dumbo. It provides examples of storing matrices in HDFS using different approaches like storing each row separately, storing two rows per record, or flattening the matrix into a single list. It also discusses optimizing data serialization and switching programming languages. The document then presents an example of outputting many small matrices to disk and compares two MapReduce implementations for computing the Cholesky QR decomposition, identifying which approach is usually better and why.