This document discusses methods for building models on imbalanced datasets without resampling the data. It presents an example dataset that is highly imbalanced between active and inactive compounds. Two approaches are described for adjusting the decision threshold of models to account for the imbalance: 1) selecting the threshold that maximizes the kappa on out-of-bag predictions, and 2) selecting the threshold at the point on the ROC curve closest to the upper left corner. Validation experiments on several datasets show that both approaches improve evaluation metrics like kappa compared to using a threshold of 0.5. Balanced random forests, which resample during training, are also evaluated and often perform similarly to threshold adjustment.