This paper presents an approach to train additive classifiers as efficiently as linear classifiers while maintaining accuracy close to non-linear kernels. Additive kernels can be approximated through embeddings that encode feature dimensions independently, allowing the use of fast linear SVM solvers. Experiments on pedestrian and object detection datasets show the approach trains classifiers up to 300x faster than kernel SVMs while achieving similar accuracy.