This document presents a new deep learning model for jointly performing image classification and annotation. The model uses a convolutional neural network (CNN) to extract features from images and classify semantic objects. It then annotates the images based on the identified objects. The model is evaluated on standard datasets like CIFAR-10, CIFAR-100 as well as a new dataset collected by the authors. Results show the model achieves comparable or better performance than baseline methods, while also enabling fast image annotation. A novel scalable implementation allows annotating large datasets within seconds.