This document describes a method for single-photon 3D imaging using deep sensor fusion. Single-photon avalanche diodes (SPADs) are used to capture sparse photon detections along with a conventional intensity image. A convolutional neural network fuses the SPAD measurements and intensity image to estimate depth maps in a photon-efficient manner. The method achieves improved depth estimation compared to prior single-photon techniques by leveraging the additional intensity image context. It offers a tradeoff of increased acquisition speed and resolution compared to pulsed time-of-flight systems at the cost of reduced maximum range. The technique is demonstrated through simulations and a proof-of-concept prototype using a single vertical line of SPAD pixels.