DLAU: A scalable deep learning accelerator unit on FPGA

C Wang, L Gong, Q Yu, X Li, Y Xie, X Zhou
IEEE Transactions on Computer-Aided Design of Integrated Circuits …, 2016 - ieeexplore.ieee.org
As an emerging field of machine learning, deep learning shows excellent ability in solving complex learning problems. However, networks are growing increasingly large to meet the demands of practical applications, which poses a significant challenge to constructing high-performance implementations of deep learning neural networks. To improve performance while maintaining low power cost, in this paper we design the deep learning accelerator unit (DLAU), a scalable accelerator architecture for large-scale deep learning networks that uses a field-programmable gate array (FPGA) as the hardware prototype. The DLAU accelerator employs three pipelined processing units to improve throughput and uses tiling techniques to exploit locality in deep learning applications. Experimental results on a state-of-the-art Xilinx FPGA board demonstrate that the DLAU accelerator achieves up to a 36.1× speedup compared to Intel Core2 processors, with a power consumption of 234 mW.
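The tiling idea mentioned in the abstract can be illustrated in software. The following is a minimal Python sketch, not the DLAU hardware design: it computes one fully connected layer by walking the input vector in fixed-size tiles, accumulating partial sums per output neuron, and applying an activation at the end. The three-stage split (tile multiply, partial-sum accumulation, activation), the function names, the tile size of 32, and the choice of a sigmoid activation are all assumptions made for illustration; the abstract does not specify them.

```python
import math


def tile_multiply(weights, inputs, row, start, stop):
    """Stage 1 (assumed): multiply-accumulate over one tile of a weight row."""
    return sum(weights[row][k] * inputs[k] for k in range(start, stop))


def layer_forward_tiled(weights, inputs, tile_size=32):
    """Compute sigmoid(W @ x), processing the input one tile at a time.

    Only `tile_size` input values are touched per outer iteration, which is
    the locality a tiled hardware design could exploit with small buffers.
    """
    n_out, n_in = len(weights), len(inputs)
    partial = [0.0] * n_out
    for start in range(0, n_in, tile_size):
        stop = min(start + tile_size, n_in)  # last tile may be shorter
        for row in range(n_out):
            # Stage 2 (assumed): accumulate this tile's partial sum.
            partial[row] += tile_multiply(weights, inputs, row, start, stop)
    # Stage 3 (assumed): apply the activation once all tiles are accumulated.
    return [1.0 / (1.0 + math.exp(-s)) for s in partial]


def layer_forward_reference(weights, inputs):
    """Untiled reference used to check the tiled version."""
    sums = [sum(w * x for w, x in zip(row, inputs)) for row in weights]
    return [1.0 / (1.0 + math.exp(-s)) for s in sums]
```

Because tiling changes only the order in which products are summed, the tiled and reference results agree up to floating-point rounding, which is why a comparison should use a tolerance rather than exact equality.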