Understanding Convolution for Semantic Segmentation

Wang, Panqu; Chen, Pengfei; Yuan, Ye; Liu, Ding; Huang, Zehua; Hou, Xiaodi; Cottrell, Garrison

arXiv:1702.08502 (cs)

[Submitted on 27 Feb 2017 (v1), last revised 1 Jun 2018 (this version, v3)]

Abstract: Recent advances in deep learning, especially deep convolutional neural networks (CNNs), have led to significant improvement over previous semantic segmentation systems. Here we show how to improve pixel-wise semantic segmentation by manipulating convolution-related operations that are of both theoretical and practical value. First, we design dense upsampling convolution (DUC) to generate pixel-level prediction, which is able to capture and decode more detailed information that is generally missing in bilinear upsampling. Second, we propose a hybrid dilated convolution (HDC) framework in the encoding phase. This framework 1) effectively enlarges the receptive fields (RF) of the network to aggregate global information, and 2) alleviates what we call the "gridding issue" caused by the standard dilated convolution operation. We evaluate our approaches thoroughly on the Cityscapes dataset, and achieve a state-of-the-art result of 80.1% mIoU on the test set at the time of submission. We have also achieved state-of-the-art results overall on the KITTI road estimation benchmark and the PASCAL VOC2012 segmentation task. Our source code can be found at this https URL.
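
The "gridding issue" mentioned in the abstract can be illustrated with a small sketch. It assumes a 1-D simplification with stacked 3-tap dilated convolutions, and the dilation rates [2, 2, 2] and [1, 2, 3] are chosen here only for illustration; they are not taken from the abstract. The helper names `reachable_offsets` and `coverage` are hypothetical. The sketch lists which input positions can influence a single output: repeating one dilation rate leaves holes in the receptive field, while varying the rates fills them.

```python
from itertools import product


def reachable_offsets(rates, kernel_size=3):
    """Input offsets that can influence one output of stacked 1-D dilated convs.

    A layer with dilation rate r and odd kernel size k samples taps at
    offsets -r*(k//2), ..., 0, ..., r*(k//2), spaced r apart.
    """
    half = kernel_size // 2
    taps_per_layer = [[r * t for t in range(-half, half + 1)] for r in rates]
    # An input position contributes if it is a sum of one tap offset per layer.
    return sorted({sum(combo) for combo in product(*taps_per_layer)})


def coverage(rates, kernel_size=3):
    """Fraction of positions inside the receptive field span that are used."""
    offsets = reachable_offsets(rates, kernel_size)
    span = offsets[-1] - offsets[0] + 1
    return len(offsets) / span


same_rate = [2, 2, 2]   # assumed example: one rate repeated in every layer
mixed_rate = [1, 2, 3]  # assumed example: rates that vary across layers

print(reachable_offsets(same_rate))   # only even offsets between -6 and 6
print(reachable_offsets(mixed_rate))  # every offset between -6 and 6
print(coverage(same_rate), coverage(mixed_rate))
```

Both stacks span the same 13 input positions, but the repeated-rate stack only ever reads 7 of them, which is the kind of sparse, grid-like sampling the term suggests. This is a toy model of the effect, not the HDC framework itself.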

Comments:	WACV 2018. Updated acknowledgements. Source code: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1702.08502 [cs.CV]
	(or arXiv:1702.08502v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1702.08502

Submission history

From: Panqu Wang
[v1] Mon, 27 Feb 2017 20:05:11 UTC (5,533 KB)
[v2] Thu, 9 Nov 2017 01:12:21 UTC (6,529 KB)
[v3] Fri, 1 Jun 2018 01:15:23 UTC (6,529 KB)
