Pyramidal Region Context Module for Semantic Segmentation

摘要：

　　Context modeling is widely exploited to enhance seman-tic correlation in semantic segmentation task.Recent ap-proaches(e.g.,OCNet,CCNet and DANet)apply non-local type of network to capture the context information.However,they are not accurate enough for handling scale-varying ob-jects due to that they consider very little local dependencies of the adjacent pixels.In this work,we address the complex scene segmentation problem by combining region depen-dencies and global contextual information.Motivated by the fact that scale of objects largely varies on images,we design the Pyramidal Region Context Module(PRCM)to handle the neighbor relationship of multi-scale regions.In addition,we adopt a depth-to-space layer(PixelShuffle)to form the Scale Transfer Classifier(STC).Based on the two newly proposed modules,we introduce an end-to-end segmentation network-Pyramidal Region Network(PRNet).We empirically demon-strate the effectiveness of our approach on Cityscapes dataset,the results have shown impressive improvement compared with baselines.Notably,PRNet obtains mean IoU of 81.3 on test set of Cityscapes.

关键词： Neural Networks Semantic Segmentation Self-Attention

作者: Tingting Liang Qijie Zhao Zhuoying Wang Kaiyu Shan Huan Zhang Yongtao Wang Zhi Tang

作者单位: Institute of Computer Science and Technology,Peking University Beijing,P.R.China

会议类型: 国际会议

会议名称: 2019国图灵大会(ACM Turing Celebration conference-China 2019 )

会议地点: 成都

会议语种:英文

页码: 237-242

在线出版日期: 2019-05-17（万方平台首次上网日期，不代表论文的发表时间）

会议专题

Pyramidal Region Context Module for Semantic Segmentation