复杂城市交通场景下的自动驾驶语义分割方法

首页 > 过刊浏览>2024年第38卷第4期 >241-247

复杂城市交通场景下的自动驾驶语义分割方法
DOI:
                        
                    
CSTR:
                        [cstr]
                    
作者:
                        
                        
                    
作者单位:昆明理工大学机电工程学院昆明650500
作者简介:
通讯作者:
中图分类号:TP391.41;U121
基金项目:

Automatic driving semantic segmentation method for complex urban traffic scene

Author:

Affiliation:

Faculty of Mechanical and Electrical Engineering, Kunming University of Science and Technology, Kunming 650500, China

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

多尺度特征金字塔可以缓解语义分割在复杂交通场景下通常存在漏分割、错误分割、边界分割不清晰等问题，但现有的多尺度特征金字塔在获取丰富的语义信息时，不得不下采样特征图，牺牲空间细节信息，而这导致了最终的分割结果仍然精度受限。针对该问题，本文提出了特征强化模块，使得有利于正确分类像素的相似特征在下采样过程之前得到基于不同矢量间余弦相似度的进一步加强，降低下采样带来的负面影响。另外，结合空洞卷积和条带卷积原理，本文对大卷积核进行了改造，并构建新的多尺度特征金字塔模块，以获取尺度不同且具备更大感受野的语义信息。该分割方法实时高效，能够满足自动驾驶语义分割要求，在VOC2012数据集上的实验表明，所提出的方法分割结果mIoU达到了74.36%，FPS达到了43，优于目前的主流语义分割方法。

Abstract:

The multi-scale feature pyramid can alleviate the problems of semantic segmentation in complex traffic scenes, such as missing segmentation, wrong segmentation and unclear boundary segmentation. However, the existing multi-scale feature pyramid has to downsample the feature maps and sacrifice the spatial detail information for rich semantic information, leading to the limited accuracy of the final segmentation result. Aiming at this problem, a feature enhancement module is proposed to further reinforce similar features based on cosine similarity between different vectors before downsampling, alleviating the negative influence of downsampling. In addition, combined with the principle of dilated convolution and strip convolution, the large convolution kernel is modified to build a new multi-scale feature pyramid module for semantic information with different scales and larger receptive fields. The proposed segmentation method is real-time and efficient, and can meet the requirements of automatic driving. Experiments on the VOC2012 dataset show that the mIoU of the proposed method reaches 74.36%, and the FPS reaches 43, which is superior than the current prevailing semantic segmentation methods.

参考文献

相似文献

引证文献

引用本文

周勇,刘泓滨,侯亚东.复杂城市交通场景下的自动驾驶语义分割方法[J].电子测量与仪器学报,2024,38(4):241-247

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2024-07-02
出版日期:

网站首页

杂志简介

投稿须知

在线阅读

欢迎订阅

招商合作

联系我们

English

引用本文

分享

文章指标

历史

文章二维码