多尺度局部特征融合的弱监督点云语义分割
DOI:
CSTR:
作者:
作者单位:

1.中北大学计算机科学与技术学院 太原 030051; 2.机器视觉与虚拟现实山西省重点实验室 太原 030051; 3.山西省视觉信息处理及智能机器人工程研究中心 太原 030051

作者简介:

通讯作者:

中图分类号:

TP391;TN958.98

基金项目:

国家自然科学基金(62272426)、山西省重点研发计划项目(202402020101001)、山西省自然科学基金(202303021211153,202403021212166)项目资助


Weakly supervised point cloud semantic segmentation via multi-scale local feature fusion
Author:
Affiliation:

1.School of Computer Science and Technology, North University of China, Taiyuan 030051, China; 2.Shanxi Key Laboratory of Machine Vision & Virtual Reality, Taiyuan 030051, China; 3.Shanxi Vision Information Processing and Intelligent Robot Engineering Research Center, Taiyuan 030051, China

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对现有基于弱监督的语义分割模型无法同时顾及点云局部细节特征的高相关性和泛化性,以及特征利用不充分的问题,以RAC-Net为基线模型,提出了多尺度局部特征融合的弱监督点云语义分割模型WS-MLF。首先,原始点云数据作为输入,提出了多尺度球域采样的方法MSSM提取多层的不同半径特征;其次,设计了特征增强模块MFA,充分利用点邻域内的几何特征;再次,提出了注意力模块SCH-Att以增强关键通道和关键点的特征捕捉;最后,运用解码器进行上采样,生成每个点不同的语义标签,完成语义分割任务。该模型在大规模室内场景数据集S3DIS和ScanNet-v2上进行了实验验证,结果表明在S3DIS数据集上,标签比率为0.02%和0.06%时,mIoU分别较RAC-Net提升了2.71%和0.54%,在ScanNet-v2数据集上,标签比率为20 pt时,mIoU较RAC-Net提升了1.55%。实验结果验证了该模型在弱监督场景中对点云关键特征的良好提取能力,提升了基于弱监督的点云语义分割精度。

    Abstract:

    To address the limitations of existing weakly supervised semantic segmentation models for point clouds,which struggle to balance local feature correlation, generalization, and feature utilization. This paper proposes WS-MLF, a weakly supervised point cloud semantic segmentation model via multi-scale local feature fusion, based on the RAC-Net baseline. Firstly, the raw point cloud data is taken as input, and a multi-scale spherical sampling methods (MSSM) is employed to capture hierarchical features across varying spatial radii. Secondly, a multi-local feature aggregation enhancement module (MFA) is designed to refine geometric context within neighborhoods. Thirdly, a spatial-channel-fused hybrid attention module (SCH-Att) is proposed to prioritize discriminative channels and key points. Finally, a decoder is utilized for upsampling to generate point-level semantic labels, thereby completing the semantic segmentation task. The proposed model is evaluated on large-scale indoor scene datasets, S3DIS and ScanNet-v2. Experimental results demonstrate that on the S3DIS dataset, when the label ratios are 0.02% and 0.06%, the mIoU surpasses RAC-Net by 2.71% and 0.54%, respectively. On the ScanNet-v2 dataset, with a label ratio of 20 pt, the mIoU increases by 1.55% compared with RAC-Net. These results validate WS-MLF′s effectiveness in extracting key features under weak supervision, enhancing segmentation accuracy.

    参考文献
    相似文献
    引证文献
引用本文

武佳颖,杨晓文,韩燮,韩慧妍,张元,赵融.多尺度局部特征融合的弱监督点云语义分割[J].电子测量技术,2026,49(6):167-176

复制
分享
相关视频

文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-05-13
  • 出版日期:
文章二维码

重要通知公告

①《电子测量技术》期刊收款账户变更公告