音频场景识别中非对称卷积和知识迁移方法研究

首页 > 过刊浏览>2021年第35卷第5期 >168-173

音频场景识别中非对称卷积和知识迁移方法研究
DOI:
                        
                    
CSTR:
                        
                    
作者:
                        刘炜杰刘炜杰
1. 天津大学 电气自动化与信息工程学院,2. 江苏中气环境科技有限公司
在期刊界中查找
在百度中查找
在本站中查找
梁晋华梁晋华
1. 天津大学 电气自动化与信息工程学院
在期刊界中查找
在百度中查找
在本站中查找
张 涛张 涛
1. 天津大学 电气自动化与信息工程学院
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:TN912. 3
基金项目:天津市研究生科研创新项目（2019YJSS146）资助

Investigation on asymmetric convolution and knowledge transfer in acoustic scene classification

Author:

Liu Weijie
Liu Weijie
1. School of Electrical and Information Engineering, Tianjin University,2. Jiangsu Zhongqi Environmental Technology Company
在期刊界中查找
在百度中查找
在本站中查找
Liang Jinhua
Liang Jinhua
1. School of Electrical and Information Engineering, Tianjin University
在期刊界中查找
在百度中查找
在本站中查找
Zhang Tao
Zhang Tao
1. School of Electrical and Information Engineering, Tianjin University
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献 [20]

引证文献

资源附件

文章评论

摘要:

针对当前音频场景识别中训练数据量不足的问题,设计了基于知识迁移的非对称卷积声音场景识别系统。相较于现有方法利用音频场景识别数据集从头训练网络模型,该系统在其他任务训练好的网络模型上进行调整和训练,从而保留了源领域的有效信息。与此同时,该系统针对声学特征的特点,采用了非对称卷积模块来增强网络的特征提取能力。实验结果为该系统的准确率相较基准系统提高了 0. 023,并且该系统的卷积核可视化结果观察到的特征纹理更清晰。结果表明知识迁移可以提升模型的特征表示能力,与非对称卷积结合能进一步提升系统性能。

关键词:音频场景识别;非对称卷积;知识迁移;卷积神经网络;模式识别

Abstract:

A novel acoustic scene classification (ASC) system based on asymmetric convolution and knowledge transfer is proposed to address the problem caused by limited ASC datasets. Compared with the existing methods which trained models from scratch, the proposed system fine-tunes a pretrained model of other tasks to preserve valid information from the source domain. Besides, targeted at the nature of acoustic features, it adopts asymmetric convolutions to enhance the network capability of feature extraction. Experiments shows that the proposed system outperforms the baseline system by 0. 023. Besides, as shown in the visualization results of convolutional filters, textures of the proposed system are more detailed than other methods. The experiment proves that knowledge transfer can boost model ability of feature representation, and it can further improve system performance by combining with asymmetric convolution.

Key words:acoustic scene classification; asymmetric convolution; knowledge transfer; convolutional neural network; pattern recognition

引用本文

刘炜杰,梁晋华,张涛.音频场景识别中非对称卷积和知识迁移方法研究[J].电子测量与仪器学报,2021,35(5):168-173

复制

文章指标

点击次数:1122
下载次数: 4
HTML阅读次数: 0
引用次数: 0

历史

收稿日期:
最后修改日期:
录用日期:
在线发布日期: 2023-02-27
出版日期:

网站首页

杂志简介

投稿须知

在线阅读

欢迎订阅

招商合作

联系我们

English

引用本文

分享

文章指标

历史

文章二维码

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码