Voiceprint recognition method of optical fiber sensing system based on DTW-GMM

Affiliations:

1. College of Electronic Information and Optical Engineering, Taiyuan University of Technology, Taiyuan 030024, China; 2. Key Laboratory of Advanced Transducers and Intelligent Control System, Ministry of Education, Taiyuan University of Technology, Taiyuan 030024, China; 3. Shanxi Transportation Technology Research & Development Company Limited, Taiyuan 030024, China

CLC number:

TH741

Funding:

Supported by the Shanxi Province Key Research and Development Program (202102130501021), the Shanxi Province Water Conservancy Science and Technology Research and Promotion Project (2024GM18), the Central Government Guided Local Science and Technology Development Fund Project (YDZJSX20231B004), and the Shanxi Province Science and Technology Innovation Team Project (201805D131003).

    Abstract:

    To meet the demand for voiceprint recognition in flammable and explosive environments, a linear Sagnac interferometric optical fiber acoustic sensing system was designed. Speech data were denoised with the Wiener filtering algorithm, and pitch-period features were obtained by three-level clipping. Speaker samples were screened with the dynamic time warping (DTW) algorithm, and Mel-frequency cepstral coefficient (MFCC) features were extracted. Voiceprint recognition experiments were carried out using a Gaussian mixture model (GMM) trained with the expectation-maximization (EM) algorithm. The frequency response characteristics of the sensing system and the voiceprint features were also investigated, together with the influence of the acquired speech amplitude on the recognition results. Experimental results show that the system can sense sound signals in the 300 to 3500 Hz band. When the sound amplitude decreases from 0.9 V to 0.15 V, the difference between the largest and second-largest log-likelihood values drops from 35.5 to 10.9, and the recognition result changes from success to failure. Repeatability experiments show that, on a 10 km sensing fiber at a position 2 m from the sound source, the system accurately detects 400 text-independent speech segments, each 3 to 5 s long, with an overall recognition accuracy of 94.75%. The system is expected to provide a voiceprint recognition solution for equipment fault diagnosis, emergency rescue, leak monitoring, and related applications in flammable and explosive environments.
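The abstract names two quantities that are easy to illustrate in isolation: a DTW alignment cost between two sequences, and the gap between the largest and second-largest log-likelihood over the candidate speakers. The following is a minimal sketch under stated assumptions, not the authors' implementation. The abstract does not say what DTW compares or how the screening threshold is chosen, so the 1-D input sequences, the function names `dtw_distance` and `pick_speaker`, and all numeric scores below are hypothetical.

```python
import math


def dtw_distance(a, b):
    """Textbook dynamic-programming DTW cost between two 1-D sequences.

    Assumption: absolute difference as the local cost; the paper's
    actual features and local distance are not given in the abstract.
    """
    n, m = len(a), len(b)
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # a[i-1] matched again to an earlier b
                cost[i][j - 1],      # b[j-1] matched again to an earlier a
                cost[i - 1][j - 1],  # both sequences advance together
            )
    return cost[n][m]


def pick_speaker(log_likelihoods):
    """Return the top-scoring speaker and the margin to the runner-up.

    `log_likelihoods` maps a speaker label to the log-likelihood of the
    test utterance under that speaker's model (at least two entries).
    """
    ranked = sorted(log_likelihoods.items(), key=lambda kv: kv[1], reverse=True)
    (best, top), (_, second) = ranked[0], ranked[1]
    return best, top - second


# Hypothetical contour and a time-stretched copy of it.
template = [0, 1, 2, 1, 0]
stretched = [0, 0, 1, 1, 2, 2, 1, 1, 0, 0]
print(dtw_distance(template, stretched))

# Invented scores for three enrolled speakers.
print(pick_speaker({"A": -120.0, "B": -155.5, "C": -160.0}))
```

A time-stretched copy of a sequence aligns at zero cost, which is why DTW suits utterances spoken at different rates. The margin returned by `pick_speaker` is the same kind of quantity the abstract reports shrinking as the acquired amplitude falls; the sketch only computes it and does not assume any decision threshold.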

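Cite this article

Yang Jiapei, Wang Yu, Peng Guangjian, Bai Qing, Liu Xin, Jin Baoquan. Voiceprint recognition method of optical fiber sensing system based on DTW-GMM[J]. Journal of Electronic Measurement and Instrumentation, 2024, 38(4): 176-186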

History
  • Published online: 2024-07-02