多源域迁移学习的肌电-惯性特征融合及手势识别
DOI:
作者:
作者单位:

1.燕山大学电气工程学院;2.燕山大学

作者简介:

通讯作者:

中图分类号:

基金项目:

教育部人文社科青年基金项目(21YJC890001); 河北省高等学校人文社会科学研究项目(SQ2021010); 国家自然科学基金项目(62076216); 河北省自然科学基金委员会重点项目(F2022203079); 河北省测试计量技术与仪器重点实验室; 河北省创新能力提升计划项目(22567619H)


Multi-source domain transfer learning for electromyography-inertial feature fusion and gesture recognition
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    在跨用户手势识别研究中,针对单源域迁移学习存在的负迁移和模型泛化性能差的问题,本研究创新性地提出了一种基于肌电-惯性特征融合的多源域迁移学习策略,关键创新点在于整合多个源域的数据,并在此基础上采用域特有特征对齐与域分类器对齐的技术手段。这一方法旨在强化模型在不同用户间的手势识别性能,进而显著提升跨用户手势识别系统的准确性。首先,引入长短时记忆(Long Short-Term Memory, LSTM)网络模型,提取肌电-惯性信息的平均绝对值、方差、峰值等时序特征;其次进行域特有特征对齐与域分类器对齐,利用多个源域数据完成对目标域的特征提取;最后融合分类损失、域特有特征差异损失和域分类器差异损失三个损失函数,协同优化整体损失。实验结果表明,所提方法与单源域、源域组合等多种传统方法相比,识别平均率有所提高,在NinaPro DB5数据集上,目标用户的手势识别平均准确率达到80%以上。

    Abstract:

    In the realm of cross-user gesture recognition research, addressing the challenges of negative transfer and subpar model generalization observed in single-source domain transfer learning, this study presents a multi-source domain transfer learning strategy centered around the fusion of EMG and inertial features. The pivotal innovation lies in amalgamating data resources originating from diverse source domains, and subsequently employing techniques for domain-specific feature alignment and domain classifier alignment. The primary objective of this approach is to bolster model performance in gesture recognition across different users, thus significantly enhancing the accuracy of cross-user gesture recognition systems. Initially, the Long Short-Term Memory (LSTM) network model is introduced to extract time series features, encompassing metrics such as average absolute value, variance, and peak value derived from EMG and inertia data. Subsequently, domain-specific feature alignment and domain classifier alignment procedures are executed, facilitating feature extraction within the target domain utilizing data from multiple source domains. Lastly, the fusion of three loss functions—classification loss, domain-specific feature difference loss, and domain classifier difference loss—is undertaken to collectively optimize the overall loss. The experimental results demonstrate that the proposed method exhibits an improvement in average recognition rate compared to various traditional methods, such as single-source domain and source domain combination approaches. On the NinaPro DB5 dataset, the average gesture recognition accuracy for the target users exceeds 80%.

    参考文献
    相似文献
    引证文献
引用本文
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2024-02-27
  • 最后修改日期:2024-06-14
  • 录用日期:2024-06-18
  • 在线发布日期:
  • 出版日期: