Integrating multi-scale features and attention mechanisms for food image recognition
DOI:
CSTR:
Author:
Affiliation:

1.School of Computer Science and Engineering, Sichuan University of Science & Engineering,Yibin 644000, China; 2.Sichuan Big Data Visualization Analysis Technology Engineering Laboratory,Yibin 644000, China; 3.School of Automation and Information Engineering, Sichuan University of Science & Engineering,Yibin 644000, China; 4.Key Laboratory of Measurement and Control Technology of Enterprise Informatization and Internet of Things of Sichuan Universities,Yibin 644000,China

Clc Number:

TP393;TN98

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    To address the challenges in food image recognition caused by small inter-class differences, large intra-class variations, and complex structures, this paper proposes a food image recognition method that integrates multi-scale features and an attention mechanism. First, the ConvNeXt model, which has stronger feature extraction capabilities, is used as the backbone network to better capture the detailed features of food images. Next, an improved ASPP module is introduced to expand the receptive field and utilize multi-scale information, enhancing the model′s ability to capture features at different scales. Finally, an attention mechanism is added after each convolutional block to improve feature representation and the ability to capture contextual information. Experimental results show that the proposed method achieves accuracies of 91.56% and 87.22% on the extended Vireo Food172 dataset and the ETH Food101 dataset, respectively, which represents an improvement of 2.05% and 1.66% over the original model, thus verifying the effectiveness of the proposed method.

    Reference
    Related
    Cited by
Get Citation
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:
  • Revised:
  • Adopted:
  • Online: December 20,2024
  • Published:
Article QR Code