Commodity classification in livestreaming marketing based on a conv-transformer network

被引:1
|
作者
Zhang, Rongze [1 ]
Wang, Xiuhui [1 ]
机构
[1] China Jiliang Univ, Coll Informat Engn, Hangzhou 310018, Peoples R China
关键词
Commodity classification; Convolutional neural network; Layered transformer network; Parallel feature extraction network; QUERY EXPANSION; RETRIEVAL; FEEDBACK;
D O I
10.1007/s11042-023-17786-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To obtain category information of goods in livestreaming marketing, we designed a recognition algorithm based on an improved transformer network that obtains the category of goods in live broadcast marketing via real-time detection of video images. First, based on the proposed parallel network structure, the local characteristics of the convolution module and global characteristics of the transformer module were merged, enriching the overall characteristics of the backbone network. Second, two different levels of global attention modules were used to calculate the feature map in groups to optimize the transfer module structure. The experimental results demonstrated that, under the same detector conditions, the detection accuracy and network parameters of the feature extraction network proposed in this paper was significantly improved. Furthermore, the detection effect on the commodity detection task in livestreaming marketing scene was significant.
引用
收藏
页码:54909 / 54924
页数:16
相关论文
共 50 条
  • [1] Commodity classification in livestreaming marketing based on a conv-transformer network
    Rongze Zhang
    Xiuhui Wang
    Multimedia Tools and Applications, 2024, 83 : 54909 - 54924
  • [2] A Conv-Transformer network for heart rate estimation using ballistocardiographic signals
    Zhang, Miao
    Qiu, Lishen
    Chen, Yuhang
    Yang, Shuchen
    Zhang, Zhiming
    Wang, Lirong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 80
  • [3] Imbalanced Conditional Conv-Transformer for Mathematical Expression Recognition
    Ji, Shuaijian
    Zhou, Zhaokun
    Wang, Yuqing
    Duan, Baishan
    Weng, Zhenyu
    Xu, Liang
    Zhu, Yuesheng
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VI, 2023, 14259 : 446 - 458
  • [4] Deep Multi-Instance Conv-Transformer Frameworks for Landmark-Based Brain MRI Classification
    Li, Guannan
    Ji, Zexuan
    Sun, Quansen
    ELECTRONICS, 2024, 13 (05)
  • [5] STCT: Spatial-Temporal Conv-Transformer Network for Cardiac Arrhythmias Recognition
    Qiu, Yixuan
    Chen, Weitong
    Yue, Lin
    Xu, Miao
    Zhu, Baofeng
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2021, PT I, 2022, 13087 : 86 - 100
  • [6] Scene-level buildings damage recognition based on Cross Conv-Transformer
    Shi, Lingfei
    Zhang, Feng
    Xia, Junshi
    Xie, Jibo
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2023, 16 (02) : 3987 - 4007
  • [7] ECTFormer: An efficient Conv-Transformer model design for image recognition
    Sa, Jaewon
    Ryu, Junhwan
    Kim, Heegon
    PATTERN RECOGNITION, 2025, 159
  • [8] Single image deraining via nonlinear recursive Conv-Transformer
    Liang, Zhenyuan
    Chen, Hui
    Zhu, Songhao
    Liang, Zhiwei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024,
  • [9] 基于Conv-Transformer的RNA碱基配对概率预测
    李大舟
    陈梅
    高巍
    计算机与数字工程, 2025, 53 (03) : 781 - 785
  • [10] Conv-transformer architecture for unconstrained off-line Urdu handwriting recognition
    Nauman Riaz
    Haziq Arbab
    Arooba Maqsood
    Khuzaeymah Nasir
    Adnan Ul-Hasan
    Faisal Shafait
    International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 373 - 384