Enhancing answer selection model of graрe рlanting using multimodal knowledge graрh

被引:0
|
作者
Yang S. [1 ]
Li S. [1 ]
机构
[1] College of Inforтation Engineering, Northwest A&F University, Yangling
关键词
agriculture; answer selection; grape cultivation; graph representation; knowledge graph; multi-modal; natural language processing(NLP);
D O I
10.11975/j.issn.1002-6819.202304240
中图分类号
学科分类号
摘要
Answer selection is one of the most important tasks during natural language processing in the downstream tasks, such as question-answering systems, and search ranking. The most relevant answer can be selected to the given question from a candidate answer pool, which is usually regarded as a relevance ranking task. However, the current models of answer selection cannot discover the deep semantic relationships between questions and answers using the limited information in the text of the question-answer pairs. Fortunately, knowledge graph can be expected to serve as the background knowledge, in order to enhance the deep semantics of the answer selection model. It is still lacking on the multi-modal background knowledge support, because the answer selection models can rely solely on their own information. In this research, a multi-modal knowledge graph enhanced answer selection model was proposed, including the embedding layer, representation learning layer, knowledge graph enhancement layer, and output layer. Among them, the Glove model was used to obtain the word embeddings for the question-answer texts in the embedding layer. Furthermore, a ComplEx-based method (complex embedding) was designed to learn the entity embeddings for the multi-modal knowledge graph. The image entity information was considered to extract the image feature representations using the Vision Transformer (VIT). Bi-directional long short-term memory (Bi-LSTM) was used for the representation learning of question-answer texts in the representation layer. The context-guided multi-modal knowledge graph question and answer vector representations were obtained using context-guided attention mechanism. In the knowledge graph enhancement layer, the interaction attention mechanism was used to fuse the semantic representation of the question-answer texts with the background knowledge features that provided by the multi-modal knowledge graph, particularly for the feature representations of the multi-modal knowledge graph enhanced question and answer. The feature representations of the knowledge graph enhanced question and answer were concatenated with the additional semantic features in the output layer. The softmax function was used to predict the probability distribution of answer labels for a given question. Taking the grape planting as an example, the multi-modal entity linking was realized using the longest common subsequence algorithm. The entity recognition was also implemented to extract the knowledge using the Bert-LSTM-CRF framework and Bert pre-training model. The reference of knowledge graph was collected from the literature and experts. Finally, a multi-modal knowledge graph was constructed in the grape planting field. A grape planting question and answer dataset was also constructed using grape forums, smart agricultural platforms, agricultural managers, and agricultural benefit networks as data sources, followed by text cleaning and dataset expansion. Experimental results show that the better performance of the model was achieved to obtain more information using the multi-modal knowledge graphs. Specifically, the mean reciprocal rank and mean average precision reached 85.02% and 84.21%, respectively, in the grape question answering dataset. The mean reciprocal rank and mean average precision increased by 2.57 and 3.96 percentage points, respectively. The answer selection model with the knowledge of multi-modal knowledge graph can be expected to improve the better performance of answer selection model. The embedding representation with attention mechanism can be utilized to enhance the background knowledge from the multimodal knowledge graph. The finding can provide a technical basis for the downstream applications of multi-modal knowledge graphs, such as the search and question answering. © 2023 Chinese Society of Agricultural Engineering. All rights reserved.
引用
收藏
页码:207 / 214
页数:7
相关论文
共 34 条
  • [1] LASKAR M, HUANG J, HOQUE E., Contextualized embeddings based transformer encoder for sentence similarity modeling in answer selection task, Language Resources and Evaluation Conference, (2020)
  • [2] HUANG H, WEI X, NIE L, Et al., From question to text: question-oriented feature attention for answer selection, Acm Transactions on Information Systems, 37, 1, pp. 1-33, (2019)
  • [3] ZHANG S, ZHANG X, WANG H, Et al., Chinesemedical question answer matching using end-to-end character-level multi-scale CNNs, Applied Sciences, 7, 8, (2017)
  • [4] GANCHAO B, YUAN W, XIN S, Et al., Double attention recurrent convolution neural network for answer selection, Royal Society Open Science, 7, 5, (2020)
  • [5] ZHANG B, WANG H, JIANG L, Et al., A novel bidirectional lstm and attention mechanism based neural network for answer selection in community question answering, Computers, Materials & Continua, 2020, 3, (2020)
  • [6] MA F, YOU Q, XIAO H, Et al., KAME: Knowledge-based attention model for diagnosis prediction in healthcare[C], The 27 th ACM International Conference, (2018)
  • [7] HUANG W, QU Q, YANG M., Interactive knowledge-enhanced attention network for answer selection, Neural Computing and Applications, 32, 1, pp. 1-17, (2020)
  • [8] YANG M, CHEN L, LYU Z, Et al., Hierarchical fusion of common sense knowledge and classifier decisions for answer selection in community question answering, Neural Networks, 132, 9, pp. 53-65, (2020)
  • [9] JING F, REN H, CHENG W, Et al., Knowledge-enhanced attentive learning for answer selection in community question answering systems, Knowledge-Based Systems, 250, (2022)
  • [10] DENG Y, XIE Y, LI Y, Et al., Contextualized knowledge-aware attentive neural network: Enhancing answer selection with knowledge[J], ACM Transactions on Information Systems, 40, 1, pp. 1-33, (2021)