Simultaneous Contextualization and Interpretation with Keyword Awareness

被引:0
|
作者
Yoshino, Teppei [1 ]
Matsumori, Shoya [1 ]
Fukuchi, Yosuke [1 ]
Imai, Michita [1 ]
机构
[1] Keio Univ, Yokohama, Kanagawa, Japan
关键词
Dialogue context; Polysemy; Keyword extraction; SCAIN; SLAM;
D O I
10.1007/978-3-030-87897-9_36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most natural-language-processing methods are designed for estimating context given an entire set of sentences at once. However, dialogue is incremental in nature. SCAIN (Simultaneous Contextualization and Interpretation) is an algorithm for incremental dialogue processing. Along with the progress of the dialogue, it can solve the interdependence problem in which the interpretation of words depends on the context, and the context is determined by the interpreted words. However, SCAIN cannot process texts that contain more words insignificant to context estimation such as in longer texts. We propose SCAIN with keyword extraction (SCAIN/KE), which extracts keywords that contribute to context estimation and eliminates the effect of insignificant words so that it can process longer texts. In the case study, SCAIN/KE updates context and interpretation better than SCAIN and obtains the keywords that contribute to context estimation better than other statistical methods. In the experiments, we evaluated SCAIN/KE on solving the ambiguity of polysemous words using the Wikipedia disambiguation pages. The results indicate that SCAIN/KE is more accurate than SCAIN.
引用
收藏
页码:403 / 413
页数:11
相关论文
共 50 条