A Phrase Disambiguation Method of "Quanbu V de N" Based on SBERT Model and Syntactic Rule

被引:0
|
作者
Xie, Siqi [1 ]
Yang, Quan [1 ]
机构
[1] Beijing Normal Univ, Sch Int Chinese Language Educ, Beijing, Peoples R China
来源
关键词
Structural ambiguity; Disambiguation; Syntactic restriction; Semantic preference; SBERT;
D O I
10.1007/978-3-031-28956-9_29
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A method combining syntactic rule and semantic preference in resolving ambiguous structures "(sic)V(sic)N (Quanbu V de N)" is proposed in this paper. First, a rule base is constructed, and the rules of ambiguity division are described in accordance with the syntactic structural constraints and semantic representation function. Secondly, the homomorphic and heterogeneous phrase is converted into two heteromorphic phrases "(sic)V(sic)N (Quanbu dou V de N)" and "(sic)V(sic)N (Quanbu de V de N)". We use the SBERT model to calculate the semantic similarity between the original sentence and the two candidate sentences so that the optimal phrase segmentation is obtained. The experiment was conducted on 646 texts with ambiguous phrases "Quanbu V de N" as the object for testing. The disambiguation accuracy and the F1 value are both over 95%, which proves that this method can effectively disambiguate this phrase in natural language.
引用
收藏
页码:364 / 374
页数:11
相关论文
共 50 条
  • [11] A Syntactic Rule-based Method for Automatic Pathway Information Extraction From Biomedical Literature
    Wang, Yanhua
    Yang, Zhihao
    Lin, Hongfei
    Li, Yanpeng
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [12] A general and multi-lingual phrase chunking model based on masking method
    Wu, YC
    Chang, CH
    Lee, YS
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2006, 3878 : 144 - 155
  • [13] Rule-ranking method based on item utility in adaptive rule model
    Hikmawati, Erna
    Maulidevi, Nur Ulfa
    Surendro, Kridanto
    [J]. PEERJ COMPUTER SCIENCE, 2022, 8
  • [14] Comparison of web-based unsupervised translation disambiguation word model and N-gram model
    Institute of Computational Linguistics, Peking University, Beijing 100871, China
    不详
    [J]. Dianzi Yu Xinxi Xuebao, 2009, 12 (2969-2974):
  • [15] Improving the Distribution of N-Grams in Phrase Tables Obtained by the Sampling-Based Method
    Luo, Juan
    Lardilleux, Adrien
    Lepage, Yves
    [J]. HUMAN LANGUAGE TECHNOLOGY CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2014, 8387 : 419 - 431
  • [16] A rule-based method for automated surrogate model selection
    Jia, Liangyue
    Alizadeh, Reza
    Hao, Jia
    Wang, Guoxin
    Allen, Janet K.
    Mistree, Farrokh
    [J]. ADVANCED ENGINEERING INFORMATICS, 2020, 45
  • [17] Chinese News Hot Subtopic Discovery and Recommendation Method Based on Key Phrase and the LDA Model
    Ge, Bin
    He, Chun-hui
    Hu, Sheng-ze
    Guo, Cheng
    [J]. 2018 INTERNATIONAL CONFERENCE ON ELECTRICAL, CONTROL, AUTOMATION AND ROBOTICS (ECAR 2018), 2018, 307 : 349 - 358
  • [18] An Approach to N-Gram Language Model Evaluation in Phrase-Based Statistical Machine Translation
    Su, Jinsong
    Liu, Qun
    Dong, Huailin
    Chen, Yidong
    Shi, Xiaodong
    [J]. 2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 201 - 204
  • [19] Software Defect Prediction Method Based on Rule Knowledge Extraction Model
    柴海燕
    闫然
    韩新宇
    唐龙利
    [J]. Journal of Donghua University(English Edition), 2018, 35 (05) : 422 - 425
  • [20] A Novel Shadow Detection Method using Fuzzy Rule based Model
    Das, Sonia
    Meher, Sukadev
    [J]. PROCEEDINGS OF THE 2017 IEEE 15TH STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2017, : 493 - 498