Morphology;
Morphological reconstruction;
Igbo;
Unknown words prediction;
Part-of-speech tagging;
D O I:
10.1007/978-3-319-45510-5_24
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
The effective handling of previously unseen words is an important factor in the performance of part-of-speech taggers. Some trainable POS taggers use suffix (sometimes prefix) strings as cues in handling unknown words (in effect serving as a proxy for actual linguistic affixes). In the context of creating a tagger for the African language Igbo, we compare the performance of some existing taggers, implementing such an approach, to a novel method for handling morphologically complex unknown words, based on morphological reconstruction (i.e. a linguistically-informed segmentation into root and affixes). The novel method outperforms these other systems by several percentage points, achieving accuracies of around 92% on morphologically-complex unknown words.
机构:
Hong Kong Polytech Univ, Dept Chinese & Bilingual Studies, Kowloon, Hong Kong, Peoples R ChinaHong Kong Polytech Univ, Dept Chinese & Bilingual Studies, Kowloon, Hong Kong, Peoples R China
Maeng, Junghwan
Kim, Sun-A
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Polytech Univ, Dept Chinese & Bilingual Studies, Kowloon, Hong Kong, Peoples R ChinaHong Kong Polytech Univ, Dept Chinese & Bilingual Studies, Kowloon, Hong Kong, Peoples R China
机构:
Univ Helsinki, Fac Med, Dept Psychol & Logoped, Cognit Brain Res Unit, POB 21, FIN-00014 Helsinki, Finland
Univ Helsinki, Fac Arts, Cognit Sci, Dept Digital Humanitiers, POB 9, FIN-00014 Helsinki, FinlandHelsinki Univ Hosp, Dept Otorhinolaryngol & Phoniatr, POB 250, FIN-00029 Helsinki, Finland
Leminen, Alina
Smolander, Sini
论文数: 0引用数: 0
h-index: 0
机构:
Helsinki Univ Hosp, Dept Otorhinolaryngol & Phoniatr, POB 250, FIN-00029 Helsinki, Finland
Univ Helsinki, POB 250, FIN-00029 Helsinki, Finland
Univ Oulu, Res Unit Logoped, FIN-90014 Oulu, FinlandHelsinki Univ Hosp, Dept Otorhinolaryngol & Phoniatr, POB 250, FIN-00029 Helsinki, Finland
Smolander, Sini
Arkkila, Eva
论文数: 0引用数: 0
h-index: 0
机构:
Helsinki Univ Hosp, Dept Otorhinolaryngol & Phoniatr, POB 250, FIN-00029 Helsinki, Finland
Univ Helsinki, POB 250, FIN-00029 Helsinki, FinlandHelsinki Univ Hosp, Dept Otorhinolaryngol & Phoniatr, POB 250, FIN-00029 Helsinki, Finland
Arkkila, Eva
Shtyrov, Yury
论文数: 0引用数: 0
h-index: 0
机构:
Aarhus Univ, Inst Clin Med, CFIN, DK-8000 Aarhus C, Denmark
St Petersburg State Univ, Lab Behav Neurodynam, Makarova Emb 6, St Petersburg 199034, RussiaHelsinki Univ Hosp, Dept Otorhinolaryngol & Phoniatr, POB 250, FIN-00029 Helsinki, Finland
Shtyrov, Yury
论文数: 引用数:
h-index:
机构:
Laasonen, Marja
Kujala, Teija
论文数: 0引用数: 0
h-index: 0
机构:
Univ Helsinki, Fac Med, Dept Psychol & Logoped, Cognit Brain Res Unit, POB 21, FIN-00014 Helsinki, FinlandHelsinki Univ Hosp, Dept Otorhinolaryngol & Phoniatr, POB 250, FIN-00029 Helsinki, Finland
机构:
Max Planck Inst Human Dev MPIB, Berlin, Germany
Univ Gottingen, Dept Educ Psychol, Gottingen, GermanyMax Planck Inst Human Dev MPIB, Berlin, Germany
Mousikou, Petroula
Nueesch, Lorena
论文数: 0引用数: 0
h-index: 0
机构:
Max Planck Inst Human Dev MPIB, Berlin, GermanyMax Planck Inst Human Dev MPIB, Berlin, Germany
Nueesch, Lorena
Hasenacker, Jana
论文数: 0引用数: 0
h-index: 0
机构:
Int Sch Adv Studies SISSA, Trieste, ItalyMax Planck Inst Human Dev MPIB, Berlin, Germany
Hasenacker, Jana
Schroeder, Sascha
论文数: 0引用数: 0
h-index: 0
机构:
Max Planck Inst Human Dev MPIB, Berlin, Germany
Univ Gottingen, Dept Educ Psychol, Gottingen, GermanyMax Planck Inst Human Dev MPIB, Berlin, Germany