Modeless Japanese Input Method Using Multiple Character Sequence Features

Cited by: 4
Authors
Ikegami, Yukino [1]
Sakurai, Yoshitaka [1]
Tsuruta, Setsuo [1]
Affiliation
[1] Tokyo Denki Univ, Inzai, Japan
Keywords
modeless Japanese input; multiple character sequence features; multilingual text; n-gram;
DOI
10.1109/SITIS.2012.93
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract
With the rapid growth of globalization, large numbers of multilingual texts need to be written. However, with conventional Japanese input methods, Japanese PC users must switch the input mode between Japanese and the Latin alphabet, which is cumbersome. Dictionary-based solutions are hard to maintain because new words are coined frequently every year. This paper proposes a modeless Japanese input method that switches the input mode automatically without using a dictionary. Using a model called "multiple character sequence features", the method decides whether or not to convert alphabetic input into Kana. The multiple character sequence features are character surface features and character type features, both based on n-grams. A Support Vector Machine learns these features from corpora, in particular corpora containing many words in current use on the Web. In the evaluation, the F-measure for both chat texts and news texts was over 90% (mostly over 99%).
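As a rough illustration of the two feature families named in the abstract, the sketch below extracts character surface n-grams and character type n-grams from one alphabetic input token. It is not the authors' implementation. The type scheme (vowel, consonant, digit, symbol), the `surf:`/`type:` feature prefixes, the function names, and the maximum n-gram length are all assumptions made for this example; the abstract does not specify them. Training the Support Vector Machine on the resulting counts is left out.

```python
from collections import Counter

VOWELS = set("aeiou")


def char_type(ch):
    """Map a character to a coarse type label (hypothetical scheme)."""
    if ch.lower() in VOWELS:
        return "V"  # vowel
    if ch.isalpha():
        return "C"  # any other letter is treated as a consonant
    if ch.isdigit():
        return "D"  # digit
    return "S"      # symbol, whitespace, or anything else


def ngrams(seq, n):
    """Return all contiguous substrings of length n."""
    return [seq[i:i + n] for i in range(len(seq) - n + 1)]


def extract_features(token, max_n=3):
    """Count surface n-grams and type n-grams for n = 1..max_n."""
    types = "".join(char_type(c) for c in token)
    feats = Counter()
    for n in range(1, max_n + 1):
        for g in ngrams(token, n):
            feats["surf:" + g] += 1  # character surface feature
        for g in ngrams(types, n):
            feats["type:" + g] += 1  # character type feature
    return feats


if __name__ == "__main__":
    for token in ("kana", "web2"):
        print(token, dict(extract_features(token, max_n=2)))
```

Under these assumptions, each token becomes a sparse bag of n-gram counts that a binary classifier such as an SVM could consume to decide "convert to Kana" versus "leave as typed". Because the type n-grams abstract away from the exact letters, they could in principle generalize to new words that appear in no dictionary.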
Pages: 613-618
Number of pages: 6