On the Use of Multi-lingual Approach for a Cloud-based Transcription System for the 'Ilonggoish' Dialect

被引:1
|
作者
Alibagon, Rowena [1 ]
Elijorde, Frank [2 ]
De Castro, Joel [2 ]
Byun, Yungcheol [3 ]
机构
[1] St Vincent Coll Sci & Technol, Iloilo, Philippines
[2] West Visayas State Univ, Coll Informat & Commun Technol, Iloilo, Philippines
[3] Jeju Natl Univ, Dept Comp Engn, Jeju, South Korea
来源
INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING | 2018年 / 11卷 / 03期
关键词
Hidden Markov Model; Mel Frequency Cepstral Coefficients; Speech Recognition; Transcription System;
D O I
10.14257/ijgdc.2018.11.3.12
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The study is aimed at the development of a Transcription System for 'Ilonggoish' Dialect, which is a widely-spoken local language in the Philippines. It is a software that records speech in. wav file format, transcribes speech into text, and generates text file containing the transcribed text. The system has a built in speech recognition that has the capability to recognize pre-recorded speeches spoken in different languages such as English, Filipino, Hiligaynon, and Ilonggoish dialect. Integrated into the system are the recording tool for the input speech data, data storing capability in. wav format, and text storing capability in. txt format. This study presents an approach to extract features of the spoken words by using the Mel Frequency Cepstral Coefficients (MFCC) algorithm from speech signals of isolated spoken words, and Hidden Markov Model (HMM) method in presenting the recognized spoken words in text format. The system uses the Google Cloud's database of words as the baseline for standard words. It was evaluated by linguists specializing in Filipino, English, and Hiligaynon languages, and IT experts in different fields such as the academe and industry.
引用
收藏
页码:139 / 153
页数:15
相关论文
共 50 条
  • [1] A multi-lingual augmentative communication system
    Alm, N
    Iwabuchi, M
    Andreasen, PN
    Nakamura, K
    UNIVERSAL ACCESS: THEORETICAL PERSPECTIVES, PRACTICE, AND EXPERIENCE, 2003, 2615 : 398 - 408
  • [2] Multi-lingual web querying: A parametric linguistics based approach
    Kapetanios, Epaminondas
    Sugumaran, Vijayan
    Tanase, Diana
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2006, 3999 : 94 - 105
  • [3] An operational framework for the multi-lingual system simulation based on π-calculus
    Windisch, A
    Monjau, D
    SCCC 2001: XXI INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY, PROCEEDINGS, 2001, : 282 - 291
  • [4] Language, diversity and culture: a multi-lingual approach
    Alba Santamaria, Flor
    INFANCIAS IMAGENES, 2022, 21 (01):
  • [5] A multi-lingual speech recognition system using a neural network approach
    Chen, OTC
    Chen, CY
    Chang, HT
    Hsu, FR
    Yang, HL
    Lee, YG
    ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, : 1576 - 1581
  • [6] Cloudlet-Based Multi-Lingual Dictionaries
    Achanta, Vamsi Subhash
    Sureshbabu, Nishanth Talanki
    Thomas, Veena
    Sahitya, M. Lakshmi
    Rao, Shrisha
    2012 THIRD INTERNATIONAL CONFERENCE ON SERVICES IN EMERGING MARKETS (ICSEM), 2012, : 30 - 36
  • [7] Multi-lingual geoparsing based on machine translation
    Chen, Xu
    Gelernter, Judith
    Zhang, Han
    Liu, Jin
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 96 : 667 - 677
  • [8] Mobile multi-lingual automatic interpretation system Mobilingual
    Anon
    2002, Hitachi Ltd.
  • [9] Input Text Repairing for Multi-lingual Chat System
    Yoshida, Kenichi
    Hattori, Fumio
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION: INFORMATION AND INTERACTION, PT II, 2009, 5618 : 210 - 217
  • [10] Development of the "VoiceTra" Multi-Lingual Speech Translation System
    Matsuda, Shigeki
    Hayashi, Teruaki
    Ashikari, Yutaka
    Shiga, Yoshinori
    Kashioka, Hidenori
    Yasuda, Keiji
    Okuma, Hideo
    Uchiyama, Masao
    Sumita, Eiichiro
    Kawai, Hisashi
    Nakamura, Satoshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2017, E100D (04): : 621 - 632