Validating Syntactic Correctness Using Unsupervised Clustering Algorithms

被引:0
|
作者
Noh, Sanguk [1 ]
Chung, Kihyun [2 ]
Shim, Jaebock [3 ]
机构
[1] Catholic Univ Korea, Sch Comp Sci & Informat Engn, Bucheon Si 14662, South Korea
[2] Ajou Univ, Div Elect Engn, Suwon 16499, South Korea
[3] Deltaindex Inc, Daejeon 34027, South Korea
关键词
recommendation of syntactically correct sentence; unsupervised clustering algorithms; autoencoding procedure; software requirement specifications; CLASSIFICATION;
D O I
10.3390/electronics11142113
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
When developing a complex system in an open platform setting, users need to compose and maintain a systematic requirement specification. This paper proposes a solution to guarantee a syntactically accurate requirement specification that minimizes the ambiguity caused by ungrammatical sentences. Our system has a set of standard jargon and templates that are used as a guideline to write grammatically correct sentences. Given a database of standard technical Korean (STK) templates, the system that we have designed and implemented divides a new sentence into a specific cluster. If the system finds an identical template in a cluster, it confirms the new sentence as a sound one. Otherwise, the system uses unsupervised clustering algorithms to return the template that most closely resembles the syntax of the inputted sentence. We tested our proposed system in the field of open platform development for a railway train. In the experiment, our system learned to partition templates into clusters while reducing null attributes of an instance using the autoencoding procedure. Given a set of clusters, the system was able to successfully recommend templates that were syntactically similar to the structure of the inputted sentence. Since the degree of similarity for 500 instances was 97.00% on average, we conclude that our robust system can provide an appropriate template that users can use to modify their syntactically incorrect sentences.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Clustering Morphological Paradigms Using Syntactic Categories
    Can, Burcu
    Manandhar, Suresh
    MULTILINGUAL INFORMATION ACCESS EVALUATION I: TEXT RETRIEVAL EXPERIMENTS, 2010, 6241 : 641 - 648
  • [32] Correctness of dataflow and systolic algorithms using algebras of streams
    Meinke, K
    Steggles, LJ
    ACTA INFORMATICA, 2001, 38 (01) : 45 - 88
  • [33] Correctness of dataflow and systolic algorithms using algebras of streams
    K. Meinke
    L.J. Steggles
    Acta Informatica, 2001, 38 : 45 - 88
  • [34] On the Correctness of Parallel Algorithms
    Karpov, Y. G.
    Borshchev, A. V.
    Rudakov, V. V.
    Programming and Computer Software (English Translation of Programmirovanie), 22 (04):
  • [35] On the correctness of parallel algorithms
    Karpov, YG
    Borshchev, AV
    Rudakov, VV
    PROGRAMMING AND COMPUTER SOFTWARE, 1996, 22 (04) : 164 - 171
  • [36] CORRECTNESS OF NUMERICAL ALGORITHMS
    HULL, TE
    SIAM REVIEW, 1971, 13 (02) : 270 - &
  • [37] Detection of Cardiac Arrhythmias Using Unsupervised Learning: A Preliminary Approach Based on PAM and CLARA Clustering Algorithms
    Alvarino-Duran, Jessica
    Hernandez-Ocana, Betania
    Hernandez-Torruco, Jose
    Chavez-Bosquez, Oscar
    ADVANCES IN DIGITAL HEALTH AND MEDICAL BIOENGINEERING, VOL 1, EHB-2023, 2024, 109 : 594 - 601
  • [38] Unsupervised Learning for Syntactic Disambiguation
    Gelbukh, Alexander
    COMPUTACION Y SISTEMAS, 2014, 18 (02): : 329 - 344
  • [39] A new fuzzy possibility clustering algorithms based on unsupervised mahalanobis distances
    Liu, Hsiang-Chuan
    Yih, Jeng-Ming
    Sheu, Tian-Wei
    Liu, Shin-Wu
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 3939 - 3944
  • [40] Hybridization of Particle Swarm Optimization with Unsupervised Clustering Algorithms for Image Segmentation
    Liu, Wenping
    McGrath, Ethan
    Hung, Chih-Cheng
    Kuo, Bor-Chen
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2008, 10 (03) : 217 - 230