Comparing set-covering strategies for optimal corpus design

Authors
Chevelu, Jonathan [1]
Barbot, Nelly [1]
Boeffard, Olivier [1]
Delhay, Arnaud [1]
Affiliations
[1] Univ Rennes 1, IRISA, Lannion, France
Keywords
DOI
Not available
Chinese Library Classification
H0 [Linguistics];
Discipline classification codes
030303; 0501; 050102;
Abstract
This article addresses the problem of designing the linguistic content of a speech corpus. Depending on the target task, the phonological and linguistic content of the corpus is controlled by collecting a set of sentences that covers a preset description of phonological attributes while keeping the overall duration as small as possible. This goal is classically achieved with greedy algorithms, which however do not guarantee the optimality of the resulting cover. In recent work, a Lagrangian-based algorithm called LamSCP has been used to extract coverings of diphonemes from a large French corpus, giving better results than a greedy algorithm. We propose to continue comparing the two algorithms in terms of shortest duration, stability, and robustness by building multi-represented diphoneme or triphoneme coverings. These coverings, extracted from an English corpus, correspond to very large-scale optimization problems. In every experiment, LamSCP improves on the greedy results by 3.9 to 9.7 percent.
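To make the greedy baseline concrete, here is a minimal Python sketch of a greedy multi-represented covering, where each required unit (for example a diphoneme) must appear at least k times. It is not the authors' implementation and it does not show LamSCP. The function name `greedy_cover`, the toy corpus, and the scoring rule (still-needed occurrences gained per second of speech) are all assumptions made for illustration.

```python
from collections import Counter


def greedy_cover(sentences, required, k=1):
    """Greedy multi-cover sketch (hypothetical helper, not from the paper).

    sentences: dict mapping sentence id -> (duration in seconds, list of units)
    required:  set of units that must each be covered at least k times
    Returns the ids of the selected sentences, in selection order.
    """
    need = {unit: k for unit in required}   # occurrences still missing
    remaining = dict(sentences)             # candidates not yet selected
    chosen = []
    while any(n > 0 for n in need.values()):
        best_id, best_score = None, 0.0
        for sid, (duration, units) in remaining.items():
            counts = Counter(units)
            # Occurrences in this sentence that would actually reduce the need.
            gain = sum(min(c, need[u]) for u, c in counts.items()
                       if need.get(u, 0) > 0)
            score = gain / duration         # assumed criterion: gain per second
            if score > best_score:
                best_id, best_score = sid, score
        if best_id is None:
            raise ValueError("corpus cannot satisfy the requested covering")
        _, units = remaining.pop(best_id)
        for u in units:
            if need.get(u, 0) > 0:
                need[u] -= 1
        chosen.append(best_id)
    return chosen


# Toy corpus (invented data): id -> (duration, diphoneme-like units)
corpus = {
    "s1": (2.0, ["a-b", "b-c"]),
    "s2": (1.0, ["a-b"]),
    "s3": (1.5, ["b-c", "c-d"]),
    "s4": (4.0, ["a-b", "b-c", "c-d"]),
}
required = {"a-b", "b-c", "c-d"}
print(greedy_cover(corpus, required))  # selects s3, then s2
```

On this toy input the greedy choice happens to be optimal (2.5 seconds in total), but in general each step commits to a locally best sentence and never revisits it, which is why the abstract notes that greedy algorithms do not guarantee an optimal cover.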
Pages: 2951-2956
Number of pages: 6