Comparing set-covering strategies for optimal corpus design

被引:0
|
作者
Chevelu, Jonathan [1 ]
Barbot, Nelly [1 ]
Boeffard, Olivier [1 ]
Delhay, Arnaud [1 ]
机构
[1] Univ Rennes 1, IRISA, Lannion, France
关键词
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
This article is interested in the problem of the linguistic content of a speech corpus. Depending on the target task, the phonological and linguistic content of the corpus is controlled by collecting a set of sentences which covers a preset description of phonological attributes under the constraint of an overall duration as small as possible. This goal is classically achieved by greedy algorithms which however do not guarantee the optimality of the desired cover. In recent works, a lagrangian-based algorithm, called LamSCP, has been used to extract coverings of diphonemes from a large corpus in French, giving better results than a greedy algorithm. We propose to keep comparing both algorithms in terms of the shortest duration, stability and robustness by achieving multi-represented diphoneme or triphoneme covering. These coverings correspond to very large scale optimization problems, from a corpus in English. For each experiment, LamSCP improves the greedy results from 3.9 to 9.7 percent.
引用
收藏
页码:2951 / 2956
页数:6
相关论文
共 50 条
  • [1] OPTIMAL TAXIWAY REPAIR - A SET-COVERING APPROACH
    HARNETT, RM
    KIEL, GC
    [J]. INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES, 1985, 14 (06): : 405 - 419
  • [2] Comparing performance of Different Set-Covering Strategies for Linguistic Content Optimization in Speech Corpora
    Barbot, Nelly
    Boeffard, Olivier
    Delhay, Arnaud
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 969 - 974
  • [3] SET-COVERING PROBLEM
    BALAS, E
    PADBERG, MW
    [J]. OPERATIONS RESEARCH, 1972, 20 (06) : 1152 - 1161
  • [4] Heuristic and optimal solutions for set-covering problems in conservation biology
    Moore, JL
    Folkmann, M
    Balmford, A
    Brooks, T
    Burgess, N
    Rahbek, C
    Williams, PH
    Krarup, J
    [J]. ECOGRAPHY, 2003, 26 (05) : 595 - 601
  • [5] ALGORITHM FOR SET-COVERING PROBLEMS
    GONDRAN, M
    LAURIERE, JL
    [J]. REVUE FRANCAISE D AUTOMATIQUE INFORMATIQUE RECHERCHE OPERATIONNELLE, 1975, (NV2): : 33 - 51
  • [6] The probabilistic set-covering problem
    Beraldi, P
    Ruszczynski, A
    [J]. OPERATIONS RESEARCH, 2002, 50 (06) : 956 - 967
  • [7] APPLICATIONS OF LOCATION SET-COVERING PROBLEM
    REVELLE, C
    TOREGAS, C
    FALKSON, L
    [J]. GEOGRAPHICAL ANALYSIS, 1976, 8 (01) : 67 - 76
  • [8] Optimal sensor allocation by integrating causal models and set-covering algorithms
    Li, Jing
    Jin, Jionghua
    [J]. IIE TRANSACTIONS, 2010, 42 (08) : 564 - 576
  • [9] A LAGRANGIAN HEURISTIC FOR SET-COVERING PROBLEMS
    BEASLEY, JE
    [J]. NAVAL RESEARCH LOGISTICS, 1990, 37 (01) : 151 - 164
  • [10] AN APPROACH TO THE SOLUTION OF THE SET-COVERING PROBLEM
    ROSHCHIN, VA
    SERGIENKO, IV
    [J]. CYBERNETICS, 1984, 20 (06): : 849 - 855