VARIABLE-LENGTH SEQUENCE MODELING - MULTIGRAMS

被引:6
|
作者
BIMBOT, F
PIERACCINI, R
LEVIN, E
ATAL, B
机构
[1] ENST, Dept. Signal, CNRS, Paris
[2] Speech Research Department, AT&T Bell Laboratories, Murray Hill
关键词
D O I
10.1109/97.388911
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The conventional n-gram language model exploits dependencies between words and their fixed-length past, This letter presents a model that represents sentences as a concatenation of variable-length sequences of units and describes an algorithm for unsupervised estimation of the model parameters. The approach is illustrated for the segmentation of sequences of letters into subword-like units, It is evaluated as a language model on a corpus of transcribed spoken sentences. Multigrams can provide a significantly lower test set perplexity than n-gram models.
引用
收藏
页码:111 / 113
页数:3
相关论文
共 50 条
  • [1] Variable-length sequence modeling: multigrams
    CNRS, Paris, France
    IEEE Signal Process Lett, 6 (111-113):
  • [2] Inference of variable-length linguistic and acoustic units by multigrams
    Deligne, S
    Bimbot, F
    SPEECH COMMUNICATION, 1997, 23 (03) : 223 - 241
  • [3] Variable-Length Constrained Sequence Codes
    Steadman, Andrew
    Fair, Ivan
    IEEE COMMUNICATIONS LETTERS, 2013, 17 (01) : 139 - 142
  • [4] Synchronization of Variable-Length Constrained Sequence Codes
    Cao, Congzhe
    Fair, Ivan
    IEEE ACCESS, 2021, 9 : 45864 - 45878
  • [5] Variable-length sequence model for attribute detection in the image
    Li, Xin
    Gu, Jiaming
    Lu, Xiaoyuan
    Ning, Yan
    Zhang, Liang
    Shen, Peiyi
    Gu, Chaochen
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2023, 23 (04) : 1913 - 1927
  • [6] VARIABLE-LENGTH TO VARIABLE-LENGTH ENCODERS ARE ASYMPTOTICALLY MEAN STATIONARY.
    Kieffer, John C.
    Dunham, James G.
    Proceedings - Annual Allerton Conference on Communication, Control, and Computing, 1980, : 438 - 439
  • [7] On the Modeling and Simulation of Variable-Length Pendulum Systems: A Review
    Yakubu, Godiya
    Olejnik, Pawel
    Awrejcewicz, Jan
    ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2022, 29 (04) : 2397 - 2415
  • [8] Statistical language modeling based on variable-length sequences
    Zitouni, I
    Smaïli, K
    Haton, JP
    COMPUTER SPEECH AND LANGUAGE, 2003, 17 (01): : 27 - 41
  • [9] Synapsing variable-length crossover: Meaningful crossover for variable-length genomes
    Hutt, Benjamin
    Warwick, Kevin
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2007, 11 (01) : 118 - 131
  • [10] On the Modeling and Simulation of Variable-Length Pendulum Systems: A Review
    Godiya Yakubu
    Paweł Olejnik
    Jan Awrejcewicz
    Archives of Computational Methods in Engineering, 2022, 29 : 2397 - 2415