共 50 条
- [1] Stochastic Tokenization with a Language Model for Neural Text Classification [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1620 - 1629
- [2] Language-Independent Text Tokenization Using Unsupervised Deep Learning [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2023, 35 (01): : 321 - 334
- [3] Character N-Gram Tokenization for European Language Text Retrieval [J]. Information Retrieval, 2004, 7 : 73 - 97
- [4] Character N-gram tokenization for European language text retrieval [J]. INFORMATION RETRIEVAL, 2004, 7 (1-2): : 73 - 97
- [5] Tokenization-based data augmentation for text classification [J]. 2022 19TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE 2022), 2022,
- [6] Text-based Language Identification for Some of the Under-resourced Languages of South Africa [J]. 2016 THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND ENGINEERING (ICACCE 2016), 2016, : 303 - 307
- [8] Morpheme based language models for speech recognition of Czech [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 211 - 216
- [9] Sentence Tokenization Using Statistical Unsupervised Machine Learning and Rule-Based Approach for Running Text in Gujarati Language [J]. EMERGING TRENDS IN EXPERT APPLICATIONS AND SECURITY, 2019, 841 : 319 - 326
- [10] Morpheme-based language modeling for Arabic LVCSR [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1053 - 1056