Harnessing Pre-Trained Neural Networks with Rules for Formality Style Transfer

Authors
Wang, Yunli [1]
Wu, Yu [2]
Mou, Lili [3]
Li, Zhoujun [1]
Chao, Wenhan [1]
Affiliations
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing, Peoples R China
[2] Microsoft Res, Beijing, Peoples R China
[3] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
Funding
National Natural Science Foundation of China
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Formality text style transfer plays an important role in various NLP applications, such as non-native speaker assistants and child education. Early studies normalized informal sentences with rules, before statistical and neural models became the prevailing methods in the field. While a rule-based system is still a common preprocessing step for formality style transfer in the neural era, it can introduce noise if the rules are applied naively, for example as data preprocessing. To mitigate this problem, we study how to incorporate rules into a state-of-the-art neural network that is typically pretrained on massive corpora. We propose three fine-tuning methods in this paper and achieve a new state of the art on benchmark datasets.
Pages: 3573-3578
Page count: 6