Automatic generation of regular expressions for the Regex Golf challenge using a local search algorithm

被引:0
|
作者
André de Almeida Farzat
Márcio de Oliveira Barros
机构
[1] Federal University of the State of Rio de Janeiro,
关键词
Regular expressions; Regex Golf; Local search; Heuristic search;
D O I
暂无
中图分类号
学科分类号
摘要
Regular expression is a technology widely used in software development for extracting textual data, validating the structure of textual documents, or formatting data. Regex Golf is a challenge that consists in finding the smallest possible regular expression given a set of sentences to perform matches and another set not to match. An algorithm capable of meeting the Regex Golf requirements is a relevant contribution to the area of semi-structured document data extraction. In this paper, we propose a heuristic search algorithm based on local search, combined with a regular expression shrinker, to find valid results for Regex Golf problems. An experimental study was conducted to compare the proposed technique with an exact algorithm and a genetic programming algorithm designed for the Regex Golf challenge. The proposed local search was shown to outperform both competing algorithms in six out of fifteen problem instances, tying in another three instances. On the other hand, all algorithms still lack the ability to outperform human software developers in designing regular expressions for the challenge.
引用
收藏
页码:105 / 131
页数:26
相关论文
共 50 条
  • [41] Automatic Unsupervised Feature Selection using Gravitational Search Algorithm
    Kumar, Vijay
    Chhabra, Jitender Kumar
    Kumar, Dinesh
    IETE JOURNAL OF RESEARCH, 2015, 61 (01) : 22 - 31
  • [42] Research on the Automatic Generation Algorithm of Model Local Features Based on LOD Technology
    Li, Min
    Sun, Dapeng
    2014 2ND INTERNATIONAL CONFERENCE IN HUMANITIES, SOCIAL SCIENCES AND GLOBAL BUSINESS MANAGEMENT (ISSGBM 2014), VOL 29, 2014, 29 : 218 - 222
  • [43] Automatic boiling water reactor control rod pattern design using particle swarm optimization algorithm and local search
    Wang, Cheng-Der
    Lin, Chaung
    NUCLEAR ENGINEERING AND DESIGN, 2013, 255 : 273 - 279
  • [44] TCSC based automatic generation control of deregulated power system using quasi-oppositional harmony search algorithm
    Nandi, Mahendra
    Shiva, C. K.
    Mukherjee, V.
    ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2017, 20 (04): : 1380 - 1395
  • [45] A Novel Approach for Automatic Generation Control of Multi Area Power Systems with Nonlinearity Using General Relativity Search Algorithm
    Kumar, Amit
    Srungavarapu, Gopalakrishna
    Beiranvand, Hamzeh
    Rokrok, Esmaeel
    2016 IEEE ANNUAL INDIA CONFERENCE (INDICON), 2016,
  • [46] Automatic Generation Control of the Two Area Non-reheat Thermal Power System using Gravitational Search Algorithm
    Duman, Serhat
    Yorukeren, Nuran
    PRZEGLAD ELEKTROTECHNICZNY, 2012, 88 (10A): : 254 - 259
  • [47] ASCII Art Generation using the Local Exhaustive Search on the GPU
    Takeuchi, Yuji
    Takafuji, Daisuke
    Ito, Yasuaki
    Nakano, Koji
    2013 FIRST INTERNATIONAL SYMPOSIUM ON COMPUTING AND NETWORKING (CANDAR), 2013, : 194 - 199
  • [48] Solution Approach to Automatic Generation Control Problem Using Hybridized Gravitational Search Algorithm Optimized PID and FOPID Controllers
    Dahiya, Preeti
    Sharma, Veena
    Naresh, Ram
    ADVANCES IN ELECTRICAL AND COMPUTER ENGINEERING, 2015, 15 (02) : 23 - 34
  • [49] Development of Automatic Generation of Maintenance Worker Schedule Using Tabu Search
    Kokubo, Tatsuya
    Kato, Satoshi
    Nakahigashi, Taichi
    Quarterly Report of RTRI (Railway Technical Research Institute), 2024, 65 (03) : 170 - 175
  • [50] Impact of PEVs on Automatic Generation Control Using TLBO Algorithm
    Dixit, Mukul
    Roy, Ranjit
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,