Automatic generation of regular expressions for the Regex Golf challenge using a local search algorithm

被引:0
|
作者
André de Almeida Farzat
Márcio de Oliveira Barros
机构
[1] Federal University of the State of Rio de Janeiro,
关键词
Regular expressions; Regex Golf; Local search; Heuristic search;
D O I
暂无
中图分类号
学科分类号
摘要
Regular expression is a technology widely used in software development for extracting textual data, validating the structure of textual documents, or formatting data. Regex Golf is a challenge that consists in finding the smallest possible regular expression given a set of sentences to perform matches and another set not to match. An algorithm capable of meeting the Regex Golf requirements is a relevant contribution to the area of semi-structured document data extraction. In this paper, we propose a heuristic search algorithm based on local search, combined with a regular expression shrinker, to find valid results for Regex Golf problems. An experimental study was conducted to compare the proposed technique with an exact algorithm and a genetic programming algorithm designed for the Regex Golf challenge. The proposed local search was shown to outperform both competing algorithms in six out of fifteen problem instances, tying in another three instances. On the other hand, all algorithms still lack the ability to outperform human software developers in designing regular expressions for the challenge.
引用
收藏
页码:105 / 131
页数:26
相关论文
共 50 条
  • [21] Automatic Generation and Optimization of Test case using Hybrid Cuckoo Search and Bee Colony Algorithm
    Lakshminarayana, P.
    SureshKumar, Dr T. V.
    JOURNAL OF INTELLIGENT SYSTEMS, 2021, 30 (01) : 59 - 72
  • [22] Automatic Generation Control of Multi-area Power System Using Gravitational Search Algorithm
    Sahu, Rabindra Kumar
    Rout, Umesh Kumar
    Panda, Sidhartha
    SWARM, EVOLUTIONARY, AND MEMETIC COMPUTING, PT I (SEMCCO 2013), 2013, 8297 : 537 - 546
  • [23] Memetic Algorithm for Intense Local Search Methods Using Local Search Chains
    Molina, Daniel
    Lozano, Manuel
    Garcia-Martinez, C.
    Herrera, Francisco
    HYBRID METAHEURISTICS, PROCEEDINGS, 2008, 5296 : 58 - +
  • [24] Constraint-Based Local Search for the Automatic Generation of Architectural Tests
    Van Hentenryck, Pascal
    Coffrin, Carleton
    Gutkovich, Boris
    PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING, 2009, 5732 : 787 - +
  • [25] Automatic Generation Control Using Genetic Algorithm
    Joshi, G. K.
    Mathur, Sumit
    Mathur, Sanjay
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (06): : 1 - 6
  • [26] Application of a Semantic Search Algorithm to Semi-Automatic GUI Generation
    Pazienza, Maria Teresa
    Scarpato, Noemi
    Stellato, Armando
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 3631 - 3638
  • [27] Automatic Generation of Dance and Facial Expressions Linked to Music using HMM
    Sato, Taiki
    Osana, Yuko
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3999 - 4006
  • [28] Automatic generation control using disrupted gravitational search algorithm based proportional integral derivative controller
    Preeti
    Sharma, Veena
    Naresh, R.
    Pulluri, Harish
    2015 2ND INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN ENGINEERING & COMPUTATIONAL SCIENCES (RAECS), 2015,
  • [29] Automatic Generation Control of Hydropower Systems Using a Novel Quasi-oppositional Harmony Search Algorithm
    Shiva, Chandan Kumar
    Mukherjee, Vivekananda
    ELECTRIC POWER COMPONENTS AND SYSTEMS, 2016, 44 (13) : 1478 - 1491
  • [30] Automatic generation control of power system using a novel quasi-oppositional harmony search algorithm
    Shiva, Chandan Kumar
    Shankar, G.
    Mukherjee, V.
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2015, 73 : 787 - 804