Solving String Problems on Graphs Using the Labeled Direct Product

被引:4
|
作者
Rizzo, Nicola [1 ]
Tomescu, Alexandru, I [1 ]
Policriti, Alberto [2 ]
机构
[1] Univ Helsinki, Dept Comp Sci, Helsinki, Finland
[2] Univ Udine, Dept Math Comp Sci & Phys, Udine, Italy
基金
芬兰科学院; 欧洲研究理事会;
关键词
Longest repeated substring; Longest common substring; String algorithm; Graph algorithm; Motif discovery; Fine-grained complexity; SUFFIX TREE; FINITE AUTOMATA; AMBIGUITY; COMPLEXITY;
D O I
10.1007/s00453-022-00989-x
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Suffix trees are an important data structure at the core of optimal solutions to many fundamental string problems, such as exact pattern matching, longest common substring, matching statistics, and longest repeated substring. Recent lines of research focused on extending some of these problems to vertex-labeled graphs, either by using efficient ad-hoc approaches which do not generalize to all input graphs, or by indexing difficult graphs and having worst-case exponential complexities. In the absence of an ubiquitous and polynomial tool like the suffix tree for labeled graphs, we introduce the labeled direct product of two graphs as a general tool for obtaining optimal algorithms in the worst case: we obtain conceptually simpler algorithms for the quadratic problems of string matching (SMLG) and longest common substring (LCSP) in labeled graphs. Our algorithms run in time linear in the size of the labeled product graph, which may be smaller than quadratic for some inputs, and their run-time is predictable, because the size of the labeled direct product graph can be precomputed efficiently. We also solve LCSP on graphs containing cycles, which was left as an open problem by Shimohira et al. in 2011. To show the power of the labeled product graph, we also apply it to solve the matching statistics (MSP) and the longest repeated string (LRSP) problems in labeled graphs. Moreover, we show that our (worst-case quadratic) algorithms are also optimal, conditioned on the Orthogonal Vectors Hypothesis. Finally, we complete the complexity picture around LRSP by studying it on undirected graphs.
引用
收藏
页码:3008 / 3033
页数:26
相关论文
共 50 条
  • [1] Solving String Problems on Graphs Using the Labeled Direct Product
    Nicola Rizzo
    Alexandru I. Tomescu
    Alberto Policriti
    Algorithmica, 2022, 84 : 3008 - 3033
  • [2] Compressed indexes for string searching in labeled graphs
    Dipartimento di Informatica, University of Pisa, Italy
    Proc. Int. Conf. World Wide Web, WWW, (322-332):
  • [3] Compressed Indexes for String Searching in Labeled Graphs
    Ferragina, Paolo
    Piccinno, Francesco
    Venturini, Rossano
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 322 - 332
  • [4] Using contracted solution graphs for solving reconfiguration problems
    Bonsma, Paul
    Paulusma, Daniel
    ACTA INFORMATICA, 2019, 56 (7-8) : 619 - 648
  • [5] Using planning graphs for solving HTN planning problems
    Lotem, A
    Nau, DS
    Hendler, JA
    SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), 1999, : 534 - 540
  • [6] Using contracted solution graphs for solving reconfiguration problems
    Paul Bonsma
    Daniël Paulusma
    Acta Informatica, 2019, 56 : 619 - 648
  • [7] Solving submodular text processing problems using influence graphs
    Vardasbi, Ali
    Faili, Heshaam
    Asadpour, Masoud
    SOCIAL NETWORK ANALYSIS AND MINING, 2019, 9 (01)
  • [8] Solving submodular text processing problems using influence graphs
    Ali Vardasbi
    Heshaam Faili
    Masoud Asadpour
    Social Network Analysis and Mining, 2019, 9
  • [9] Using Matrices and Graphs for Solving Design Optimization Problems.
    Kalaschnikow, Waleriy
    Lissjak, Wladimir
    Tischtschenko, Walentin
    Wissenschaftliche Zeitschrift - Technische Hochschule Ilmenau, 1980, 26 (04): : 159 - 168
  • [10] On direct product cancellation of graphs
    Hammack, Richard H.
    DISCRETE MATHEMATICS, 2009, 309 (08) : 2538 - 2543