Fragmentary pattern matching: Complexity, algorithms and applications for analyzing classic literary works

Hideaki Hori, Shinichi Shimozono, Masayuki Takeda, Ayumi Shinohara

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Citations (Scopus)

Abstract

A fragmentary pattern is a multiset of non-empty strings, and it matches a string w if all the strings in it occur within w without any overlaps. We study some fundamental issues on computational complexity related to the matching of fragmentary patterns. We show that the fragmentary pattern matching problem is NP-complete, and the problem to find a fragmentary pattern common to two strings that maximizes the pattern score is NP-hard. Moreover, we propose a polynomialtime approximation algorithm for the fragmentary pattern matching, and show that it achieves a constant worst-case approximation ratio if either the strings in a pattern have the same length, or the importance weights of strings in a pattern are proportional to their lengths.

Original languageEnglish
Title of host publicationAlgorithms and Computation - 12th International Symposium, ISAAC 2001, Proceedings
Pages719-730
Number of pages12
DOIs
Publication statusPublished - 2001
Event12th International Symposium on Algorithms and Computation, ISAAC 2001 - Christchurch, New Zealand
Duration: 2001 Dec 192001 Dec 21

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2223 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference12th International Symposium on Algorithms and Computation, ISAAC 2001
Country/TerritoryNew Zealand
CityChristchurch
Period01/12/1901/12/21

Keywords

  • Fragmentary pattern
  • NP-completeness
  • Polynomial-time approximation
  • String matching
  • String resemblance

Fingerprint

Dive into the research topics of 'Fragmentary pattern matching: Complexity, algorithms and applications for analyzing classic literary works'. Together they form a unique fingerprint.

Cite this