Lecture Notes in Computer Science, 2003, Volume 2676/2003, 237-253, DOI: 10.1007/3-540-44888-8_18

On Minimizing Pattern Splitting in Multi-track String Matching

Kjell Lemström and Veli Mäkinen

View Related Documents

Abstract

Given a pattern string P=p 1 p 2···p m and K parallel text strings $ \mathbb{T} = \left\{ {T^k = t_1^k \cdots t_n^k |1 \leqslant k \leqslant K} \right\} $ \mathbb{T} = \left\{ {T^k = t_1^k \cdots t_n^k |1 \leqslant k \leqslant K} \right\} over an integer alphabet Σ, our task is to find the smallest integer κ > 0 such that P can be split into κ pieces P=P 1...P κ , where each P i has an occurrence in some text track $ T^{k_i } $ T^{k_i } and these partial occurrences retain the order. We study some variations of this minimum splitting problem, such as splittings with limited gaps and transposition invariance, and show how to use sparse dynamic programming to solve the variations efficiently. In particular, we show that the minimum splitting problem can be interpreted as a shortest path problem on line segments.

Fulltext Preview

Image of the first page of the fulltext document