Given a pattern string
P=
p
1
p
2···
p
m
and
K parallel text strings
$
\mathbb{T} = \left\{ {T^k = t_1^k \cdots t_n^k |1 \leqslant k \leqslant K} \right\}
$
\mathbb{T} = \left\{ {T^k = t_1^k \cdots t_n^k |1 \leqslant k \leqslant K} \right\}
over an integer alphabet
Σ, our task is to find the smallest integer
κ > 0 such that
P can be split into
κ pieces
P=
P
1...
P
κ
, where each
P
i has an occurrence in some text track
$
T^{k_i }
$
T^{k_i }
and these partial occurrences retain the order. We study some variations of this minimum splitting problem, such as splittings
with limited gaps and transposition invariance, and show how to use sparse dynamic programming to solve the variations efficiently.
In particular, we show that the minimum splitting problem can be interpreted as a shortest path problem on line segments.