Segment Match Refinement and Applications
Aaron L. Halpern6
, Daniel H. Huson6
and Knut Reinert6 
| (6) |
Informatics Research, Celera Genomics Corp., 45 W Gude Drive, 20850 Rockville, MD, USA |
Abstract
Comparison of large, unfinished genomic sequences requires fast methods that are robust to misordering, misorientation, and
duplications. A number of fast methods exist that can compute local similarities between such sequences, from which an optimal
one-to-one correspondence might be desired. However, existing methods for computing such a correspondence are either too costly
to run or are inappropriate for unfinished sequence. We propose an efficient method for refining a set of segment matches
such that the resulting segments are of maximal size without non-identity overlaps. This resolved set of segments can be used
in various ways to compute a similarity measure between any two large sequences, and hence can be used in alignment, matching,
or tree construction algorithms for two or more sequences.
New address: WSI-AB, Tübingen University, Sand 14, 72076 Tübingen, Germany.
References secured to subscribers.