Lecture Notes in Computer Science, 2004, Volume 2945/2004, 305-308, DOI: 10.1007/978-3-540-24630-5_37

Sentence Alignment for Spanish-Basque Bitexts: Word Correspondences vs. Markup Similarity

Arantza Casillas, Idoia Fernández and Raquel Martínez

View Related Documents

Abstract

In this paper, we present an evaluation of two different sentence alignment techniques. One is the well-known SIMR algorithm based on word correspondences on both sides of a bitext. The other one is the ALINOR algorithm, which is based on the similarity of the markup on both sides of a bitext. Both algorithms are accurate in 1-1 alignment, but ALINOR works slightly better in the case of N-M alignment.

Fulltext Preview

Image of the first page of the fulltext document