Lecture Notes in Computer Science, 2005, Volume 3614/2005, 490, DOI: 10.1007/11540007_2

Cross-Document Transliterated Personal Name Coreference Resolution

Houfeng Wang

View Related Documents

Abstract

This paper presents a two-step approach to determining whether a transliterated personal name from different Chinese texts stands for the same referent. A heuristic strategy based on biographical information and “colleague” names is firstly used to form an initial set of coreference chains, and then, a clustering algorithm based Vector Space Model (VSM) is applied to merge chains under the control of a full name consistent constraint. Experimental results show that this approach achieves a good performance.
Supported by National Natural Science Foundation of China (No.60473138, 60173005)

Fulltext Preview

Image of the first page of the fulltext document