In order to evaluate the security rank of document owner (sender or Web station), the documents discrimination is widely adopted
to classify Web documents. The old strategy based on keywords matching often leads to low precision. This article proposes
a new model called CKPU (Classifying by key Phrase Understanding) including the key sentence template, mining threshold vector,
objective and, subjective discriminating. The experiment result shows that the algorithms are efficient for discriminating
documents.
Keywords File discrimination - Key sentence template - Phrase-understanding
Supported by the of National Science Foundation of China grant #60073046.