View Related Documents

Abstract

In order to evaluate the security rank of document owner (sender or Web station), the documents discrimination is widely adopted to classify Web documents. The old strategy based on keywords matching often leads to low precision. This article proposes a new model called CKPU (Classifying by key Phrase Understanding) including the key sentence template, mining threshold vector, objective and, subjective discriminating. The experiment result shows that the algorithms are efficient for discriminating documents.

Keywords  File discrimination - Key sentence template - Phrase-understanding

Supported by the of National Science Foundation of China grant #60073046.

Fulltext Preview

Image of the first page of the fulltext document