Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
|
 |
Adjacency Matrix Based Full-Text Indexing Models
| |
|
Adjacency Matrix Based Full-Text Indexing Models
Shuigeng Zhou7 , Jihong Guan8 , Yunfa Hu9 , Jiangtao Hu9 and Aoying Zhou9 
| (7) |
State Key Lab of Software Engineering, Wuhan University, Wuhan, 430072 |
| (8) |
School of Computer Science, Wuhan University, Wuhan, 430072 |
| (9) |
Computer Science Department, Fudan University, Shanghai, 200433 |
Abstract
This paper proposes two new character-based full-text indexing models, i.e., adjacency matrix based inverted file and adjacency matrix based PAT array. Formally, the former is a kind of reorganization
of the traditional inverted file, and the latter is a kind of decomposition of the traditional PAT array. Both organize text-indexing
information in the form of adjacency matrix. Query algorithms for the new models are developed and performance comparisons
between the new models and the traditional models are carried out. The new models can improve query-processing efficiency
considerably at the cost of much less amount of extra storage overhead compared to the size of original text database, so
are suitable for applications of large-scale text databases, especially Chinese text databases.
This work was supported by China Postdoctoral Science Foundation and National 863 Hi-Tech Foundation (No. 863-306-ZT04-02-2).
Fulltext Preview (Small, Large)
 References secured to subscribers.
|
|
|
|
|
|