
Parallel Information Retrieval on an SCI-Based PC-NOW

Sang-Hwa Chung, Hyuk-Chul Kwon, Kwang Ryel Ryu, Han-Kook Jang, Jin-Hyuk Kim and Cham-Ah Choi

Division of Computer Science and Engineering, Pusan National University, Pusan, 609-735, Korea
Abstract
This paper presents an efficient parallel information retrieval (IR) system that provides fast information service for Internet users in a low-cost, high-performance PC-NOW environment. The IR system is implemented on a PC cluster based on the Scalable Coherent Interface (SCI), a powerful interconnect that supports both shared-memory and message-passing models. In the IR system, the inverted-index file (IIF) is partitioned into pieces using a greedy declustering algorithm, and the pieces are distributed to the cluster nodes and stored on each node's hard disk. For each incoming user query with multiple terms, the terms are sent to the nodes that hold the relevant pieces of the IIF, where they are evaluated in parallel. In the experiments, the IR system outperforms an MPI-based IR system that uses Fast Ethernet as its interconnect. A speedup of up to 4.0 was obtained with an 8-node cluster when processing each query on a 500,000-document IIF.
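
The partition-and-route idea described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the greedy declustering algorithm, so the heuristic below (place the longest posting list first on the currently least-loaded node) is only an assumed stand-in and may differ from the authors' method. The function names `decluster` and `route_query`, the term names, and the posting-list sizes are all hypothetical.

```python
import heapq
from collections import defaultdict


def decluster(posting_sizes, num_nodes):
    """Assumed greedy heuristic: largest posting list first, to the least-loaded node."""
    # Min-heap of (accumulated load, node id); ties go to the lower node id.
    heap = [(0, node) for node in range(num_nodes)]
    heapq.heapify(heap)
    placement = {}
    # Sort by descending size, then by term, so the result is deterministic.
    for term, size in sorted(posting_sizes.items(), key=lambda kv: (-kv[1], kv[0])):
        load, node = heapq.heappop(heap)
        placement[term] = node
        heapq.heappush(heap, (load + size, node))
    return placement


def route_query(terms, placement):
    """Group query terms by the node that stores their piece of the IIF."""
    by_node = defaultdict(list)
    for term in terms:
        if term in placement:  # terms absent from the index are skipped
            by_node[placement[term]].append(term)
    return dict(by_node)


# Hypothetical posting-list sizes (number of entries per term).
sizes = {"cluster": 900, "index": 700, "sci": 400, "query": 300, "pc": 200}
placement = decluster(sizes, num_nodes=2)
routed = route_query(["cluster", "sci", "pc"], placement)
print(routed)
```

In this sketch each node would then evaluate its own group of terms against its local piece of the IIF, and the partial results would be merged. The merge step and the SCI or MPI communication layer are omitted here.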

Sang-Hwa Chung
Email: shchung@hyowon.pusan.ac.kr

Hyuk-Chul Kwon
Email: hckwon@hyowon.pusan.ac.kr

Kwang Ryel Ryu
Email: krryu@hyowon.pusan.ac.kr

Han-Kook Jang
Email: hkjang@hyowon.pusan.ac.kr

Jin-Hyuk Kim
Email: variant@hyowon.pusan.ac.kr

Cham-Ah Choi
Email: cca@hyowon.pusan.ac.kr