Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Text-Based Content Search and Retrieval in Ad-hoc P2P Communities

Francisco Matias Cuenca-AcunaContact Information and Thu D. NguyenContact Information

(9)  Department of Computer Science, Rutgers University, 110 Frelinghuysen Rd, Piscataway, NJ, 08854
Abstract
We consider the problem of content search and retrieval in peer-to-peer (P2P) communities. P2P computing is a potentially powerful model for information sharing between ad hoc groups of users because of its low cost of entry and natural model for resource scaling. As P2P communities grow, however, locating information distributed across the large number of peers becomes problematic. We address this problem by adapting a state-of-the-art text-based document ranking algorithm, the vector-space model instantiated with the TFxIDF ranking rule, to the P2P environment. We make three contributions: (a) we show how to approximate TFxIDF using compact summaries of individual peers’ inverted indexes rather than the inverted index of the entire communal store; (b) we develop a heuristic for adaptively determining the set of peers that should be contacted for a query; and (c) we show that our algorithm tracks TFxIDF’s performance very closely, giving P2P communities a search and retrieval algorithm as good as that possible assuming a centralized server.
This work was supported in part by NSF grants EIA-0103722 and EIA-9986046.

Contact Information Francisco Matias Cuenca-Acuna
Email: mcuenca@cs.rutgers.edu

Contact Information Thu D. Nguyen
Email: tdnguyen@cs.rutgers.edu
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Remote Address: 38.107.191.108 • Server: mpweb05
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)