Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
|
 |
Prefetching in Content Distribution Networks via Web Communities Identification and Outsourcing
| |
|
Prefetching in Content Distribution Networks via Web Communities Identification and Outsourcing
Antonis Sidiropoulos1 , George Pallis1 , Dimitrios Katsaros1, 2 , Konstantinos Stamos1 , Athena Vakali1 and Yannis Manolopoulos1 
| (1) |
Informatics Department, Aristotle University, Thessaloniki, 54124, Greece |
| (2) |
Computer & Communication Engineering Department, University of Thessaly, Gklavani 37, Volos, 38221, Greece |
Received: 30 November 2005 Revised: 5 March 2007 Accepted: 19 March 2007 Published online: 15 August 2007
Abstract Content distribution networks (CDNs) improve scalability and reliability, by replicating content to the “edge” of the Internet.
Apart from the pure networking issues of the CDNs relevant to the establishment of the infrastructure, some very crucial data
management issues must be resolved to exploit the full potential of CDNs to reduce the “last mile” latencies. A very important
issue is the selection of the content to be prefetched to the CDN servers. All the approaches developed so far, assume the
existence of adequate content popularity statistics to drive the prefetch decisions. Such information though, is not always
available, or it is extremely volatile, turning such methods problematic. To address this issue, we develop self-adaptive
techniques to select the outsourced content in a CDN infrastructure, which requires no apriori knowledge of request statistics.
We identify clusters of “correlated” Web pages in a site, called Web site communities, and make these communities the basic outsourcing unit. Through a detailed simulation environment, using both real and synthetic
data, we show that the proposed techniques are very robust and effective in reducing the user-perceived latency, performing
very close to an unfeasible, off-line policy, which has full knowledge of the content popularity.
Keywords data dissemination techniques on the web - content distribution networks - web communities - web prefetching - internet and web-based - web data mining
Fulltext Preview (Small, Large)
 References secured to subscribers.
|
|
|
|
|
|