Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Prefetching in Content Distribution Networks via Web Communities Identification and Outsourcing

Antonis SidiropoulosContact Information, George PallisContact Information, Dimitrios Katsaros1, 2 Contact Information, Konstantinos StamosContact Information, Athena VakaliContact Information and Yannis ManolopoulosContact Information

(1)  Informatics Department, Aristotle University, Thessaloniki, 54124, Greece
(2)  Computer & Communication Engineering Department, University of Thessaly, Gklavani 37, Volos, 38221, Greece

Received: 30 November 2005  Revised: 5 March 2007  Accepted: 19 March 2007  Published online: 15 August 2007

Abstract  Content distribution networks (CDNs) improve scalability and reliability, by replicating content to the “edge” of the Internet. Apart from the pure networking issues of the CDNs relevant to the establishment of the infrastructure, some very crucial data management issues must be resolved to exploit the full potential of CDNs to reduce the “last mile” latencies. A very important issue is the selection of the content to be prefetched to the CDN servers. All the approaches developed so far, assume the existence of adequate content popularity statistics to drive the prefetch decisions. Such information though, is not always available, or it is extremely volatile, turning such methods problematic. To address this issue, we develop self-adaptive techniques to select the outsourced content in a CDN infrastructure, which requires no apriori knowledge of request statistics. We identify clusters of “correlated” Web pages in a site, called Web site communities, and make these communities the basic outsourcing unit. Through a detailed simulation environment, using both real and synthetic data, we show that the proposed techniques are very robust and effective in reducing the user-perceived latency, performing very close to an unfeasible, off-line policy, which has full knowledge of the content popularity.

Keywords  data dissemination techniques on the web - content distribution networks - web communities - web prefetching - internet and web-based - web data mining


Contact Information Antonis Sidiropoulos
Email: asidirop@csd.auth.gr

Contact Information George Pallis (Corresponding author)
Email: gpallis@csd.auth.gr

Contact Information Dimitrios Katsaros
Email: dkatsaro@csd.auth.gr

Contact Information Konstantinos Stamos
Email: kstamos@csd.auth.gr

Contact Information Athena Vakali
Email: avakali@csd.auth.gr

Contact Information Yannis Manolopoulos
Email: manolopo@csd.auth.gr
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this article
Export this article as RIS | Text
 
Referenced by
4 newer articles

  1. Bianchini, Devis (2009) Emergent Semantics and Cooperation in Multi-knowledge Communities: the ESTEEM Approach. World Wide Web
    [CrossRef]
  2. YANG, Bo (2009) . Journal of Software 20(1)
    [CrossRef]
  3. Wei, Fang (2009) Detecting Overlapping Community Structures in Networks. World Wide Web
    [CrossRef]
  4. Zhang, Yanchun (2009) On web communities mining and recommendation. Concurrency and Computation Practice and Experience
    [CrossRef]
Remote Address: 38.107.191.114 • Server: mpweb17
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)