Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

WebVigiL: User Profile-Based Change Detection for HTML/XML Documents

N. PandrangiContact Information, J. JacobContact Information, A. SankaContact Information and S. ChakravarthyContact Information

(6)  Information Technology Laboratory and Computer Science and Engineering Department, The University of Texas at Arlington, Arlington, TX, 76019
Abstract
With the exponential increase of information on the web, the emphasis has shifted from mere viewing of information to efficient retrieval and notification of selective information. Currently, users have to poll the pages manually to check for changes of interest, resulting in waste of resources and associated high cost. Hence, an efficient and effective change detection and notification mechanism is needed. WebVigiL, a general-purpose, active capability-based information monitoring and notification system, handles specification, management, and propagation of customized changes as requested by a user. The emphasis of change detection in WebVigiL is to detect customized changes on the document, based on user intent. In this paper, we propose two different algorithms to handle change detection to contents of semi-structured and unstructured documents. Though the approach taken is general, we will explain the change detection in the context of HTML (unstructured) and XML (semistructured) documents. We also provide a simple change presentation scheme to display the changes computed. We highlight the change detection in the context of WebVigiL and briefly describe the rest of the system.
This work was supported, in part, by the Office of Naval Research & the SPAWAR System Center-San Diego & by the Rome Laboratory grant F30602-01-2-05430, and by NSF grants IIS-0123730 and ITR 0121297.

Contact Information N. Pandrangi
Email: pandrang@cse.uta.edu

Contact Information J. Jacob
Email: jacob@cse.uta.edu

Contact Information A. Sanka
Email: asanka@cse.uta.edu

Contact Information S. Chakravarthy
Email: sharma@cse.uta.edu
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Remote Address: 38.107.191.105 • Server: mpweb21
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)