Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

A Markovian Approach for Web User Profiling and Clustering

Younes Hafri5, 6 Contact Information, Chabane DjerabaContact Information, Peter StanchevContact Information and Bruno BachimontContact Information

(5)  Institut National de l’Audiovisuel, 4 avenue de l’Europe, 94366 Bry-sur-Marne Cedex, France
(6)  Institut de Recherche en Informatique de Nantes, 2 rue de la Houssiniere, 43322 Nantes Cedex, France
(7)  Kettering University, Flint, MI 48504, USA
Abstract
The objective of this paper is to propose an approach that extracts automatically web user profiling based on user navigation paths. Web user profiling consists of the best representative behaviors, represented by Markov models (MM). To achieve this objective, our approach is articulated around three notions: (1) Applying probabilistic exploration using Markov models. (2) Avoiding the problem of Markov model high-dimensionality and sparsity by clustering web documents, based on their content, before applying the Markov analysis. (3) Clustering Markov models, and extraction of their gravity centers. On the basis of these three notions, the approach makes possible the prediction of future states to be visited in k steps and navigation sessions monitoring, based on both content and traversed paths. The original application of the approach concerns the exploitation of multimedia archives in the perspective of the Copyright Deposit that preserves French’s WWW documents. The approach may be the exploitation tool for any web site.

Contact Information Younes Hafri
Email: yhafri@ina.fr

Contact Information Chabane Djeraba
Email: djeraba@irin.univ-nantes.fr

Contact Information Peter Stanchev
Email: pstanchev@kettering.edu

Contact Information Bruno Bachimont
Email: bbachimont@ina.fr
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Remote Address: 38.107.191.105 • Server: mpweb07
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)