A Divergence-Oriented Approach for Web Users Clustering

TitleA Divergence-Oriented Approach for Web Users Clustering
Publication TypeConference Paper
Year of Publication2006
AuthorsPetridou, Sophia G., Vassiliki A. Koutsonikola, Athena Vakali, and Georgios I. Papadimitriou
EditorGavrilova, Marina L., Osvaldo Gervasi, Vipin Kumar, Chih Jeng Kenne Tan, David Taniar, Antonio LaganĂ, Youngsong Mun, and Hyunseung Choo
Book TitleICCSA (2)
ISBN Number3-540-34072-6

Clustering web users based on their access patterns is a quite significanttask in Web Usage Mining. Further to clustering it is important to evaluatethe resulted clusters in order to choose the best clustering for a particular framework.This paper examines the usage of Kullback-Leibler divergence, aninformation theoretic distance, in conjuction with the k-means clusteringalgorithm. It compares KL-divergence with other well known distance measures(Euclidean, Standardized Euclidean and Manhattan) and evaluates clusteringresults using both objective function’s value and Davies-Bouldin index.Since it is imperative to assess whether the results of a clustering process aresusceptible to noise, especially in noisy environments such as Web environment,our approach takes the impact of noise into account. The clusters obtainedwith KL approach seem to be superior to those obtained with the otherdistance measures in case our data have been corrupted by noise.

auth logo

Location & Contact

Department of Informatics
Aristotle University of Thessaloniki
Thessaloniki GR-54124

t  | (+30) 2310 998415
e | oswinds@csd.auth.gr