Title | A Divergence-Oriented Approach for Web Users Clustering |
Publication Type | Conference Paper |
Year of Publication | 2006 |
Authors | Petridou, Sophia G., Vassiliki A. Koutsonikola, Athena Vakali, and Georgios I. Papadimitriou |
Editor | Gavrilova, Marina L., Osvaldo Gervasi, Vipin Kumar, Chih Jeng Kenne Tan, David Taniar, Antonio LaganĂ, Youngsong Mun, and Hyunseung Choo |
Book Title | ICCSA (2) |
Publisher | Springer |
ISBN Number | 3-540-34072-6 |
Abstract | Clustering web users based on their access patterns is a quite significanttask in Web Usage Mining. Further to clustering it is important to evaluatethe resulted clusters in order to choose the best clustering for a particular framework.This paper examines the usage of Kullback-Leibler divergence, aninformation theoretic distance, in conjuction with the k-means clusteringalgorithm. It compares KL-divergence with other well known distance measures(Euclidean, Standardized Euclidean and Manhattan) and evaluates clusteringresults using both objective function’s value and Davies-Bouldin index.Since it is imperative to assess whether the results of a clustering process aresusceptible to noise, especially in noisy environments such as Web environment,our approach takes the impact of noise into account. The clusters obtainedwith KL approach seem to be superior to those obtained with the otherdistance measures in case our data have been corrupted by noise. |