In English

Identity Bridging Cluster Website Visits using Model-based Clustering

Àsbjörn Hagalín Pétursson ; Rúnar Kristinsson
Göteborg : Chalmers tekniska högskola, 2015. 64 s.
[Examensarbete på avancerad nivå]

Model-based clustering is becoming increasingly popular with the rise in computational power. Cluster analysis is used in many disciplines, for example biology, geography, image analysis and marketing. In this thesis we developed a model-based unsupervised clustering method to cluster website visits into clusters that represent a unique Internet user.

As no ground truth exists we developed two evaluation methods to measure the quality of the clusters, one based on cluster content and size, and the other based on user behavior.

The model-based clustering method was compared with a simple deterministic clustering model, the results were very similar. With further development of the model-based clustering we believe that it can generate better clusters of website visits that likely represent a single user.

Nyckelord: Machine learning, Clustering, Model-based clustering, Website visits, Cross-device tracking, Probabilistic model.



Publikationen registrerades 2015-06-30. Den ändrades senast 2015-06-30

CPL ID: 219110

Detta är en tjänst från Chalmers bibliotek