Summer Semester 2009/10
Winter Semester 2011/12
Winter Semester 2012/13
Winter Semester 2013/14
Clustering algorithms in Web systems and e-business ITI1102d
Course content:
1.Systems of analysis of www data used in existing applications
2.Recommender systems. Collaborative filtering methods.
3.Clustering methods in recommender systems.
4.Partitioning and hierarchical clustering methods.
5.Clustering techniques: density-based, grid-based and model-based.
6.Data preprocessing. Similarity measures. Evaluation of results.
7.Web usage mining.
8.Adaptive WWW servers.
9.Clustering of html documents.
10.Search results recommendations.
11.Clustering of search results.
12.Data mining in analysis financial data.
13.User's reputation systems used in auctions services.
14.Social networks
15.Non-contextual and contextual rank of www pages
Learning outcomes:
Knowledge about new clustering techniques and collaborative filtering methods and their application in Web systems. Acquirement of ability to process and analysis WWW serwers log files.
(in Polish) Rodzaj przedmiotu
Course coordinators
Bibliography
a) basic references:
1) Z. Markov, D. T. Larose, Data mining the Web. Uncovering patterns in Web content, structure and usage. Wiley, 2007
2) D.T. Larose: Discovering Knowledge in Data, Wiley, 2004
3) Wikipedia: http://en.wikipedia.org
4) Google Research: http://research.google.com
5) G. Linden's Blog: http://glinden.blogspot.com/
b) supplementary references:
1) D.R., Greening, Data Mining on the Web, Web Techniques, 2000
2) M. Kłopotek, Intelligent search engines, Akademicka Oficyna Wydawnicza, Warszawa, 2001 (in Polish)