Description

This course explores the basic concepts of web data mining, including Ranking, Clustering, Similarity and Classification, and basic methods togather contents of web data, structures of web data and behaviors of web users from web based systems. Also, this course explores the analysis methods for the gathered web data. To improve the criticism of the recent research related to web data mining, extensive paper reading assignments and team projects are conducted.

Instructor

Kyungbaek Kim
Office : Engineering Building #6, 715
Tel : +82-62-530-3438
Email : kyungbaekkim@chonnam.ac.kr
Office Hours : Tuesday 10:00am ~ 11:00am

Time and Location

Thur 09:00am-12:00pm, Engineering Building #6, 102

Main Text

DATA MINING THE WEB, Uncovering Patterns in Web Content, Structure, and Usage, by ZDRAVKO MARKOV and DANIEL T. LAROSE

Reference Texts

Grading Policy

Lecture Notes

Lecture notes are accessible through the eClass of JNU portal.

Homeworks, Quiz, Midterm/Final Exam

All of the materials related to homeworks, quiz, midterm exam and final exam, including solutions, are accessible through the eClass of JNU portal.

Presentation Schedule and Reading Assignment

Submit the summary of papers before the date of presentation. Here is a template of summary. Only txt file format is allowed for the summary.
2013.March.20 Sungmin Hwang [Recommender Systems and OSN]_[From Amateurs to Connoisseurs-Modeling the Evolution of User Expertise through Online Reviews]
Rajashree Sokasane [Geoanalysis]_[Uncovering Locally Characterizing Regions within Geotagged Data]
2013.March.27 Lingling Zhang [geoanalysis]_[Using Stranger as Sensors temporal and geosensitive question answering via social media]
Hiep Tuan Nguyen Tri [Recommender Systems and OSN]_[How to Grow More Pairs_ Suggesting Review Targets For Comparison-Friendly Review Ecosystems]
2013.April.03 Nam Hoai Nguyen [Recommender Systems and OSN]_[TopRec Domain-Specific Recommendation through Community Topic Mining in Social Network]
Rischan Mafrur [User Behavior modeling]_[Anatomy of a Web-Scale Resale Market A Data Mining Approach]
2013.April.10 Muhammad Fiqri Muthohar [Web Security - Attacks and Defenses]_[The Role of Web Hosting Providers in Detecting Compromised Websites]
Nhat Quang Vo [Social Web UI]_[Perception and Understanding of Social Annotations in Web Search]
2013.April.17 Ngoc Do Luu [OSN Analysis and Characterization]_[Google+ or Google- Dissecting the Evolution of the New OSN in its First Year]
2013.May.08 Sungmin Hwang [Trust and Enterprise Social Networks]_[Mining Expertise and Interests from Social Media]
Rajashree Sokasane [Web-Mining iii]_[AMIE- Association Rule Mining under Incomplete Evidence]
2013.May.15 Lingling Zhang [log analysis]_ [from cookies to cooks insights on dietary patterns via analysis of web usage logs]
Hiep Tuan Nguyen Tri [Recommender Systems I]_[Is It Time For a Career Switch]
2013.May.22 Nam Hoai Nguyen [Content Analysis]_[No country for old members User lifecycle and linguistic change in online communities]
Rischan Mafrur [User Behavior modeling]_[Timespent Based Models for Predicting User Retention]
2013.May.29 Muhammad Fiqri Muthohar [Understanding and Combating Abuse]_[Two Years of Short URLs Internet Measurement_ Security Threats and Countermeasures]
Nhat Quang Vo [Social Web UI]_[Google+ Ripples A Native Visualization of Information Flow]
2013.June.05 Ngoc Do Luu [Web Mining]_[Mining Collective Intelligence in Groups]

Team Project

Team Members Subject of Team Project Slide Report
Sungmin Hwang, Lingling Zhang, Rajashree Sokasane Survey of Reccomendation System : Collaborative Filtering slides report
Hiep Tuan Nguyen Tri, Nam Hoai Nguyen Survey on Web Structure Mining slides report
Rischan Mafrur, Muhammad Fiqri Muthohar Who are Tweeting in the 2014 Indonesia's Legislative Election? slides report
Ngoc Do Luu, Nhat Quang Vo Review of the web page classification approaches and applications slides report