Description
This course explores the basic concepts of web data mining, including ranking, clustering, similarity, and classification, and basic methods to gather the content and structure of web data and the behavior of web users from web-based systems. It also covers methods for analyzing the gathered web data. To develop students' ability to critically evaluate recent research in web data mining, the course includes extensive paper reading assignments and team projects.
Instructor
Kyungbaek Kim
Office : Engineering Building #6, 715
Tel : +82-62-530-3438
Email : kyungbaekkim@chonnam.ac.kr
Office Hours : Tuesday 10:00am ~ 11:00am
Time and Location
Thur 09:00am-12:00pm, Engineering Building #6, 102
Main Text
DATA MINING THE WEB, Uncovering Patterns in Web Content, Structure, and Usage, by ZDRAVKO MARKOV and DANIEL T. LAROSE
Reference Texts
- Mining the Social Web, by Matthew A. Russell
Grading Policy
- Attendance : 10%
- Reading Assignments : 20%
- Tentatively Two papers per week : around 26 papers.
- Paper Presentations : 20%
- Tentatively two papers per person.
- Team Project : 30%
- Final Exam : 20%
Lecture Notes
- 0.Syllabus
- 1.Introduction
- 2.IR and Web Search
Lecture notes are accessible through the eClass of JNU portal.
Homeworks, Quiz, Midterm/Final Exam
All materials related to homework, quizzes, the midterm exam, and the final exam, including solutions, are accessible through the eClass of JNU portal.
Presentation Schedule and Reading Assignment
Submit a summary of each paper before the date of its presentation. Here is a template of the summary. Only the txt file format is allowed for the summary.
Team Project
Team Members | Subject of Team Project | Slide | Report |
Sungmin Hwang, Lingling Zhang, Rajashree Sokasane | Survey of Recommendation System : Collaborative Filtering | slides | report |
Hiep Tuan Nguyen Tri, Nam Hoai Nguyen | Survey on Web Structure Mining | slides | report |
Rischan Mafrur, Muhammad Fiqri Muthohar | Who are Tweeting in the 2014 Indonesia's Legislative Election? | slides | report |
Ngoc Do Luu, Nhat Quang Vo | Review of the web page classification approaches and applications | slides | report |