Data Mining (HWS2024)

The course provides an introduction to advanced data analysis techniques as a basis for analyzing business data and providing input for decision support systems. The course will cover the following topics:

  • The Data Mining Process
  • Data Representation and Preprocessing
  • Clustering
  • Classification
  • Regression
  • Association Analysis

The course consists of a lecture together with accompanying practical exercises as well as student team projects.  In the exercises the participants will gather initial expertise in applying state of the art data mining libraries on realistic data sets. The team projects take place in the last third of the term. Within the projects, groups of students realize more sophisticated data mining projects of personal choice and report about the results of their projects in the form of a written report as well as an oral presentation.

Exam Review

The exam review for HWS2024 will take place on Wednesday, 30 January 2025, starting from 13:00.

You have to register for the exam review by writing a mail to Ezgi Yilmaz  until Monday, 27 January 2025. 

Outline and Course Materials

WeekMonday (Lecture)Thursday (Exercise)
02.09.2024

no lecture

Introduction to Python (13:45–15:15)
09.09.2024Introduction to Data Mining (PDF, 3 MB)Intro
16.09.2024Classification 1 (PDF, 2 MB)Classification 1
23.09.2024Classification 2 (PDF, 2 MB)Classification 2
30.09.2024Introduction to the student projects (see Ilias)Public holiday
07.10.2024Regression (PDF, 2 MB) Regression
14.10.2024 Preprocessing (PDF, 2 MB)Preprocessing
21.10.2024Feedback on project outlinesProject Work
28.10.2024Clustering and Anomalies (PDF, 2 MB)Clustering
04.11.2024Association Analysis and Subgroup Discovery (PDF, 1 MB)Association Analysis
11.11.2024Project feedback sessionProject Work
18.11.2024Project feedback sessionProject Work
25.11.2024Project feedback sessionProject Work
02.12.2024Project PresentationsProject Presentations

Important dates for the student projects:

  • Sunday, October, 13th, 23:59: Submission of project outlines
  • Sunday, December 8th, 23:59: Submission of final project reports

For all students which are not familiar with Python/Jupyter Notebooks, we offer an introduction on Thursday, 5 September 2024 between 13:45 and 15:15 in exercise room D007 (2)  Building B6,27 Part D (in the backyard of  B6, 23 Part A).