Building on the Data Mining fundamentals course, this course deepens the theory and practice of advanced data mining topics, such as:
The course consists of a lecture together with accompanying practical exercises as well as student team projects. In the exercises the participants will gather initial expertise in applying state of the art data mining tools on realistic data sets.
Like in the previous years, students enrolled in the course will participate in a larger data mining competition (details to be announced). In addition to the submission of an entry to the competition, the approaches and results of the project have to be compiled into a written project report, and presented in a plenary session.
Lecture:
We'll have two alternatives for the exercise:
Both exercises are equivalent, you are supposed to attend one.
Lecture slides will be made available here as the course progresses.
Week | Exercise (Monday) | Lecture (Tuesday) |
12.2. | -- | Introduction & Data Preprocessing |
19.2. | Introduction & Data Preprocessing | -- |
26.2. | -- | Ensembles |
4.3. | Ensembles | Time Series |
11.3. | Time Series | -- |
18.3. | -- | Neural Networks & Deep Learning |
25.3. | -- | Easter Break |
1.4. | -- | Easter Break |
8.4. | Neural Networks & Deep Learning | Anomaly Detection & Challenge Kick-off |
15.4. | Anomaly Detection | Challenge Session |
22.4. | -- | Hyperparameter Tuning |
29.4. | Hyperparameter Tuning | Challenge Session |
6.5. | -- | Model Verification |
13.5. | Model Verification | Challenge Session |
20.5. | -- | Challenge Session |
Deadlines for the challenge (see here):
Pang-Ning Tan, Michael Steinbach, Vipin Kumar: Introduction to Data Mining, Pearson.
Ian H. Witten, Eibe Frank, Mark A. Hall: Data Mining: Practical Machine Learning Tools and Techniques, 3rd Edition, Morgan Kaufmann.
Bing Liu: Web Data Mining, 2nd Edition, Springer.
Further literature on specific topics will be announced in the lecture.