DWS Area: Web-based Systems
44.2 billion quads of Microdata, Embedded JSON-LD, RDFa, and Microformat data originating from 11.9 million websites published
The DWS group is happy to announce the new release of the WebDataCommons Microdata, JSON-LD, RDFa and Microformat data corpus. The data has been extracted from the November 2019 version of the Common Crawl covering 2.4 billion HTML pages which originate from 32 million websites (pay-level domains).
Christian Bizer gives keynote at JIST 2019
Professor Bizer was invited to give the keynote speech at the 9th Joint International Semantic Technology Conference (JIST2019) in Hangzhou, China.
Christian Bizer wins SWSA Ten-Year Award
We are happy to announce that Professor Christian Bizer has received the SWSA Ten-Year Award at the 18th International Semantic Web Conference (ISWC2019) in Auckland, New Zealand.
WDC Product Data Corpus and Gold Standard for Large-Scale Product Matching Version 2.0 released
We are happy to announce the release of Version 2.0 of the Web Data Commons Product Data Corpus and Gold Standard for Large-Scale Product Matching. The product data corpus consists of 26 million product offers (16 million English language offers) originating from 79 thousand different e-shops. The ...
Oliver Lehmberg has successfully defended his PhD thesis
Oliver Lehmberg has successfully defended his PhD thesis on „Web Table Integration and Profiling for Knowledge Base Augmentation“ today.
Best Paper Award at WIMS 2019
The paper „Robust Active Learning of Expressive Linkage Rules“ by Anna Primpeli and Christian Bizer has won the best paper award at the 9th International Conference on Web Intelligence, Mining and Semantics (WIMS) in South Korea.
Christian Bizer gives keynote at LDK 2019
Prof. Christian Bizer gave the keynote speech at the Language Data and Knowledge (LDK 2019) conference in Leipzig, Germany.
Product Matching Training Dataset presented at ECNLP Workshop at WWW2019
Anna Primpeli today presented the Web Data Commons – Training Dataset and Gold Standard for Large-Scale Product Matching at the Workshop on e-Commerce and NLP held at The Web Conference (WWW2019) in San Francisco. Abstract: A current research question in the area of entity resolution (also ...
Article accepted at Datenbank Spektrum
The article „Using the Semantic Web as a Source of Training Data“ by Christian Bizer, Anna Primpeli, and Ralph Peeters has been accepted for the upcoming special issue on „Data and Repeatability“ of Datenbank Spektrum.
31.5 billion quads of Microdata, Embedded JSON-LD, RDFa, and Microformat data originating from 9.6 million websites published
The DWS group is happy to announce the new release of the WebDataCommons Microdata, JSON-LD, RDFa and Microformat data corpus. The data has been extracted from the November 2018 version of the Common Crawl covering 2.5 billion HTML pages which originate from 32 million websites (pay-level domains).