Former researcher at the Chair of Information Systems V: Web-based Systems of Prof. Dr. Christian Bizer, working on the topic of expanding cross-domain knowledge bases using structured web data. In my research, I focused on the tasks of Long-Tail Entity Extraction and Time-Dependent Data Fusion to augment knowledge bases such as DBpedia or Wikidata using web table corpora provided by the Web Data Commons Project.

Research Interests

  • Knowledge Base Completion
  • Web Table Data Consolidation
  • Long-Tail Entity Extraction
  • Time-Dependent Web Data Fusion
  • Weak Supervision

Published Datasets

Time-Dependent Ground-Truth Dataset
Ground truth of time-dependent data from various domains. The dataset allows the development and evaluation of methods dealing with time-dependent data.

Web Tables for Long-Tail Entity Extraction
Gold standard for evaluating the extraction of long-tail entities from web tables. The dataset was built with the motivation of augmenting a cross-domain knowledge base with previously unknown entities from web data.

Code & Projects

Extracting Long Tail Entities from Web Tables for Augmenting Cross-Domain Knowledge Bases
Code and instructions on replicating our research on the extraction of long-tail entities from web tables and the use of weak supervision approaches for this task.

Time-Aware Fusion for Web Table Data
Code and instructions on replicating our research on the fusion of time-dependent data from web tables.