Paper accepted for DI2KG

The paper "Intermediate Training of BERT for Product Matching" by Ralph Peeters, Christian Bizer and Goran Glavaš has been accepted for the 2nd International Workshop on Challenges and Experiences from Data Integration to Knowledge Graphs (DI2KG) held in conjunction with VLDB 2020.

Paper accepted for CIKM

The paper "Profiling Entity Matching Benchmark Tasks" by Anna Primpeli and Christian Bizer has been accepted for the 29th International Conference on Information and Knowledge Management (CIKM) which will be held online this year.

Yaser Oulabi has successfully defended his PhD thesis

Yaser Oulabi has successfully defended his PhD thesis on „Augmenting Cross-Domain Knowledge Bases Using Web Tables“ today.

CfP: Benchmark Competition on Product Data Integration at ISWC 2020

Together with the University of Sheffield, we are organizing a benchmark competition on product data integration at the 19th International Semantic Web Conference (ISWC 2020). The competition consists of two tasks: Product Offer Matching and Product Classification. Submissions to both tasks are ...

44.2 billion quads Microdata, Embedded JSON-LD, RDFa, and Microformat data originating from 11.9 million websites published

The DWS group is happy to announce the new release of the WebDataCommons Microdata, JSON-LD, RDFa and Microformat data corpus. The data has been extracted from the November 2019 version of the Common Crawl covering 2.4 billion HTML pages which originate from 32 million websites (pay-level domains).

Christian Bizer gives keynote at JIST 2019

Professor Bizer was invited to give the keynote speech at the 9th Joint International Semantic Technology Conference (JIST2019) in Hangzhou, China.

Christian Bizer wins SWSA Ten-Year Award

We are happy to announce that Professor Christian Bizer has received the SWSA Ten-Year Award at the 18th International Semantic Web Conference (ISWC2019) in Aukland, New Zealand.

WDC Product Data Corpus and Gold Standard for Large-Scale Product Matching Version 2.0 released

We are happy to announce the release of Version 2.0 of the Web Data Commons Product Data Corpus and Gold Standard for Large-Scale Product Matching. The product data corpus consits of 26 million product offers (16 million English language offers) originating from 79 thousand different e-shops. The ...

Oliver Lehmberg has sucessfully defended his PhD thesis

Oliver Lehmberg has successfully defended his PhD thesis on „Web Table Integration and Profiling for Knowledge Base Augmentation“ today.

Best Paper Award at WIMS 2019

The paper "Robust Active Learning of Expressive Linkage Rules" by Anna Primpeli and Christian Bizer has won the best paper award at the 9th International Conference on Web Intelligence, Mining and Semantics (WIMS) in South Korea.