Product Matching Training Dataset presented at ECNLP Workshop at WWW2019
Anna Primpeli has presented today the Web Data Commons – Training Dataset and Gold Standard for Large-Scale Product Matching at the Workshop on e-Commerce and NLP held at The Web Conference (WWW2019) in San Francisco. Abstract A current research question in the area of entity resolution (also ...
Article accepted at Datenbank Spektrum
The article „Using the Semantic Web as a Source of Training Data“ by Christian Bizer, Anna Primpeli, Ralph Peeters has been accpeted for the upcoming special issue on “Data and Repeatability” of Datenbank Spektrum.
31.5 billion quads Microdata, Embedded JSON-LD, RDFa, and Microformat data originating from 9.6 million websites published
The DWS group is happy to announce the new release of the WebDataCommons Microdata, JSON-LD, RDFa and Microformat data corpus. The data has been extracted from the November 2018 version of the Common Crawl covering 2.5 billion HTML pages which originate from 32 million websites (pay-level domains).
WDC Training Dataset and Gold Standard for Large-Scale Product Matching released
The research focus in the field of entity resolution (aka link discovery or duplicate detection) is moving from traditional symbolic matching methods to embeddings and deep neural network based matching. A problem with evaluating deep learning based matchers is that they are rather training data ...
Paper accepted at EDBT 2019
Our systems and applications paper Extending Cross-Domain Knowledge Bases with Long Tail Entities using Web Table Data (Yaser Oulabi, Christian Bizer) got accepted at the 22nd International Conference on Extending Database Technology (EDBT 2019), one of the top-tier conferences in the data ...
WInte.r Web Data Integration Framework Version 1.3 released
We are happy to announce the release of Version 1.3 of the Web Data Integration Framework (WInte.r). WInte.r is a Java framework for end-to-end data integration. The framework implements a wide variety of different methods for data pre-processing, schema matching, identity resolution, data fusion, ...
Data Science Conference LWDA 2018 in Mannheim
The Data and Web Science Group is hosting the Data Science Conference LWDA 2018 in Mannheim on August 22–24, 2018. LWDA, which expands to „Lernen, Wissen, Daten, Analysen“ („Learning, Knowledge, Data, Analytics“), covers recent research in areas such as knowledge discovery, machine learning & data ...
Third Cohort of Students starts Part-time Master in Data Science
The third cohort consisting of 32 students has started their studies in the part-time master program in Data Science that professors of the DWS group offer together with the Hochschule Albstadt-Sigmaringen.
SWSA Ten-Year Award won by DBpedia Paper
We are happy to announce that Professor Christian Bizer has received the SWSA Ten-Year Award at the 16th International Semantic Web Conference (ISWC2017) in Vienna for the paper DBpedia: A Nucleus for a Web of Open Data that he co-authored in 2007.
RapidMiner Data Search Extension wins ESWC2016 Best Demo Award
We are happy to announce that our demonstration Extending RapidMiner with Data Search and Integration Capabilities did win the best demonstration award at the 13th European Semantic Web Conference (ESWC2016).