Paper Accepted for Pervasive and Mobile Computing Journal
The Paper “newNECTAR: Collaborative active learning for knowledge-based probabilistic activity recognition” by Gabriele Civitarese, ClaudioBettini, TimoSztyler, DanieleRiboni, and HeinerStuckenschmidt has been accpeted for Pervasive and Mobile Computing (Impact Factor 2.974)
Paper Accepted at European Journal on Operational Research
The joint paper with the Chair of Logistics and Supply Chain Management “A Data-Driven Newsvendor Problem: From Data to Decision” by Jakob Huber, Sebastian Müller, Moritz Fleischmann and Heiner Stuckenschmidt has been accepted for the European Journal on Operational Research (Impact Factor 3.428).
Paper Accepted at IEEE Transactions on Smart Grid
The paper “Real-Time Smart Charging Based on Precomputed Schedules” by Oliver Frendo, Nadine Gärtner and Heiner Stuckenschmidt has been accepted by IEEE Transactions on Smart Grid (Impact Factor 7.364)
Article accepted at Computational Linguistics: Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction
The article “Watset: Local-Global Graph Clustering with Applications in Sense and Frame Induction” by Dmitry Ustalov, Alexander Panchenko, Chris Biemann, and Simone Paolo Ponzetto has been accepted for publication at the Computational Linguistics (CL) journal by MIT Press. Abstract: We present a ...
Article accepted at ACM TODS: A Unified Framework for Frequent Sequence Mining with Subsequence Constraints
The article „A Unified Framework for Frequent Sequence Mining with Subsequence Constraints“ by Kaustubh Beedkar, Rainer Gemulla und Wim Martens has been accepted for publication in ACM Transactions on Database Systems (TODS). Abstract: Frequent sequence mining methods often make use of constraints ...
Paper accepted at ICDE 2019: Scalable Frequent Sequence Mining With Flexible Subsequence Constraints
The paper „Scalable Frequent Sequence Mining With Flexible Subsequence Constraints“ by Alexander Renz-Wieland, Matthias Bertsch, and Rainer Gemulla has been accepted at the 2019 IEEE International Conference on Data Engineering (ICDE). Abstract: We study scalable algorithms for frequent sequence ...
31.5 billion quads Microdata, Embedded JSON-LD, RDFa, and Microformat data originating from 9.6 million websites published
The DWS group is happy to announce the new release of the WebDataCommons Microdata, JSON-LD, RDFa and Microformat data corpus. The data has been extracted from the November 2018 version of the Common Crawl covering 2.5 billion HTML pages which originate from 32 million websites (pay-level domains).
WDC Training Dataset and Gold Standard for Large-Scale Product Matching released
The research focus in the field of entity resolution (aka link discovery or duplicate detection) is moving from traditional symbolic matching methods to embeddings and deep neural network based matching. A problem with evaluating deep learning based matchers is that they are rather training data ...
Paper accepted at EDBT 2019
Our systems and applications paper Extending Cross-Domain Knowledge Bases with Long Tail Entities using Web Table Data (Yaser Oulabi, Christian Bizer) got accepted at the 22nd International Conference on Extending Database Technology (EDBT 2019), one of the top-tier conferences in the data ...
WInte.r Web Data Integration Framework Version 1.3 released
We are happy to announce the release of Version 1.3 of the Web Data Integration Framework (WInte.r). WInte.r is a Java framework for end-to-end data integration. The framework implements a wide variety of different methods for data pre-processing, schema matching, identity resolution, data fusion, ...