Focus Group: Web-based Systems

(Prof. Bizer)

We explore technical and empirical questions concerning the development of global, decentralized information environments. Our current focus is the evolution of the World Wide Web from a medium for the publication of documents into a global dataspace. Our empirical work is accompanying this evolution by monitoring the adoption of Semantic Markup and Linked Data technologies on the Web. Our technical work focuses on integrating data from large numbers of Web data sources and includes topics such as information extraction, identity resolution, schema matching, data fusion, and data search. We apply the developed methods for the tasks of integrating product data from large numbers of e-shops as well as for creating large-scale knowledge bases such as DBpedia.

People

Current Team:

Alumni:

  • Dr. Yaser Oulabi (2020)
  • Dr. Oliver Lehmberg (2019)
  • Benedikt Kleppmann (2018)
  • Dr. Dominique Ritze (2017)
  • Petar Petrovski (2017)
  • Dr. Anna Lisa Gentile (2017)
  • Dr. Robert Meusel (2016)
  • Prof. Dr. Kai Eckert (2015)
  • Dr. Volha Bryl (2015)
  • Max Schlachtenberg (2014)
  • Dr. Robert Isele (2013)

Awards

Publications

2020

2019

2018

  • Bizer, C., Vidal, M.-E. und Skaf-Molli, H. (2018). Linked Open Data. In , Encyclopedia of Database Systems (S. 2096-2101). New York, NY: Springer.
  • Bizer, C., Vidal, M.-E. und Weiss, M. (2018). RDF Technology. In , Encyclopedia of Database Systems (S. 3106-3109). New York, NY: Springer.
  • Bizer, C., Vidal, M.-E. und Weiss, M. (2018). Resource Description Framework. In , Encyclopedia of Database Systems (S. 3221-3224). New York, NY: Springer.
  • Kleppmann, B., Bizer, C., Yaqub, E., Temme, F., Schlunder, P., Arnu, D. und Klinkenberg, R. (2018). Density- and correlation-based table extension. In , LWDA 2018 : Proceedings of the Conference „Lernen, Wissen, Daten, Analysen“ Mannheim, Germany, August 22-24, 2018 (S. 191-194). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P., Petrovski, P., Mika, P. und Paulheim, H. (2018). A machine learning approach for product matching and categorization. Semantic Web, 9, 707-728.

2017

2016

  • Auer, S., Heath, T., Bizer, C. und Berners-Lee, T. (2016). LDOW2016: 9th Workshop on Linked Data on the Web. In , Proceedings of the 25th International Conference on World Wide Web, WWW 2016, Montreal, Canada, April 11-15, 2016, Companion Volume (S. 1039-1040). , ACM: Geneva, Switzerland.
  • Basile, P., Caputo, A., Gentile, A. L. und Rizzo, G. (2016). Overview of the EVALITA 2016 Named Entity rEcognition and Linking in Italian Tweets (NEEL-IT) Task. In , Proceedings CLiC-it 2016 and EVALITA 2016 : Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016) (S. Paper 7, 1-8). CEUR Workshop Proceedings, RWTH: Aachen.
  • Bizer, C., Dong, L., Ilyas, I. und Vidal, M.-E. (2016). Editorial: Special issue on web data quality. Journal of Data and Information Quality : JDIQ, 8, 1:1-1:3.
  • Bryl, V., Bizer, C. und Paulheim, H. (2016). Gathering alternative surface forms for DBpedia entities. In , NLP & DBpedia 2015 : Proceedings of the Third NLP&DBpedia Workshop (NLP & DBpedia 2015) co-located with the 14th International Semantic Web Conference 2015 (ISWC 2015) Bethlehem, Pennsylvania, USA, October 11, 2015 (S. 13-24). CEUR Workshop Proceedings, RWTH: Aachen.
  • van Erp, M., Mendes, P. N., Paulheim, H., Ilievski, F., Plu, J., Rizzo, G. und Waitelonis, J. (2016). Evaluating entity linking: an analysis of current benchmark datasets and a roadmap for doing a better job. In , Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) : May 23-28, 2016, Portorož, Slovenia (S. 4373-4379). , European Language Resources Association, ELRA-ELDA: Paris.
  • Faralli, S., Bizer, C., Eckert, K., Meusel, R. und Ponzetto, S. P. (2016). A Web application to search a large repository of taxonomic relations from the Web. In , ISWC-P&D 2016 : Proceedings of the ISWC 2016 Posters & Demonstrations Track co-located with 15th International Semantic Web Conference (ISWC 2016) Kobe, Japan, October 19, 2016 (S. Paper 58). CEUR Workshop Proceedings, RWTH: Aachen.
  • Gentile, A. L., Kirstein, S., Paulheim, H. und Bizer, C. (2016). Extending RapidMiner with data search and integration capabilities. In , The Semantic Web : ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29 – June 2, 2016, Revised Selected Papers (S. 161-171). Lecture Notes in Computer Science, Springer: Cham.
  • Hertling, S., Schröder, M., Jilek, C. und Dengel, A. (2016). Top-k shortest paths in directed labeled multigraphs. In , Semantic web challenges : third SemWebEval Challenge at ESWC 2016, Heraklion, Crete, Greece, May 29 - June 2, 2016 : revised selected papers (S. 200-212). Communications in Computer and Information Science, Springer: Cham.
  • Huelss, J. und Paulheim, H. (2016). What SPARQL query logs tell and do not tell about semantic relatedness in LOD : or: the unsuccessful attempt to improve the browsing experience of DBpedia by exploiting query logs. In , The Semantic Web: ESWC 2015 Satellite Events : ESWC 2015 Satellite Events Portorož, Slovenia, May 31 – June 4, 2015, Revised Selected Papers (S. 297-308). Lecture Notes in Computer Science, Springer: Cham.
  • Lehmberg, O. und Bizer, C. (2016). Web table column categorisation and profiling. In , WebDB '16 : Proceedings of the 19th International Workshop on Web and Databases, San Francisco, CA, USA, June 26, 2016 : co-located with ACM SIGMOD 2016 (S. Article 4, 1-7). , ACM: New York, NY.
  • Lehmberg, O., Ritze, D., Meusel, R. und Bizer, C. (2016). A large public corpus of web tables containing time and context metadata. In , WWW '16 Companion : Proceedings of the 25th International Conference Companion on World Wide Web : Montreal, Canada, April 11 - 15, 2016 (S. 75-76). , ACM: New York, NY.
  • Melo, A., Paulheim, H. und Völker, J. (2016). Type prediction in RDF knowledge bases using hierarchical multilabel classification. In , Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics, WIMS 2016, Nîmes, France, June 13-15, 2016 (S. Article 14, 1-10). , ACM: New York, NY.
  • Müller, A. C. und Paulheim, H. (2016). Towards combining ontology matchers via anomaly detection. In , OM 2015 : Proceedings of the 10th International Workshop on Ontology Matching collocated with the 14th International Semantic Web Conference (ISWC 2015) Bethlehem, PA, USA, October 12, 2015 (S. 40-44). CEUR Workshop Proceedings, RWTH: Aachen.
  • Nuzzolese, A. G., Gentile, A. L., Presutti, V. und Gangemi, A. (2016). Conference Linked Data: the ScholarlyData project. In , The Semantic Web – ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17–21, 2016, Proceedings, Part II (S. 150-158). Lecture Notes in Computer Science, Springer: Cham.
  • Nuzzolese, A. G., Gentile, A. L., Presutti, V. und Gangemi, A. (2016). Semantic Web Conference ontology - a refactoring solution. In , The Semantic Web : ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29 – June 2, 2016, Revised Selected Papers (S. 84-87). Lecture Notes in Computer Science, Springer: Cham.
  • Oulabi, Y., Meusel, R. und Bizer, C. (2016). Fusing time-dependent web table data. In , WebDB '16 : Proceedings of the 19th International Workshop on Web and Databases, San Francisco, CA, USA, June 26, 2016 : co-located with ACM SIGMOD 2016 (S. Article 3, 1-7). , ACM: New York, NY.
  • Paulheim, H. (2016). 14th International Semantic Web Conference 2015 Bethlehem, PA, USA; October 11–15. Künstliche Intelligenz : KI ; Forschung, Entwicklung, Erfahrungen ; Organ des Fach­bereichs 1 Künstliche Intelligenz der Gesellschaft für Informatik e.V., GI / Fach­bereich 1 der Gesellschaft für Informatik e.V, 30, 207-208.
  • Paulheim, H. und Stuckenschmidt, H. (2016). Fast approximate A-box consistency checking using machine learning. In , The Semantic Web. Latest Advances and New Domains : 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 - June 2, 2016, Proceedings (S. 135-150). Lecture Notes in Computer Science, Springer International Publishing: Cham.
  • Paulheim, H. und Unger, C. (2016). Can predicate lexicalizations help in named entity disambiguation? In , NLP & DBpedia 2015 : Proceedings of the Third NLP&DBpedia Workshop (NLP & DBpedia 2015) co-located with the 14th International Semantic Web Conference 2015 (ISWC 2015) Bethlehem, Pennsylvania, USA, October 11, 2015 (S. 92-97). CEUR Workshop Proceedings, RWTH: Aachen.
  • Petrovski, P. und Gentile, A. L. (2016). Can you judge a music album by its cover? In , Know(at)LOD&CoDeS 2016 : Joint Proceedings of the 5th Workshop on Data Mining and Knowledge Discovery meets Linked Open Data and the 1st International Workshop on Completing and Debugging the Semantic Web (Know(at)LOD-2016, CoDeS-2016) ... with 13th ESWC 2016 (S. 1-4). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P. und Mika, P. (2016). Enriching product ads with Metadata from HTML annotations. In , The Semantic Web. Latest Advances and New Domains : 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 - June 2, 2016, Proceedings (S. 151-167). Lecture Notes in Computer Science, Springer: Cham.
  • Ristoski, P. und Paulheim, H. (2016). Analyzing statistics with background knowledge from Linked Open Data. In , SemStats 2013 : Proceedings of the 1st International Workshop on Semantic Statistics co-located with 13th International Semantic Web Conference (ISWC 2013), Sydney, Australia, October 11th, 2013 (S. Article 12). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P. und Paulheim, H. (2016). RDF2Vec: RDF graph embeddings for data mining. In , The Semantic Web - ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17-21, 2016, Proceedings, Part I (S. 498-514). Lecture Notes in Computer Science, Springer International Publishing: Cham.
  • Ristoski, P. und Paulheim, H. (2016). Semantic Web in data mining and knowledge discovery: a comprehensive survey. Web Semantics, 36, 1-22.
  • Ristoski, P., Paulheim, H., Svatek, V. und Zeman, V. (2016). The Linked Data Mining Challenge 2016. In , Know(at)LOD&CoDeS 2016 : Joint Proceedings of the 5th Workshop on Data Mining and Knowledge Discovery meets Linked Open Data and 1st International Workshop on Completing and Debugging the Semantic Web ...with 13th ESWC 2016, Heraklion, Greece, May 30th 2016 (S. 1-8). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P., de Vries, G. und Paulheim, H. (2016). A collection of benchmark datasets for systematic evaluations of machine learning on the Semantic Web. In , The Semantic Web - ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17-21, 2016, Proceedings, Part II (S. 186-194). Lecture Notes in Computer Science, Springer: Cham.
  • Ritze, D., Lehmberg, O., Oulabi, Y. und Bizer, C. (2016). Profiling the potential of web tables for augmenting cross-domain knowledge bases. In , Proceedings of the 25th International Conference on World Wide Web, WWW 2016, Montreal, Canada, April 11 - 15, 2016 (S. 251-261). , ACM: Geneva, Switzerland.
  • Rosati, J., Ristoski, P., Di Noia, T., de Leone, R. und Paulheim, H. (2016). RDF graph embeddings for content-based recommender systems. In , CBRecSys 2016 : Proceedings of the 3rd Workshop on New Trends in Content-Based Recommender Systems co-located with ACM Conference on Recommender Systems (RecSys 2016) Boston, MA, USA, September 16, 2016 (S. 23-30). CEUR Workshop Proceedings, RWTH: Aachen.
  • Seitner, J., Bizer, C., Eckert, K., Faralli, S., Meusel, R., Paulheim, H. und Ponzetto, S. P. (2016). A large DataBase of hypernymy relations extracted from the Web. In , Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) : May 23-28, 2016, Portorož, Slovenia (S. 360-367). , European Language Resources Association, ELRA-ELDA: Paris.

2015

2014

2013

2012

2011

2010