Photo credit: Anna Logue

Focus Group: Web-based Systems

(Prof. Bizer)

We explore technical and empirical questions concerning the development of global, decentralized information environments. Our current focus is the evolution of the World Wide Web from a medium for the publication of documents into a global dataspace. Our empirical work is accompanying this evolution by monitoring the adoption of Semantic Markup and Linked Data technologies on the Web. Our technical work focuses on integrating data from large numbers of Web data sources and includes topics such as information extraction, identity resolution, schema matching, data fusion, and data search. We apply the developed methods for the tasks of integrating product data from large numbers of e-shops as well as for creating large-scale knowledge bases such as DBpedia.

People

Current Team:

Alumni:

  • Benedikt Kleppmann (2018)
  • Dr. Dominique Ritze (2017)
  • Petar Petrovski (2017)
  • Dr. Anna Lisa Gentile (2017)
  • Dr. Robert Meusel (2016)
  • Prof. Dr. Kai Eckert (2015)
  • Dr. Volha Bryl (2015)
  • Max Schlachtenberg (2014)
  • Dr. Robert Isele (2013)

Awards

  • SWSA Ten-Year Award at International Semantic Web Conference 2017 
  • Best Demo Award at Extended Semantic Web Conference 2016
  • 7 Years Best Paper Award at Extended Semantic Web Conference 2016
  • Yahoo Faculty Research and Engagement Program (FREP) Award 2015
  • Semantic Web Challenge 2014 - Winner of the Open Track
  • Semantic Web Challenge 2014 - Winner of the Big Data Track
  • Semantic Web Journal - Outstanding Paper Award 2014

Publications

2019

2018

  • Bizer, C., Vidal, M.-E. and Skaf-Molli, H. (2018). Linked Open Data. In Liu, L., Encyclopedia of Database Systems (S. 2096-2101). New York, NY: Springer.
  • Bizer, C., Vidal, M.-E. and Weiss, M. (2018). RDF Technology. In Liu, L., Encyclopedia of Database Systems (S. 3106-3109). New York, NY: Springer.
  • Bizer, C., Vidal, M.-E. and Weiss, M. (2018). Resource Description Framework. In Liu, L., Encyclopedia of Database Systems (S. 3221-3224). New York, NY: Springer.
  • Kleppmann, B., Bizer, C., Yaqub, E., Temme, F., Schlunder, P., Arnu, D. and Klinkenberg, R. (2018). Density- and correlation-based table extension. In Gemulla, R., LWDA 2018 : Proceedings of the Conference „Lernen, Wissen, Daten, Analysen“ Mannheim, Germany, August 22-24, 2018 (S. 191-194). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P., Petrovski, P., Mika, P. and Paulheim, H. (2018). A machine learning approach for product matching and categorization. Semantic Web, 9, 707-728.

2017

2016

  • Auer, S., Heath, T., Bizer, C. and Berners-Lee, T. (2016). LDOW2016: 9th Workshop on Linked Data on the Web. In Bourdeau, J., Proceedings of the 25th International Conference on World Wide Web, WWW 2016, Montreal, Canada, April 11-15, 2016, Companion Volume (S. 1039-1040). , ACM: Geneva, Switzerland.
  • Basile, P., Caputo, A., Gentile, A. L. and Rizzo, G. (2016). Overview of the EVALITA 2016 Named Entity rEcognition and Linking in Italian Tweets (NEEL-IT) Task. In Basile, P., Proceedings CLiC-it 2016 and EVALITA 2016 : Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016) (S. Paper 7, 1-8). CEUR Workshop Proceedings, RWTH: Aachen.
  • Bizer, C., Dong, L., Ilyas, I. and Vidal, M.-E. (2016). Editorial: Special issue on web data quality. Journal of Data and Information Quality : JDIQ, 8, 1:1-1:3.
  • Bryl, V., Bizer, C. and Paulheim, H. (2016). Gathering alternative surface forms for DBpedia entities. In Paulheim, H., NLP & DBpedia 2015 : Proceedings of the Third NLP&DBpedia Workshop (NLP & DBpedia 2015) co-located with the 14th International Semantic Web Conference 2015 (ISWC 2015) Bethlehem, Pennsylvania, USA, October 11, 2015 (S. 13-24). CEUR Workshop Proceedings, RWTH: Aachen.
  • van Erp, M., Mendes, P. N., Paulheim, H., Ilievski, F., Plu, J., Rizzo, G. and Waitelonis, J. (2016). Evaluating entity linking: an analysis of current benchmark datasets and a roadmap for doing a better job. In Calzolari, N., Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) : May 23-28, 2016, Portorož, Slovenia (S. 4373-4379). , European Language Resources Association, ELRA-ELDA: Paris.
  • Faralli, S., Bizer, C., Eckert, K., Meusel, R. and Ponzetto, S. P. (2016). A Web application to search a large repository of taxonomic relations from the Web. In Kawamura, T., ISWC-P&D 2016 : Proceedings of the ISWC 2016 Posters & Demonstrations Track co-located with 15th International Semantic Web Conference (ISWC 2016) Kobe, Japan, October 19, 2016 (S. Paper 58). CEUR Workshop Proceedings, RWTH: Aachen.
  • Gentile, A. L., Kirstein, S., Paulheim, H. and Bizer, C. (2016). Extending RapidMiner with data search and integration capabilities. In Sack, H., The Semantic Web : ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29 – June 2, 2016, Revised Selected Papers (S. 161-171). Lecture Notes in Computer Science, Springer: Cham.
  • Hertling, S., Schröder, M., Jilek, C. and Dengel, A. (2016). Top-k shortest paths in directed labeled multigraphs. In Sack, H., Semantic web challenges : third SemWebEval Challenge at ESWC 2016, Heraklion, Crete, Greece, May 29 - June 2, 2016 : revised selected papers (S. 200-212). Communications in Computer and Information Science, Springer: Cham.
  • Huelss, J. and Paulheim, H. (2016). What SPARQL query logs tell and do not tell about semantic relatedness in LOD : or: the unsuccessful attempt to improve the browsing experience of DBpedia by exploiting query logs. In Gandon, F., The Semantic Web: ESWC 2015 Satellite Events : ESWC 2015 Satellite Events Portorož, Slovenia, May 31 – June 4, 2015, Revised Selected Papers (S. 297-308). Lecture Notes in Computer Science, Springer: Cham.
  • Lehmberg, O. and Bizer, C. (2016). Web table column categorisation and profiling. In , WebDB '16 : Proceedings of the 19th International Workshop on Web and Databases, San Francisco, CA, USA, June 26, 2016 : co-located with ACM SIGMOD 2016 (S. Article 4, 1-7). , ACM: New York, NY.
  • Lehmberg, O., Ritze, D., Meusel, R. and Bizer, C. (2016). A large public corpus of web tables containing time and context metadata. In Bourdeau, J., WWW '16 Companion : Proceedings of the 25th International Conference Companion on World Wide Web : Montreal, Canada, April 11 - 15, 2016 (S. 75-76). , ACM: New York, NY.
  • Melo, A., Paulheim, H. and Völker, J. (2016). Type prediction in RDF knowledge bases using hierarchical multilabel classification. In Akerkar, R., Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics, WIMS 2016, Nîmes, France, June 13-15, 2016 (S. Article 14, 1-10). , ACM: New York, NY.
  • Müller, A. C. and Paulheim, H. (2016). Towards combining ontology matchers via anomaly detection. In Shvaiko, P., OM 2015 : Proceedings of the 10th International Workshop on Ontology Matching collocated with the 14th International Semantic Web Conference (ISWC 2015) Bethlehem, PA, USA, October 12, 2015 (S. 40-44). CEUR Workshop Proceedings, RWTH: Aachen.
  • Nuzzolese, A. G., Gentile, A. L., Presutti, V. and Gangemi, A. (2016). Conference Linked Data: the ScholarlyData project. In Groth, P., The Semantic Web – ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17–21, 2016, Proceedings, Part II (S. 150-158). Lecture Notes in Computer Science, Springer: Cham.
  • Nuzzolese, A. G., Gentile, A. L., Presutti, V. and Gangemi, A. (2016). Semantic Web Conference ontology - a refactoring solution. In Sack, H., The Semantic Web : ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29 – June 2, 2016, Revised Selected Papers (S. 84-87). Lecture Notes in Computer Science, Springer: Cham.
  • Oulabi, Y., Meusel, R. and Bizer, C. (2016). Fusing time-dependent web table data. In , WebDB '16 : Proceedings of the 19th International Workshop on Web and Databases, San Francisco, CA, USA, June 26, 2016 : co-located with ACM SIGMOD 2016 (S. Article 3, 1-7). , ACM: New York, NY.
  • Paulheim, H. (2016). 14th International Semantic Web Conference 2015 Bethlehem, PA, USA; October 11–15. Künstliche Intelligenz : KI ; Forschung, Entwicklung, Erfahrungen ; Organ des Fach­bereichs 1 Künstliche Intelligenz der Gesellschaft für Informatik e.V., GI / Fach­bereich 1 der Gesellschaft für Informatik e.V, 30, 207-208.
  • Paulheim, H. and Stuckenschmidt, H. (2016). Fast approximate A-box consistency checking using machine learning. In Sack, H., The Semantic Web. Latest Advances and New Domains : 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 - June 2, 2016, Proceedings (S. 135-150). Lecture Notes in Computer Science, Springer International Publishing: Cham.
  • Paulheim, H. and Unger, C. (2016). Can predicate lexicalizations help in named entity disambiguation? In Paulheim, H., NLP & DBpedia 2015 : Proceedings of the Third NLP&DBpedia Workshop (NLP & DBpedia 2015) co-located with the 14th International Semantic Web Conference 2015 (ISWC 2015) Bethlehem, Pennsylvania, USA, October 11, 2015 (S. 92-97). CEUR Workshop Proceedings, RWTH: Aachen.
  • Petrovski, P. and Gentile, A. L. (2016). Can you judge a music album by its cover? In Paulheim, H., Know@LOD&CoDeS 2016 : Joint Proceedings of the 5th Workshop on Data Mining and Knowledge Discovery meets Linked Open Data and the 1st International Workshop on Completing and Debugging the Semantic Web (Know@LOD-2016, CoDeS-2016) ... with 13th ESWC 2016 (S. 1-4). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P. and Mika, P. (2016). Enriching product ads with Metadata from HTML annotations. In Sack, H., The Semantic Web. Latest Advances and New Domains : 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 - June 2, 2016, Proceedings (S. 151-167). Lecture Notes in Computer Science, Springer: Cham.
  • Ristoski, P. and Paulheim, H. (2016). Analyzing statistics with background knowledge from Linked Open Data. In Capadisli, S., SemStats 2013 : Proceedings of the 1st International Workshop on Semantic Statistics co-located with 13th International Semantic Web Conference (ISWC 2013), Sydney, Australia, October 11th, 2013 (S. Article 12). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P. and Paulheim, H. (2016). RDF2Vec: RDF graph embeddings for data mining. In Groth, P., The Semantic Web - ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17-21, 2016, Proceedings, Part I (S. 498-514). Lecture Notes in Computer Science, Springer International Publishing: Cham.
  • Ristoski, P. and Paulheim, H. (2016). Semantic Web in data mining and knowledge discovery: a comprehensive survey. Web Semantics, 36, 1-22.
  • Ristoski, P., Paulheim, H., Svatek, V. and Zeman, V. (2016). The Linked Data Mining Challenge 2016. In Paulheim, H., Know@LOD&CoDeS 2016 : Joint Proceedings of the 5th Workshop on Data Mining and Knowledge Discovery meets Linked Open Data and 1st International Workshop on Completing and Debugging the Semantic Web ...with 13th ESWC 2016, Heraklion, Greece, May 30th 2016 (S. 1-8). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P., de Vries, G. and Paulheim, H. (2016). A collection of benchmark datasets for systematic evaluations of machine learning on the Semantic Web. In Groth, P., The Semantic Web - ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17-21, 2016, Proceedings, Part II (S. 186-194). Lecture Notes in Computer Science, Springer: Cham.
  • Ritze, D., Lehmberg, O., Oulabi, Y. and Bizer, C. (2016). Profiling the potential of web tables for augmenting cross-domain knowledge bases. In Bourdeau, J., Proceedings of the 25th International Conference on World Wide Web, WWW 2016, Montreal, Canada, April 11 - 15, 2016 (S. 251-261). , ACM: Geneva, Switzerland.
  • Rosati, J., Ristoski, P., Di Noia, T., de Leone, R. and Paulheim, H. (2016). RDF graph embeddings for content-based recommender systems. In Bogers, T., CBRecSys 2016 : Proceedings of the 3rd Workshop on New Trends in Content-Based Recommender Systems co-located with ACM Conference on Recommender Systems (RecSys 2016) Boston, MA, USA, September 16, 2016 (S. 23-30). CEUR Workshop Proceedings, RWTH: Aachen.
  • Seitner, J., Bizer, C., Eckert, K., Faralli, S., Meusel, R., Paulheim, H. and Ponzetto, S. P. (2016). A large DataBase of hypernymy relations extracted from the Web. In Calzolari, N., Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) : May 23-28, 2016, Portorož, Slovenia (S. 360-367). , European Language Resources Association, ELRA-ELDA: Paris.

2015

2014

2013

2012

  • Aguirre, J. L., Eckert, K., Euzenat, J., Ferrara, A., van Hage, W., Hollink, L., Meilicke, C., Nikolov, A., Ritze, D., Scharffe, F., Shvaiko, P., Svab-Zamazal, O., Trojahn, C., Jiménez-Ruiz, E., Grau, B. C. and Zapilko, B. (2012). Results of the Ontology Alignment Evaluation Initiative 2012. In Shvaiko, P., OM-2012 Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012) Boston, MA, USA, November 11, 2012 (S. 1-43). CEUR Workshop Proceedings, RWTH: Aachen.
  • Boland, K., Ritze, D., Eckert, K. and Mathiak, B. (2012). Identifying References to Datasets in Publications. In Zaphiris, P., Theory and Practice of Digital Libraries : Second International Conference, TPDL 2012, Paphos, Cyprus, September 23-27, 2012. Proceedings (S. 150-161). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
  • Dang, T. T., Gabriel, A., Hertling, S., Roskosch, P., Wlotzka, M., Zilke, J. R., Janssen, F. and Paulheim, H. (2012). HotMatch results for OEAI 2012. In , Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012) (S. 145-151). CEUR Workshop Proceedings, RWTH: Aachen.
  • Eckert, K. (2012). Metadata Provenance in Europeana and the Semantic Web. Berlin: Humboldt-Univ.
  • Efremov, M., Zdraveski, V., Ristoski, P. and Trajanov, D. (2012). Semantic Stored Procedures Programming Environment and Performance Analysis. In Kocarev, L., ICT innovations 2011 (S. 357-366). Advances in Intelligent and Soft Computing, Springer: Berlin [u.a.].
  • Heath, T. and Bizer, C. (2012). Web de données : Méthodes et outils pour les données liées. Paris: Pearson.
  • Hertling, S. (2012). Hertuda results for OEAI 2012. In Shvaiko, P., OM 2012 : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012), Boston, MA, USA, November 11, 2012 (S. 141-144). CEUR Workshop Proceedings, RWTH: Aachen.
  • Hertling, S. and Paulheim, H. (2012). WikiMatch - Using Wikipedia for Ontology Matching. In , Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012) (S. 37-48). CEUR Workshop Proceedings, RWTH: Aachen.
  • Hertling, S. and Paulheim, H. (2012). WikiMatch results for OEAI 2012. In , Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012) (S. 220-225). CEUR Workshop Proceedings, RWTH: Aachen.
  • Hienert, D., Wegener, D. and Paulheim, H. (2012). Automatic Classification and Relations­hip Extraction for Multi-Lingual and Multi-Granular Events from Wikipedia. In Erp, M., DeRiVE 2012 : Proceedings of the Workhop on Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE 2012); Workshop in conjunction with the 11th International Semantic Web Conference 2012 (ISWC 2012) (S. 1-10). CEUR Workshop Proceedings, RWTH: Aachen.
  • Isele, R. and Bizer, C. (2012). Learning Expressive Linkage Rules using Genetic Programming. Proceedings of the VLDB Endowment, 5, 1638-1649.
  • Paulheim, H. (2012). Browsing Linked Open Data with Auto Complete. 10th Semantic Web Challenge 2012 at the 11th International Semantic Web Conference, Boston, Mass..
  • Paulheim, H. (2012). Explain-a-LOD: Using Linked Open Data for Interpreting Statistics. In Duarte, C., IUI'12 : proceedings of the 17th International Conference on Intelligent User Interfaces; February 14 - 17, 2012, Lisbon, Portugal (S. 313-314). , ACM: [New York, NY].
  • Paulheim, H. (2012). Generating Possible Interpretations for Statistics from Linked Open Data. In Simperl, E., The Semantic Web: Research and Applications : 9th Extended Semantic Web Conference, ESWC 2012, Heraklion, Crete, Greece, May 27-31, 2012. Proceedings (S. 560-574). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
  • Paulheim, H. (2012). Ontologie­basierte Applikations­integration auf Nutzerschnittstellenebene. In Hölldobler, S. Ausgezeichnete Informatikdissertationen 2011 (S. 133-142). Bonn: Ges. für Informatik.
  • Paulheim, H. (2012). WeSeE-Match results for OEAI 2012. In , Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012) (S. 213-219). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. and Fürnkranz, J. (2012). Unsupervised Generation of Data Mining Features from Linked Open Data. In Burdescu, D., WIMS '12 : Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics (S. Article 31). , ACM: New York, NY.
  • Paulheim, H., Oberle, D., Plendl, R. and Probst, F. (2012). An Architecture for Information Exchange based on Reference Models. In Sloane, A., Software Language Engineering : 4th international conference; revised selected papers (S. 160-179). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
  • Paulheim, H. and Pan, J. Z. (2012). Why the Semantic Web Should Become More Imprecise. In , What will the Semantic Web look like 10 years from now? In conjunction with the 11th International Semantic Web Conference 2012 (ISWC 2012) (S. ). , Univ. of Calif., Dept. of Geography: Santa Barbara, Calif..
  • Paulheim, H. and Probst, F. (2012). Ontology-Enhanced User Interfaces: A Survey. In Sheth, A. Semantic-Enabled Advancements on the Web : Applications Across Industries (S. 214-238). Hershey, PA: IGI Global.
  • Ristoski, P., Efremov, M., Zdraveski, V. and Trajanov, D. (2012). JDeveloper 11g R2 Jena Adapter Extension. In Bakeva, V., Proceedings of the Ninth International Conference on Informatics and Information Technology, CIIT 2012, April 19 - 22, 2012, Molika, Bitola, Macedonia (S. 1-5). , Univ. „Ss. Cyril and Methodius“, Fac. of Computer Science and Engineering: Skopje.
  • Ritze, D. and Eckert, K. (2012). Thesaurus mapping: a challenge for ontology alignment? In Shvaiko, P., OM-2012 Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012), Boston, MA, USA, November 11, 2012 (S. 248-249). CEUR Workshop Proceedings, RWTH: Aachen.
  • Schulz, A. and Paulheim, H. (2012). Combining Government and Linked Open Data in Emergency Management. AI Mashup Challenge 2012 colocated with 9th Extended Semantic Web Conference (ESWC 2012), Heraklion, Greece.
  • Schulz, A., Paulheim, H. and Probst, F. (2012). Crisis Information Management in the Web 3.0 Age. In Rothkrantz, L., ISCRAM 2012 Conference Proceedings : book of papers; 9 th International Conference on Information Systems for Crisis Response and Management (S. ID: 160). , Simon Fraser Univ.: Vancouver.
  • Seeliger, A. and Paulheim, H. (2012). A Semantic Browser for Linked Open Data. 10th Semantic Web Challenge 2012 at the 11th International Semantic Web Conference in Boston, USA, Boston, Mass..
  • Trajanov, D., Stojanov, R., Jovanovik, M., Zdraveski, V., Ristoski, P., Georgiev, M. and Filiposka, S. (2012). Semantic Sky A Platform for Cloud Service Integration based on Semantic Web Technologies. In Presutti, V., I-SEMANTICS 2012 : proceedings of the 8th International Conference on Semantic Systems, September 5-7 2012, Graz, Austria (S. 109-116). , ACM Press: New York, NY.

2011

2010

2009