Focus Group: Web-based Systems

(Prof. Bizer)

We explore technical and empirical questions concerning the development of global, decentralized information environments. Our current focus is the evolution of the World Wide Web from a medium for the publication of documents into a global dataspace. Our empirical work is accompanying this evolution by monitoring the adoption of Semantic Markup and Linked Data technologies on the Web. Our technical work focuses on integrating data from large numbers of Web data sources and includes topics such as information extraction, identity resolution, schema matching, data fusion, and data search. We apply the developed methods for the tasks of integrating product data from large numbers of e-shops as well as for creating large-scale knowledge bases such as DBpedia.

People

Current Team:

Alumni:

  • Dr. Yaser Oulabi (2020)
  • Dr. Oliver Lehmberg (2019)
  • Benedikt Kleppmann (2018)
  • Dr. Dominique Ritze (2017)
  • Petar Petrovski (2017)
  • Dr. Anna Lisa Gentile (2017)
  • Dr. Robert Meusel (2016)
  • Prof. Dr. Kai Eckert (2015)
  • Dr. Volha Bryl (2015)
  • Max Schlachtenberg (2014)
  • Dr. Robert Isele (2013)

Awards

Publications

2020

2019

2018

  • Bizer, C., Vidal, M.-E. und Skaf-Molli, H. (2018). Linked Open Data. In Liu, L., Encyclopedia of Database Systems (S. 2096-2101). New York, NY: Springer.
  • Bizer, C., Vidal, M.-E. und Weiss, M. (2018). RDF Technology. In Liu, L., Encyclopedia of Database Systems (S. 3106-3109). New York, NY: Springer.
  • Bizer, C., Vidal, M.-E. und Weiss, M. (2018). Resource Description Framework. In Liu, L., Encyclopedia of Database Systems (S. 3221-3224). New York, NY: Springer.
  • Kleppmann, B., Bizer, C., Yaqub, E., Temme, F., Schlunder, P., Arnu, D. und Klinkenberg, R. (2018). Density- and correlation-based table extension. In Gemulla, R., LWDA 2018 : Proceedings of the Conference „Lernen, Wissen, Daten, Analysen“ Mannheim, Germany, August 22-24, 2018 (S. 191-194). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P., Petrovski, P., Mika, P. und Paulheim, H. (2018). A machine learning approach for product matching and categorization. Semantic Web, 9, 707-728.

2017

2016

  • Auer, S., Heath, T., Bizer, C. und Berners-Lee, T. (2016). LDOW2016: 9th Workshop on Linked Data on the Web. In Bourdeau, J., Proceedings of the 25th International Conference on World Wide Web, WWW 2016, Montreal, Canada, April 11-15, 2016, Companion Volume (S. 1039-1040). , ACM: Geneva, Switzerland.
  • Basile, P., Caputo, A., Gentile, A. L. und Rizzo, G. (2016). Overview of the EVALITA 2016 Named Entity rEcognition and Linking in Italian Tweets (NEEL-IT) Task. In Basile, P., Proceedings CLiC-it 2016 and EVALITA 2016 : Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016) (S. Paper 7, 1-8). CEUR Workshop Proceedings, RWTH: Aachen.
  • Bizer, C., Dong, L., Ilyas, I. und Vidal, M.-E. (2016). Editorial: Special issue on web data quality. Journal of Data and Information Quality : JDIQ, 8, 1:1-1:3.
  • Bryl, V., Bizer, C. und Paulheim, H. (2016). Gathering alternative surface forms for DBpedia entities. In Paulheim, H., NLP & DBpedia 2015 : Proceedings of the Third NLP&DBpedia Workshop (NLP & DBpedia 2015) co-located with the 14th International Semantic Web Conference 2015 (ISWC 2015) Bethlehem, Pennsylvania, USA, October 11, 2015 (S. 13-24). CEUR Workshop Proceedings, RWTH: Aachen.
  • van Erp, M., Mendes, P. N., Paulheim, H., Ilievski, F., Plu, J., Rizzo, G. und Waitelonis, J. (2016). Evaluating entity linking: an analysis of current benchmark datasets and a roadmap for doing a better job. In Calzolari, N., Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) : May 23-28, 2016, Portorož, Slovenia (S. 4373-4379). , European Language Resources Association, ELRA-ELDA: Paris.
  • Faralli, S., Bizer, C., Eckert, K., Meusel, R. und Ponzetto, S. P. (2016). A Web application to search a large repository of taxonomic relations from the Web. In Kawamura, T., ISWC-P&D 2016 : Proceedings of the ISWC 2016 Posters & Demonstrations Track co-located with 15th International Semantic Web Conference (ISWC 2016) Kobe, Japan, October 19, 2016 (S. Paper 58). CEUR Workshop Proceedings, RWTH: Aachen.
  • Gentile, A. L., Kirstein, S., Paulheim, H. und Bizer, C. (2016). Extending RapidMiner with data search and integration capabilities. In Sack, H., The Semantic Web : ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29 – June 2, 2016, Revised Selected Papers (S. 161-171). Lecture Notes in Computer Science, Springer: Cham.
  • Hertling, S., Schröder, M., Jilek, C. und Dengel, A. (2016). Top-k shortest paths in directed labeled multigraphs. In Sack, H., Semantic web challenges : third SemWebEval Challenge at ESWC 2016, Heraklion, Crete, Greece, May 29 - June 2, 2016 : revised selected papers (S. 200-212). Communications in Computer and Information Science, Springer: Cham.
  • Huelss, J. und Paulheim, H. (2016). What SPARQL query logs tell and do not tell about semantic relatedness in LOD : or: the unsuccessful attempt to improve the browsing experience of DBpedia by exploiting query logs. In Gandon, F., The Semantic Web: ESWC 2015 Satellite Events : ESWC 2015 Satellite Events Portorož, Slovenia, May 31 – June 4, 2015, Revised Selected Papers (S. 297-308). Lecture Notes in Computer Science, Springer: Cham.
  • Lehmberg, O. und Bizer, C. (2016). Web table column categorisation and profiling. In , WebDB '16 : Proceedings of the 19th International Workshop on Web and Databases, San Francisco, CA, USA, June 26, 2016 : co-located with ACM SIGMOD 2016 (S. Article 4, 1-7). , ACM: New York, NY.
  • Lehmberg, O., Ritze, D., Meusel, R. und Bizer, C. (2016). A large public corpus of web tables containing time and context metadata. In Bourdeau, J., WWW '16 Companion : Proceedings of the 25th International Conference Companion on World Wide Web : Montreal, Canada, April 11 - 15, 2016 (S. 75-76). , ACM: New York, NY.
  • Melo, A., Paulheim, H. und Völker, J. (2016). Type prediction in RDF knowledge bases using hierarchical multilabel classification. In Akerkar, R., Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics, WIMS 2016, Nîmes, France, June 13-15, 2016 (S. Article 14, 1-10). , ACM: New York, NY.
  • Müller, A. C. und Paulheim, H. (2016). Towards combining ontology matchers via anomaly detection. In Shvaiko, P., OM 2015 : Proceedings of the 10th International Workshop on Ontology Matching collocated with the 14th International Semantic Web Conference (ISWC 2015) Bethlehem, PA, USA, October 12, 2015 (S. 40-44). CEUR Workshop Proceedings, RWTH: Aachen.
  • Nuzzolese, A. G., Gentile, A. L., Presutti, V. und Gangemi, A. (2016). Conference Linked Data: the ScholarlyData project. In Groth, P., The Semantic Web – ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17–21, 2016, Proceedings, Part II (S. 150-158). Lecture Notes in Computer Science, Springer: Cham.
  • Nuzzolese, A. G., Gentile, A. L., Presutti, V. und Gangemi, A. (2016). Semantic Web Conference ontology - a refactoring solution. In Sack, H., The Semantic Web : ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29 – June 2, 2016, Revised Selected Papers (S. 84-87). Lecture Notes in Computer Science, Springer: Cham.
  • Oulabi, Y., Meusel, R. und Bizer, C. (2016). Fusing time-dependent web table data. In , WebDB '16 : Proceedings of the 19th International Workshop on Web and Databases, San Francisco, CA, USA, June 26, 2016 : co-located with ACM SIGMOD 2016 (S. Article 3, 1-7). , ACM: New York, NY.
  • Paulheim, H. (2016). 14th International Semantic Web Conference 2015 Bethlehem, PA, USA; October 11–15. Künstliche Intelligenz : KI ; Forschung, Entwicklung, Erfahrungen ; Organ des Fach­bereichs 1 Künstliche Intelligenz der Gesellschaft für Informatik e.V., GI / Fach­bereich 1 der Gesellschaft für Informatik e.V, 30, 207-208.
  • Paulheim, H. und Stuckenschmidt, H. (2016). Fast approximate A-box consistency checking using machine learning. In Sack, H., The Semantic Web. Latest Advances and New Domains : 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 - June 2, 2016, Proceedings (S. 135-150). Lecture Notes in Computer Science, Springer International Publishing: Cham.
  • Paulheim, H. und Unger, C. (2016). Can predicate lexicalizations help in named entity disambiguation? In Paulheim, H., NLP & DBpedia 2015 : Proceedings of the Third NLP&DBpedia Workshop (NLP & DBpedia 2015) co-located with the 14th International Semantic Web Conference 2015 (ISWC 2015) Bethlehem, Pennsylvania, USA, October 11, 2015 (S. 92-97). CEUR Workshop Proceedings, RWTH: Aachen.
  • Petrovski, P. und Gentile, A. L. (2016). Can you judge a music album by its cover? In Paulheim, H., Know(at)LOD&CoDeS 2016 : Joint Proceedings of the 5th Workshop on Data Mining and Knowledge Discovery meets Linked Open Data and the 1st International Workshop on Completing and Debugging the Semantic Web (Know(at)LOD-2016, CoDeS-2016) ... with 13th ESWC 2016 (S. 1-4). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P. und Mika, P. (2016). Enriching product ads with Metadata from HTML annotations. In Sack, H., The Semantic Web. Latest Advances and New Domains : 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 - June 2, 2016, Proceedings (S. 151-167). Lecture Notes in Computer Science, Springer: Cham.
  • Ristoski, P. und Paulheim, H. (2016). Analyzing statistics with background knowledge from Linked Open Data. In Capadisli, S., SemStats 2013 : Proceedings of the 1st International Workshop on Semantic Statistics co-located with 13th International Semantic Web Conference (ISWC 2013), Sydney, Australia, October 11th, 2013 (S. Article 12). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P. und Paulheim, H. (2016). RDF2Vec: RDF graph embeddings for data mining. In Groth, P., The Semantic Web - ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17-21, 2016, Proceedings, Part I (S. 498-514). Lecture Notes in Computer Science, Springer International Publishing: Cham.
  • Ristoski, P. und Paulheim, H. (2016). Semantic Web in data mining and knowledge discovery: a comprehensive survey. Web Semantics, 36, 1-22.
  • Ristoski, P., Paulheim, H., Svatek, V. und Zeman, V. (2016). The Linked Data Mining Challenge 2016. In Paulheim, H., Know(at)LOD&CoDeS 2016 : Joint Proceedings of the 5th Workshop on Data Mining and Knowledge Discovery meets Linked Open Data and 1st International Workshop on Completing and Debugging the Semantic Web ...with 13th ESWC 2016, Heraklion, Greece, May 30th 2016 (S. 1-8). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P., de Vries, G. und Paulheim, H. (2016). A collection of benchmark datasets for systematic evaluations of machine learning on the Semantic Web. In Groth, P., The Semantic Web - ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17-21, 2016, Proceedings, Part II (S. 186-194). Lecture Notes in Computer Science, Springer: Cham.
  • Ritze, D., Lehmberg, O., Oulabi, Y. und Bizer, C. (2016). Profiling the potential of web tables for augmenting cross-domain knowledge bases. In Bourdeau, J., Proceedings of the 25th International Conference on World Wide Web, WWW 2016, Montreal, Canada, April 11 - 15, 2016 (S. 251-261). , ACM: Geneva, Switzerland.
  • Rosati, J., Ristoski, P., Di Noia, T., de Leone, R. und Paulheim, H. (2016). RDF graph embeddings for content-based recommender systems. In Bogers, T., CBRecSys 2016 : Proceedings of the 3rd Workshop on New Trends in Content-Based Recommender Systems co-located with ACM Conference on Recommender Systems (RecSys 2016) Boston, MA, USA, September 16, 2016 (S. 23-30). CEUR Workshop Proceedings, RWTH: Aachen.
  • Seitner, J., Bizer, C., Eckert, K., Faralli, S., Meusel, R., Paulheim, H. und Ponzetto, S. P. (2016). A large DataBase of hypernymy relations extracted from the Web. In Calzolari, N., Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) : May 23-28, 2016, Portorož, Slovenia (S. 360-367). , European Language Resources Association, ELRA-ELDA: Paris.

2015

2014

2013

2012

  • Aguirre, J. L., Eckert, K., Euzenat, J., Ferrara, A., van Hage, W., Hollink, L., Meilicke, C., Nikolov, A., Ritze, D., Scharffe, F., Shvaiko, P., Svab-Zamazal, O., Trojahn, C., Jiménez-Ruiz, E., Grau, B. C. und Zapilko, B. (2012). Results of the Ontology Alignment Evaluation Initiative 2012. In Shvaiko, P., OM-2012 Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012) Boston, MA, USA, November 11, 2012 (S. 1-43). CEUR Workshop Proceedings, RWTH: Aachen.
  • Boland, K., Ritze, D., Eckert, K. und Mathiak, B. (2012). Identifying References to Datasets in Publications. In Zaphiris, P., Theory and Practice of Digital Libraries : Second International Conference, TPDL 2012, Paphos, Cyprus, September 23-27, 2012. Proceedings (S. 150-161). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
  • Dang, T. T., Gabriel, A., Hertling, S., Roskosch, P., Wlotzka, M., Zilke, J. R., Janssen, F. und Paulheim, H. (2012). HotMatch results for OEAI 2012. In , Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012) (S. 145-151). CEUR Workshop Proceedings, RWTH: Aachen.
  • Eckert, K. (2012). Metadata Provenance in Europeana and the Semantic Web. Berlin: Humboldt-Univ.
  • Efremov, M., Zdraveski, V., Ristoski, P. und Trajanov, D. (2012). Semantic Stored Procedures Programming Environment and Performance Analysis. In Kocarev, L., ICT innovations 2011 (S. 357-366). Advances in Intelligent and Soft Computing, Springer: Berlin [u.a.].
  • Heath, T. und Bizer, C. (2012). Web de données : Méthodes et outils pour les données liées. Paris: Pearson.
  • Hertling, S. (2012). Hertuda results for OEAI 2012. In Shvaiko, P., OM 2012 : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012), Boston, MA, USA, November 11, 2012 (S. 141-144). CEUR Workshop Proceedings, RWTH: Aachen.
  • Hertling, S. und Paulheim, H. (2012). WikiMatch - Using Wikipedia for Ontology Matching. In , Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012) (S. 37-48). CEUR Workshop Proceedings, RWTH: Aachen.
  • Hertling, S. und Paulheim, H. (2012). WikiMatch results for OEAI 2012. In , Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012) (S. 220-225). CEUR Workshop Proceedings, RWTH: Aachen.
  • Hienert, D., Wegener, D. und Paulheim, H. (2012). Automatic Classification and Relations­hip Extraction for Multi-Lingual and Multi-Granular Events from Wikipedia. In Erp, M., DeRiVE 2012 : Proceedings of the Workhop on Detection, Representation, and Exploitation of Events in the Semantic Web (DeRiVE 2012); Workshop in conjunction with the 11th International Semantic Web Conference 2012 (ISWC 2012) (S. 1-10). CEUR Workshop Proceedings, RWTH: Aachen.
  • Isele, R. und Bizer, C. (2012). Learning Expressive Linkage Rules using Genetic Programming. Proceedings of the VLDB Endowment, 5, 1638-1649.
  • Paulheim, H. (2012). Browsing Linked Open Data with Auto Complete. 10th Semantic Web Challenge 2012 at the 11th International Semantic Web Conference, Boston, Mass..
  • Paulheim, H. (2012). Explain-a-LOD: Using Linked Open Data for Interpreting Statistics. In Duarte, C., IUI'12 : proceedings of the 17th International Conference on Intelligent User Interfaces; February 14 - 17, 2012, Lisbon, Portugal (S. 313-314). , ACM: [New York, NY].
  • Paulheim, H. (2012). Generating Possible Interpretations for Statistics from Linked Open Data. In Simperl, E., The Semantic Web: Research and Applications : 9th Extended Semantic Web Conference, ESWC 2012, Heraklion, Crete, Greece, May 27-31, 2012. Proceedings (S. 560-574). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
  • Paulheim, H. (2012). Ontologie­basierte Applikations­integration auf Nutzerschnittstellenebene. In Hölldobler, S. Ausgezeichnete Informatikdissertationen 2011 (S. 133-142). Bonn: Ges. für Informatik.
  • Paulheim, H. (2012). WeSeE-Match results for OEAI 2012. In , Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012) (S. 213-219). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. und Fürnkranz, J. (2012). Unsupervised Generation of Data Mining Features from Linked Open Data. In Burdescu, D., WIMS '12 : Proceedings of the 2nd International Conference on Web Intelligence, Mining and Semantics (S. Article 31). , ACM: New York, NY.
  • Paulheim, H., Oberle, D., Plendl, R. und Probst, F. (2012). An Architecture for Information Exchange based on Reference Models. In Sloane, A., Software Language Engineering : 4th international conference; revised selected papers (S. 160-179). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
  • Paulheim, H. und Pan, J. Z. (2012). Why the Semantic Web Should Become More Imprecise. In , What will the Semantic Web look like 10 years from now? In conjunction with the 11th International Semantic Web Conference 2012 (ISWC 2012) (S. ). , Univ. of Calif., Dept. of Geography: Santa Barbara, Calif..
  • Paulheim, H. und Probst, F. (2012). Ontology-Enhanced User Interfaces: A Survey. In Sheth, A. Semantic-Enabled Advancements on the Web : Applications Across Industries (S. 214-238). Hershey, PA: IGI Global.
  • Ristoski, P., Efremov, M., Zdraveski, V. und Trajanov, D. (2012). JDeveloper 11g R2 Jena Adapter Extension. In Bakeva, V., Proceedings of the Ninth International Conference on Informatics and Information Technology, CIIT 2012, April 19 - 22, 2012, Molika, Bitola, Macedonia (S. 1-5). , Univ. „Ss. Cyril and Methodius“, Fac. of Computer Science and Engineering: Skopje.
  • Ritze, D. und Eckert, K. (2012). Thesaurus mapping: a challenge for ontology alignment? In Shvaiko, P., OM-2012 Ontology Matching : Proceedings of the 7th International Workshop on Ontology Matching (OM-2012) collocated with the 11th International Semantic Web Conference (ISWC-2012), Boston, MA, USA, November 11, 2012 (S. 248-249). CEUR Workshop Proceedings, RWTH: Aachen.
  • Schulz, A. und Paulheim, H. (2012). Combining Government and Linked Open Data in Emergency Management. AI Mashup Challenge 2012 colocated with 9th Extended Semantic Web Conference (ESWC 2012), Heraklion, Greece.
  • Schulz, A., Paulheim, H. und Probst, F. (2012). Crisis Information Management in the Web 3.0 Age. In Rothkrantz, L., ISCRAM 2012 Conference Proceedings : book of papers; 9 th International Conference on Information Systems for Crisis Response and Management (S. ID: 160). , Simon Fraser Univ.: Vancouver.
  • Seeliger, A. und Paulheim, H. (2012). A Semantic Browser for Linked Open Data. 10th Semantic Web Challenge 2012 at the 11th International Semantic Web Conference in Boston, USA, Boston, Mass..
  • Trajanov, D., Stojanov, R., Jovanovik, M., Zdraveski, V., Ristoski, P., Georgiev, M. und Filiposka, S. (2012). Semantic Sky A Platform for Cloud Service Integration based on Semantic Web Technologies. In Presutti, V., I-SEMANTICS 2012 : proceedings of the 8th International Conference on Semantic Systems, September 5-7 2012, Graz, Austria (S. 109-116). , ACM Press: New York, NY.

2011

2010