Focus Group: Web-based Systems
(Prof. Bizer)
The Web-based Systems group conducts research on methods for integrating data from large numbers of data sources in the context of the open Web and in corporate data lakes. Our research includes areas such as entity matching, schema matching, table annotation, information extraction, and data discovery. Our current work focuses on utilizing large language models and LLM-based agents for data integration tasks. We apply the developed methods to integrate product data from large numbers of e-shops and to construct knowledge graphs such as DBpedia. The empirical research of the group includes monitoring the adoption of schema.org annotations on the public Web by regularly extracting structured data from large Web corpora.
People
Current Team:
- Prof. Dr. Christian Bizer
- Ralph Peeters: Entity Matching using Deep Learning
- Alexander Brinkmann: Data Search using Deep Learning
- Keti Korini: Table Annotation using Deep Learning
- Stephanie Keil: Administration
- Dr. Anna Primpeli (2022)
- Dr. Pedro Ortiz Suarez (2022)
- Dr. Yaser Oulabi (2020)
- Dr. Oliver Lehmberg (2019)
- Benedikt Kleppmann (2018)
- Dr. Dominique Ritze (2017)
- Petar Petrovski (2017)
- Dr. Anna Lisa Gentile (2017)
- Dr. Robert Meusel (2016)
- Prof. Dr. Kai Eckert (2015)
- Dr. Volha Bryl (2015)
- Max Schlachtenberg (2014)
- Dr. Robert Isele (2013)
Data and Software
Awards
- SemTab Challenge at International Semantic Web Conference 2022 – Winner of the Dataset Track
- SIGMOD Programming Contest 2022
- SWSA Ten-Year Award at International Semantic Web Conference 2019
- SWSA Ten-Year Award at International Semantic Web Conference 2017
- Best Demo Award at Extended Semantic Web Conference 2016
- 7 Years Best Paper Award at Extended Semantic Web Conference 2016
- Yahoo Faculty Research and Engagement Program (FREP) Award 2015
- Semantic Web Challenge 2014 – Winner of the Open Track
- Semantic Web Challenge 2014 – Winner of the Big Data Track
- Semantic Web Journal – Outstanding Paper Award 2014
Teaching
Publications
2025
- Peeters, R., Steiner, A. and Bizer, C. (2025). Entity matching using large language models. In , Proceedings 28th International Conference on Extending Database Technology (EDBT 2025), Barcelona, Spain, March 25-March 28 (S. 529–541). OpenProceedings, OpenProceedings.org: Konstanz.
2024
- Brinkmann, A., Baumann, N. and Bizer, C. (2024). Using LLMs for the extraction and normalization of product attribute values. In , Advances in databases and information systems : 28th European Conference, ADBIS 2024, Bayonne, France, August 28–31, 2024 ; Proceedings (S. 217–230). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Brinkmann, A., Shraga, R. and Bizer, C. (2024). SC-block: Supervised contrastive blocking within entity resolution pipelines. In , The Semantic Web : 21st International Conference, ESWC 2024, Hersonissos, Crete, Greece, May 26–30, 2024, Proceedings, Part I (S. 121–142). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Peeters, R., Brinkmann, A. and Bizer, C. (2024). The Web Data Commons Schema.org Table Corpora. In , WWW '24 companion : companion proceedings of the ACM Web Conference 2024 (S. 1079-1082). , Association for Computing Machinery: New York, NY, United States.
2023
- Bizer, C. (2023). GPT-4 versus BERT: Which foundation model is more suitable for integrating data from the web? WEBIST 2023, 19th International Conference on Web Information Systems and Technologies, Roma, Italy.
- Bizer, C., Heath, T. and Berners-Lee, T. (2023). Linked data – the story so far. In Linking the world’s information: Essays on Tim Berners-Lee’s Invention of the World Wide Web (S. 115–143). New York: ACM Digital Library.
- Brinkmann, A. (2023). Neural data search for table augmentation.
In , Proceedings of the Workshops of the EDBT/
ICDT 2023 Joint Conference, Ioannina, Greece, March, 28, 2023 (S. 1–4). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany. - Brinkmann, A., Primpeli, A. and Bizer, C. (2023). The Web Data Commons Schema.Org Data Set Series. In , The ACM Web Conference : Companion of the World Wide Web Conference WWW 2023 (S. 136–139). , Association for Computing Machinery: New York, NY.
- Hassanzadeh, O., Abdelmageed, N., Efthymiou, V., Chen, J., Cutrona, V., Hulsebos, M., Jiménez-Ruiz, E., Khatiwada, A., Korini, K., Kruit, B., Sequeda, J. and Srinivas, K. (2023). Results of SemTab 2023. In , Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching, SemTab 2023, co-located with the 22nd International Semantic Web Conference, ISWC 2023, Athens, Greece, November 6–10, 2023 (S. 1–14). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Korini, K. and Bizer, C. (2023). Column type annotation using ChatGPT. In , Joint proceedings of workshops at the 49th International Conference on Very Large Data Bases (VLDB 2023), Vancouver, Canada, August 28 – September 1, 2023, VLDBW 2023 (S. 1–12). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Peeters, R. and Bizer, C. (2023). Using ChatGPT for Entity Matching. In , New Trends in Database and Information Systems : ADBIS 2023 short papers, doctoral consortium and workshops: AIDMA, DOING, K-Gals, MADEISD, PeRS, Barcelona, Spain, September 4–7, 2023, Proceedings (S. 221–230). Communications in Computer and Information Science, Springer: Cham.
- Peeters, R., Der, R. C. and Bizer, C. (2023). WDC products: A multi-dimensional entity matching benchmark. In , Proceedings 27th International Conference on Extending Database Technology (EDBT 2024), Paestum, Italy, March 25 – March 28 (S. 22–33). OpenProceedings, OpenProceedings.org: Konstanz.
2022
- Korini, K., Peeters, R. and Bizer, C. (2022). SOTAB: The WDC Schema.org table annotation benchmark. In , SemTab 2022 : Proceedings of the Semantic Web Challenge on Tabular Data to Knowledge Graph Matching, co-located with the 21st International semantic Web Conference (ISWC 2022), virtual conference, October 23–27, 2022 (S. 14–19). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Peeters, R. and Bizer, C. (2022). Cross-language learning for product matching. In , Companion Proceedings of the Web Conference 2022 (S. 236–238). , ACM: New York, NY.
- Peeters, R. and Bizer, C. (2022). Integrating product data using deep learning : Art.-Nr. 11. In , Proceedings of the 7th bwHPC Symposium (S. 59–62). , Universität Ulm: Ulm.
- Peeters, R. and Bizer, C. (2022). Supervised contrastive learning for product matching. In , Companion Proceedings of the Web Conference 2022 (S. 248–251). , ACM: New York, NY.
- Primpeli, A. (2022). Reducing the labeling effort for entity resolution using distant supervision and active learning. Dissertation. Mannheim.
- Primpeli, A. and Bizer, C. (2022). Impact of the characteristics of multi-source entity matching tasks on the performance of active learning methods. In , The Semantic Web : 19th International Conference, ESWC 2022, Hersonissos, Crete, Greece, May 29 – June 2, 2022, proceedings (S. 113–129). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
2021
- Brinkmann, A. and Bizer, C. (2021). Improving hierarchical product classification using domain-specific language modelling. Bulletin of the Technical Committee on Data Engineering / IEEE Computer Society, 44, 14–25.
- Peeters, R. and Bizer, C. (2021). Dual-objective fine-tuning of BERT for entity matching. In , 47th International Conference on Very Large Data Bases (VLDB 2021) : Copenhagen, Denmark, August 16–20, 2021 (S. 1913-1921). Proceedings of the VLDB Endowment, Association of Computing Machinery: New York, NY.
- Primpeli, A. and Bizer, C. (2021). Graph-boosted active learning for multi-source entity resolution. In , The Semantic Web – ISWC 2021 : 20th international semantic web conference, ISWC 2021, virtual event, October 24–28, 2021, proceedings (S. 182–199). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
2020
- Oulabi, Y. (2020). Augmenting cross-domain knowledge bases using web tables. Dissertation. Mannheim.
- Peeters, R., Bizer, C. and Glavaš, G. (2020). Intermediate training of BERT for product matching. In , DI2KG 2020 : Proceedings of the 2nd International Workshop on Challenges and Experiences from Data Integration to Knowledge Graphs co-located with 46th International Conference on Very Large Data Bases (VLDB 2020), Tokyo, Japan, August 31, 2020 (S. 1–2). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Peeters, R., Primpeli, A., Wichtlhuber, B. and Bizer, C. (2020). Using schema.org annotations for training and maintaining product matchers. In , WIMS 2020: proceedings of the 10th International Conference on Web Intelligence, Mining and Semantics, Biarritz, France, June 30 – July 3, 2020 (S. 195–204). , ACM: New York, NY.
- Petrovski, P. and Bizer, C. (2020). Learning expressive linkage rules from sparse data. Semantic Web, 11, 549–567.
- Primpeli, A. and Bizer, C. (2020). Profiling entity matching benchmark tasks. In , CIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Management : (S. 3101-3108). , Association for Computing Machinery: New York, NY.
- Primpeli, A., Bizer, C. and Keuper, M. (2020). Unsupervised bootstrapping of active learning for entity resolution. In , The Semantic Web : 17th International Conference, ESWC 2020, Heraklion, Crete, Greece, May 31-June 4, 2020, Proceedings (S. 215–231). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Zhang, Z., Bizer, C., Peeters, R. and Primpeli, A. (2020). MWPD2020: Semantic Web challenge on Mining the Web of HTML-embedded product data. In , MWPD 2020 : Proceedings of the Semantic Web Challenge on Mining the Web of HTML-embedded Product Data co-located with the 19th International Semantic Web Conference (ISWC 2020) Athens, Greece, November 5, 2020 (S. 2–18). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
2019
- Bizer, C., Primpeli, A. and Peeters, R. (2019). Using the Semantic Web as a source of training data. Datenbank-Spektrum, 19, 127–135.
- Lehmberg, O. (2019). Web table integration and profiling for knowledge base augmentation. Dissertation. Mannheim.
- Lehmberg, O. and Bizer, C. (2019). Profiling the semantics of n-ary web table data. In , Proceedings of the International Workshop on Semantic Big Data : SBD '19, Amsterdam, The Netherlands, June 30 – July 5, 2019 (S. 5:1–5:6). , ACM: New York, NY, USA.
- Lehmberg, O. and Bizer, C. (2019). Synthesizing N-ary relations from web tables. In , WIMS2019 : Proceedings of the 9th International Conference on Web Intelligence, Mining and Semantics, Seoul, Republic of Korea, June 26 – 28, 2019 (S. 17:1–17:12). , ACM: New York, NY, USA.
- Melo, A. and Paulheim, H. (2019). Local and global feature selection for multilabel classification with binary relevance : An empirical comparison on flat and hierarchical problems. Artificial Intelligence Review, 51, 33–60.
- Oulabi, Y. and Bizer, C. (2019). Extending cross-domain knowledge bases with long tail entities using web table data. In , Advances in Database Technology – 22nd International Conference on Extending Database Technology, EDBT 2019, Lisbon, Portugal, March 26–29, 2019 : proceedings (S. 385–396). , OpenProceedings.org: Konstanz.
- Oulabi, Y. and Bizer, C. (2019). Using weak supervision to identify long-tail entities for knowledge base completion. In , Semantic systems : The power of AI and knowledge graphs : 15th International Conference, SEMANTiCS 2019, Karlsruhe, Germany, September 9–12, 2019, proceedings (S. 83–98). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Primpeli, A. and Bizer, C. (2019). Robust active learning of expressive linkage rules. In , WIMS2019 : Proceedings of the 9th International Conference on Web Intelligence, Mining and Semantics, Seoul, Republic of Korea, June 26 – 28, 2019 (S. 2:1–2:7). , ACM: New York, NY.
- Primpeli, A., Peeters, R. and Bizer, C. (2019). The WDC training dataset and gold standard for large-scale product matching. In , Companion Proceedings of The 2019 World Wide Web Conference (S. 381–386). , ACM: New York, NY, USA.
2018
- Bizer, C., Vidal, M.-E. and Skaf-Molli, H. (2018). Linked Open Data. In , Encyclopedia of Database Systems (S. 2096-2101). New York, NY: Springer.
- Bizer, C., Vidal, M.-E. and Weiss, M. (2018). RDF Technology. In , Encyclopedia of Database Systems (S. 3106-3109). New York, NY: Springer.
- Bizer, C., Vidal, M.-E. and Weiss, M. (2018). Resource Description Framework. In , Encyclopedia of Database Systems (S. 3221-3224). New York, NY: Springer.
- Kleppmann, B., Bizer, C., Yaqub, E., Temme, F., Schlunder, P., Arnu, D. and Klinkenberg, R. (2018). Density- and correlation-based table extension. In , LWDA 2018 : Proceedings of the Conference “Lernen, Wissen, Daten, Analysen” Mannheim, Germany, August 22–24, 2018 (S. 191–194). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Ristoski, P., Petrovski, P., Mika, P. and Paulheim, H. (2018). A machine learning approach for product matching and categorization. Semantic Web, 9, 707–728.
2017
- van Erp, M., Hellmann, S., McCrae, J. P., Chiarcos, C., Choi, K.-S., Gracia, J., Hayashi, Y., Koide, S., Mendes, P., Paulheim, H. and Takeda, H. (eds.) (2017). Knowledge graphs and language technology : ISWC 2016 International Workshops: KEKI and NLP&DBpedia, Kobe, Japan, October 17–21, 2016, revised selected papers. Berlin [u.a.]: Springer.
- Blomqvist, E., Hose, K., Paulheim, H., Ławrynowicz, A., Ciravegna, F. and Hartig, O. (eds.) (2017). The Semantic Web: ESWC 2017 Satellite Events : ESWC 2017 Satellite Events, Portorož, Slovenia, May 28 – June 1, 2017, revised selected papers. Berlin [u.a.]: Springer.
- Cochez, M., Ristoski, P., Ponzetto, S. P. and Paulheim, H. (2017). Biased graph walks for RDF graph embeddings. In , WIMS '17 Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics : Amantea, Italy, June 19 – 22, 2017 (S. Article 21, 1–12). , ACM: New York, NY.
- Gentile, A. L., Ristoski, P., Eckel, S., Ritze, D. and Paulheim, H. (2017). Entity matching on web tables: a table embeddings approach for blocking. In , Advances in Database Technology – EDBT 2017 : 20th International Conference on Extending Database Technology, Venice, Italy, March 21–24, 2017, Proceedings (S. 510–513). , OpenProceedings: Konstanz.
- Heist, N. and Paulheim, H. (2017). Language-agnostic relation extraction from Wikipedia abstracts. In , The Semantic Web – ISWC 2017 : 16th International Semantic Web Conference, Vienna, Austria, October 21–25, 2017, proceedings, part I (S. 383–399). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Hertling, S. (2017). WikiV3 results for OAEI 2017. In , OM 2017 : Proceedings of the 12th International Workshop on Ontology Matching co-located with the 16th International Semantic Web Conference (ISWC 2017) Vienna, Austria, October 21, 2017 (S. 190–195). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Hertling, S. and Paulheim, H. (2017). WebIsALOD: providing hypernymy relations extracted from the web as linked open data. In , The Semantic Web – ISWC 2017 : 16th International Semantic Web Conference, Vienna, Austria, October 21–25, 2017, proceedings, part II (S. 111–119). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Hertling, S., Schröder, M., Jilek, C. and Dengel, A. (2017). Where is that button again?! – Towards a universal GUI search engine. In , Proceedings of the 9th International Conference on Agents and Artificial Intelligence : ICAART-17, February 24–26, Porto, Portugal (S. 217–227). , SCITEPRESS: Setúbal.
- Hofmann, A., Perchani, S., Portisch, J., Hertling, S. and Paulheim, H. (2017). DBkWik: towards knowledge graph creation from thousands of wikis. In , ISWC-P&D-Industry 2017 : Proceedings of the ISWC 2017 Posters & Demonstrations and Industry Tracks co-located with 16th International Semantic Web Conference (ISWC 2017) Vienna, Austria, October 23rd to 25th, 2017 (S. Paper 540). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Krstanovic, S. and Paulheim, H. (2017). Ensembles of recurrent neural networks for robust time series forecasting. In , Artificial Intelligence XXXIV : 37th SGAI International Conference on Artificial Intelligence, AI 2017, Cambridge, UK, December 12–14, 2017, proceedings (S. 34–46). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Lehmann, J., Auer, S., Capadisli, S., Janowicz, K., Bizer, C., Heath, T., Hogan, A. and Berners-Lee, T. (2017). LDOW2017: 10th Workshop on Linked Data on the Web. In , WWW '17 Companion Proceedings of the 26th International Conference on World Wide Web Companion (S. 1679-1680). , International World Wide Web Conferences Steering Committee: Geneva.
- Lehmberg, O. and Bizer, C. (2017). Stitching web tables for improving matching quality. Proceedings of the VLDB Endowment, 10, 1502-1513.
- Lehmberg, O., Brinkmann, A. and Bizer, C. (2017). WInte.r – a web data integration framework. In , ISWC-P&D-Industry 2017 : Proceedings of the ISWC 2017 Posters & Demonstrations and Industry Tracks co-located with 16th International Semantic Web Conference (ISWC 2017) Vienna, Austria, October 23rd to 25th, 2017 (S. Paper 506). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Melo, A. and Paulheim, H. (2017). An approach to correction of erroneous links in knowledge graphs. In , K-CAPSAT-2017 : Proceedings of Workshops and Tutorials of the 9th International Conference on Knowledge Capture (K-CAP2017) Austin, Texas, December 4th, 2017 (S. 54–57). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Melo, A. and Paulheim, H. (2017). Detection of relation assertion errors in knowledge graphs. In , Proceedings of the Knowledge Capture Conference, K-CAP 2017, Austin, TX, USA, December 4–6, 2017 (S. Article 22,1–8). , ACM: New York, NY, USA.
- Melo, A. and Paulheim, H. (2017). Synthesizing knowledge graphs for link and type prediction benchmarking. In , The Semantic Web : 14th International Conference, ESWC 2017, Portorož, Slovenia, May 28 – June 1, 2017, Proceedings, Part I (S. 136–151). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Melo, A., Völker, J. and Paulheim, H. (2017). Type prediction in noisy RDF knowledge bases using hierarchical multilabel classification with graph and latent features. International Journal on Artificial Intelligence Tools : IJAIT, 26, Art. 1760011,1–32.
- Meusel, R. (2017). Web-scale profiling of semantic annotations in HTML pages. Dissertation. Mannheim.
- Meusel, R., Ritze, D. and Paulheim, H. (2017). Towards more accurate statistical profiling of deployed schema.org microdata. ACM Journal of Data and Information Quality : JDIQ, 8, 1–31.
- Oulabi, Y. and Bizer, C. (2017). Estimating missing temporal meta-information using Knowledge-Based-Trust. In , KDWEB 2017 : proceedings of the 3rd International Workshop on Knowledge Discovery on the WEB Cagliari, Italy, September 11 to 12, 2017 (S. Paper 4). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Paulheim, H. (2017). A robust number parser based on conditional random fields. In , KI 2017: Advances in Artificial Intelligence : 40th Annual German Conference on AI, Dortmund, Germany, September 25–29, 2017, proceedings (S. 337–343). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Paulheim, H. (2017). Data-driven joint debugging of the DBpedia mappings and ontology. In , The Semantic Web : 14th International Conference, ESWC 2017, Portorož, Slovenia, May 28 – June 1, 2017, Proceedings, Part I (S. 404–418). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Paulheim, H. (2017). Knowledge graph refinement: A survey of approaches and evaluation methods. Semantic Web, 8, 489–508.
- Paulheim, H. (2017). Towards profiling knowledge graphs. In , Profiles 2017 : Proceedings of the 4th International Workshop on Dataset PROFIling and fEderated Search for Web Data (PROFILES 2017) co-located with The 16th International Semantic Web Conference (ISWC 2017) Vienna, Austria, October 22, 2017 (S. Paper 1). CEUR-WS, CEUR Workshop Proceedings: Aachen.
- Petrovski, P. and Bizer, C. (2017). Extracting attribute-value pairs from product specifications on the web. In , WI '17 Proceedings of the International Conference on Web Intelligence : Leipzig, Germany, August 23–26, 2017 (S. 558–565). , ACM: New York, NY [u.a.].
- Petrovski, P., Primpeli, A., Meusel, R. and Bizer, C. (2017). The WDC gold standards for product feature extraction and product matching. In , E-Commerce and Web Technologies : 17th International Conference, EC-Web 2016, Porto, Portugal, September 5–8, 2016, Revised Selected Papers (S. 73–86). Lecture Notes in Business Information Processing : LNBIP, Springer: Berlin [u.a.].
- Primpeli, A. and Bizer, C. (2017). Generalizing matching knowledge using active learning. In , VLDB-PhD 2017 : Proceedings of the VLDB 2017 PhD Workshop co-located with the 43rd International Conference on Very Large Databases (VLDB 2017) Munich, Germany, August 28, 2017 (S. 29–32). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Primpeli, A., Meusel, R., Bizer, C. and Stuckenschmidt, H. (2017). The Web Data Commons structured data extraction. In , E-Science-Tage 2017: Forschungsdaten managen (S. 1). , Heidelberg University: Heidelberg.
- Ringler, D. and Paulheim, H. (2017). One knowledge graph to rule them all? Analyzing the differences between DBpedia, YAGO, Wikidata & co.. In , KI 2017: Advances in Artificial Intelligence : 40th Annual German Conference on AI, Dortmund, Germany, September 25–29, 2017, proceedings (S. 366–372). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Ritze, D. (2017). Web-scale web table to knowledge base matching. Dissertation. Mannheim.
- Ritze, D. and Bizer, C. (2017). Matching web tables to DBpedia – a feature utility study. In , Proceedings of the 20th International Conference on Extending Database Technology, EDBT 2017, Venice, Italy, March 21–24, 2017 (S. 210–221). , OpenProceedings: Konstanz.
- Schröder, M., Jilek, C., Hees, J., Hertling, S. and Dengel, A. (2017). RDF spreadsheet editor : get (G)rid of your RDF data entry problems. In , ISWC-P&D-Industry 2017 : Proceedings of the ISWC 2017 Posters & Demonstrations and Industry Tracks co-located with 16th International Semantic Web Conference (ISWC 2017) Vienna, Austria, October 23rd to 25th, 2017 (S. Paper 635). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Zhang, Z., Gentile, A. L., Blomqvist, E., Augenstein, I. and Ciravegna, F. (2017). An unsupervised data-driven method to discover equivalent relations in large Linked Datasets. Semantic Web, 8, 197–223.
2016
- Auer, S., Berners-Lee, T., Bizer, C. and Heath, T. (eds.) (2016). LDOW 2016 : Proceedings of the Workshop on Linked Data on the web, LDOW 2016, co-located with 25th International World Wide Web Conference(WWW 2016); Montreal, Canada, April 12th, 2016. Aachen, Germany: RWTH Aachen.
- Auer, S., Heath, T., Bizer, C. and Berners-Lee, T. (2016). LDOW2016: 9th Workshop on Linked Data on the Web. In , Proceedings of the 25th International Conference on World Wide Web, WWW 2016, Montreal, Canada, April 11–15, 2016, Companion Volume (S. 1039-1040). , ACM: Geneva, Switzerland.
- Basile, P., Caputo, A., Gentile, A. L. and Rizzo, G. (2016). Overview of the EVALITA 2016 Named Entity rEcognition and Linking in Italian Tweets (NEEL-IT) Task. In , Proceedings CLiC-it 2016 and EVALITA 2016 : Proceedings of Third Italian Conference on Computational Linguistics (CLiC-it 2016) & Fifth Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2016) (S. Paper 7, 1–8). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Bizer, C., Dong, L., Ilyas, I. and Vidal, M.-E. (2016). Editorial: Special issue on web data quality. Journal of Data and Information Quality : JDIQ, 8, 1:1–1:3.
- Bryl, V., Bizer, C. and Paulheim, H. (2016). Gathering alternative surface forms for DBpedia entities. In , NLP & DBpedia 2015 : Proceedings of the Third NLP&DBpedia Workshop (NLP & DBpedia 2015) co-located with the 14th International Semantic Web Conference 2015 (ISWC 2015) Bethlehem, Pennsylvania, USA, October 11, 2015 (S. 13–24). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- van Erp, M., Mendes, P. N., Paulheim, H., Ilievski, F., Plu, J., Rizzo, G. and Waitelonis, J. (2016). Evaluating entity linking: an analysis of current benchmark datasets and a roadmap for doing a better job. In , Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) : May 23–28, 2016, Portorož, Slovenia (S. 4373-4379). , European Language Resources Association, ELRA-ELDA: Paris.
- Faralli, S., Bizer, C., Eckert, K., Meusel, R. and Ponzetto, S. P. (2016). A Web application to search a large repository of taxonomic relations from the Web. In , ISWC-P&D 2016 : Proceedings of the ISWC 2016 Posters & Demonstrations Track co-located with 15th International Semantic Web Conference (ISWC 2016) Kobe, Japan, October 19, 2016 (S. Paper 58). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Gentile, A. L., Kirstein, S., Paulheim, H. and Bizer, C. (2016). Extending RapidMiner with data search and integration capabilities. In , The Semantic Web : ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29 – June 2, 2016, Revised Selected Papers (S. 161–171). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Hertling, S., Schröder, M., Jilek, C. and Dengel, A. (2016). Top-k shortest paths in directed labeled multigraphs. In , Semantic web challenges : third SemWebEval Challenge at ESWC 2016, Heraklion, Crete, Greece, May 29 – June 2, 2016 : revised selected papers (S. 200–212). Communications in Computer and Information Science, Springer: Cham.
- Huelss, J. and Paulheim, H. (2016). What SPARQL query logs tell and do not tell about semantic relatedness in LOD : or: the unsuccessful attempt to improve the browsing experience of DBpedia by exploiting query logs. In , The Semantic Web: ESWC 2015 Satellite Events : ESWC 2015 Satellite Events Portorož, Slovenia, May 31 – June 4, 2015, Revised Selected Papers (S. 297–308). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Lehmberg, O. and Bizer, C. (2016). Web table column categorisation and profiling. In , WebDB '16 : Proceedings of the 19th International Workshop on Web and Databases, San Francisco, CA, USA, June 26, 2016 : co-located with ACM SIGMOD 2016 (S. Article 4, 1–7). , ACM: New York, NY.
- Lehmberg, O., Ritze, D., Meusel, R. and Bizer, C. (2016). A large public corpus of web tables containing time and context metadata. In , WWW '16 Companion : Proceedings of the 25th International Conference Companion on World Wide Web : Montreal, Canada, April 11 – 15, 2016 (S. 75–76). , ACM: New York, NY.
- Melo, A., Paulheim, H. and Völker, J. (2016). Type prediction in RDF knowledge bases using hierarchical multilabel classification. In , Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics, WIMS 2016, Nîmes, France, June 13–15, 2016 (S. Article 14, 1–10). , ACM: New York, NY.
- Müller, A. C. and Paulheim, H. (2016). Towards combining ontology matchers via anomaly detection. In , OM 2015 : Proceedings of the 10th International Workshop on Ontology Matching collocated with the 14th International Semantic Web Conference (ISWC 2015) Bethlehem, PA, USA, October 12, 2015 (S. 40–44). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Nuzzolese, A. G., Gentile, A. L., Presutti, V. and Gangemi, A. (2016). Conference Linked Data: the ScholarlyData project. In , The Semantic Web – ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17–21, 2016, Proceedings, Part II (S. 150–158). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Nuzzolese, A. G., Gentile, A. L., Presutti, V. and Gangemi, A. (2016). Semantic Web Conference ontology – a refactoring solution. In , The Semantic Web : ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29 – June 2, 2016, Revised Selected Papers (S. 84–87). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Oulabi, Y., Meusel, R. and Bizer, C. (2016). Fusing time-dependent web table data. In , WebDB '16 : Proceedings of the 19th International Workshop on Web and Databases, San Francisco, CA, USA, June 26, 2016 : co-located with ACM SIGMOD 2016 (S. Article 3, 1–7). , ACM: New York, NY.
- Paulheim, H. (2016). 14th International Semantic Web Conference 2015 Bethlehem, PA, USA; October 11–15. Künstliche Intelligenz : KI, 30, 207–208.
- Paulheim, H. and Stuckenschmidt, H. (2016). Fast approximate A-box consistency checking using machine learning. In , The Semantic Web. Latest Advances and New Domains : 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 – June 2, 2016, Proceedings (S. 135–150). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Paulheim, H. and Unger, C. (2016). Can predicate lexicalizations help in named entity disambiguation? In , NLP & DBpedia 2015 : Proceedings of the Third NLP&DBpedia Workshop (NLP & DBpedia 2015) co-located with the 14th International Semantic Web Conference 2015 (ISWC 2015) Bethlehem, Pennsylvania, USA, October 11, 2015 (S. 92–97). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Petrovski, P. and Gentile, A. L. (2016). Can you judge a music album by its cover? In , Know@LOD&CoDeS 2016 : Joint Proceedings of the 5th Workshop on Data Mining and Knowledge Discovery meets Linked Open Data and the 1st International Workshop on Completing and Debugging the Semantic Web (Know@LOD-2016, CoDeS-2016) ... with 13th ESWC 2016 (S. 1–4). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Ristoski, P. and Mika, P. (2016). Enriching product ads with Metadata from HTML annotations. In , The Semantic Web. Latest Advances and New Domains : 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 – June 2, 2016, Proceedings (S. 151–167). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Ristoski, P. and Paulheim, H. (2016). Analyzing statistics with background knowledge from Linked Open Data. In , SemStats 2013 : Proceedings of the 1st International Workshop on Semantic Statistics co-located with 13th International Semantic Web Conference (ISWC 2013), Sydney, Australia, October 11th, 2013 (S. Article 12). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Ristoski, P. and Paulheim, H. (2016). RDF2Vec: RDF graph embeddings for data mining. In , The Semantic Web – ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17–21, 2016, Proceedings, Part I (S. 498–514). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Ristoski, P. and Paulheim, H. (2016). Semantic Web in data mining and knowledge discovery: a comprehensive survey. Web Semantics, 36, 1–22.
- Ristoski, P., Paulheim, H., Svátek, V. and Zeman, V. (2016). The Linked Data Mining Challenge 2016. In , Know@LOD&CoDeS 2016 : Joint Proceedings of the 5th Workshop on Data Mining and Knowledge Discovery meets Linked Open Data and 1st International Workshop on Completing and Debugging the Semantic Web ...with 13th ESWC 2016, Heraklion, Greece, May 30th 2016 (S. 1–8). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Ristoski, P., de Vries, G. and Paulheim, H. (2016). A collection of benchmark datasets for systematic evaluations of machine learning on the Semantic Web. In , The Semantic Web – ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17–21, 2016, Proceedings, Part II (S. 186–194). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Ritze, D., Lehmberg, O., Oulabi, Y. and Bizer, C. (2016). Profiling the potential of web tables for augmenting cross-domain knowledge bases. In , Proceedings of the 25th International Conference on World Wide Web, WWW 2016, Montreal, Canada, April 11 – 15, 2016 (S. 251–261). , ACM: Geneva, Switzerland.
- Rosati, J., Ristoski, P., Di Noia, T., de Leone, R. and Paulheim, H. (2016). RDF graph embeddings for content-based recommender systems. In , CBRecSys 2016 : Proceedings of the 3rd Workshop on New Trends in Content-Based Recommender Systems co-located with ACM Conference on Recommender Systems (RecSys 2016) Boston, MA, USA, September 16, 2016 (S. 23–30). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Seitner, J., Bizer, C., Eckert, K., Faralli, S., Meusel, R., Paulheim, H. and Ponzetto, S. P. (2016). A large DataBase of hypernymy relations extracted from the Web. In , Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) : May 23–28, 2016, Portorož, Slovenia (S. 360–367). , European Language Resources Association, ELRA-ELDA: Paris.
2015
- Bizer, C., Heath, T., Auer, S. and Berners-Lee, T. (eds.) (2015). LDOW 2015 : Proceedings of the Workshop on Linked Data on the Web ; co-located with the 24rd International World Wide Web Conference (WWW 2015) ; Florence, Italy, May 19, 2015. Aachen, Germany: RWTH Aachen.
- De Nies, T., Meusel, R., Ritze, D., Eckert, K., Dimou, A., De Vocht, L., Verborgh, R., Mannens, E. and Van de Walle, R. (2015). A Lightweight Provenance Pingback and Query Service for Web Publications. In , Provenance and Annotation of Data and Processes : 5th International Provenance and Annotation Workshop, IPAW 2014, Cologne, Germany, June 9–13, 2014. Revised Selected Papers (S. 203–208). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- García, R., Paulheim, H. and Di Maio, P. (2015). Special issue on Semantic Web interfaces [Editorial]. Semantic Web, 6, 213–214.
- Gentile, A. L., Acosta, M., Costabello, L., Nuzzolese, A. G., Presutti, V. and Reforgiato Recupero, D. (2015). Conference live: accessible and sociable conference semantic data. In , Proceedings of the 24th International Conference on World Wide Web Companion : May 18 – 22, 2015, Florence, Italy (S. 1007-1012). , ACM: New York, NY.
- Gentile, A. L., Zhang, Z. and Ciravegna, F. (2015). Early steps towards web scale information extraction with LODIE. AI Magazine, 36, 55–64.
- Huelss, J. and Paulheim, H. (2015). What SPARQL query logs tell and do not tell about semantic relatedness in LOD : or: the unsuccessful attempt to improve the browsing experience of DBpedia by exploiting query logs. In , NoISE 2015 : Proceedings of the Workshop on Negative or Inconclusive Results in Semantic Web Co-located with the 12th Extended Semantic Web Conference (ESWC 2015) Portoroz, Slovenia, June 1st, 2015 (S. Paper 1). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P. N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S. and Bizer, C. (2015). DBpedia – A large-scale, multilingual knowledge base extracted from Wikipedia. Semantic Web, 6, 167–195.
- Lehmberg, O., Ritze, D., Ristoski, P., Meusel, R., Paulheim, H. and Bizer, C. (2015). The Mannheim Search Join Engine. Web Semantics, 35, 159–166.
- Meusel, R., Bizer, C. and Paulheim, H. (2015). A web-scale study of the adoption and evolution of the schema.org vocabulary over time. In , Proceedings of the 5th International Conference on Web Intelligence, Mining and Semantics, WIMS 2015, Larnaca, Cyprus, July 13–15, 2015 (S. Article 15, 1–11). WIMS '15, ACM: New York, NY.
- Meusel, R. and Paulheim, H. (2015). Creating large-scale training and test corpora for extracting structured data from the web. In , Linked Data for Information Extraction : Proceedings of the Third International Workshop on Linked Data for Information Extraction (LD4IE2015) co-loc. with the 14th International Semantic Web Conference (ISWC 2015) ; Bethlehem, PA, USA, Oct. 12, 2015 (S. 2–6). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Meusel, R. and Paulheim, H. (2015). Heuristics for fixing common errors in deployed schema.org microdata. In , The Semantic Web: Research and Applications : 12th International Conference, ESWC 2015, Portoroz, Slovenia, May 30 – June 4, 2015. Proceedings (S. 152–168). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Meusel, R., Primpeli, A., Meilicke, C., Paulheim, H. and Bizer, C. (2015). Exploiting microdata annotations to consistently categorize product offers at web scale. In , E-Commerce and Web Technologies : 16th International Conference on Electronic Commerce and Web Technologies, EC-Web 2015, Valencia, Spain, September 2015, revised selected papers (S. 83–99). Lecture Notes in Business Information Processing : LNBIP, Springer: Berlin [u.a.].
- Meusel, R., Spahiu, B., Bizer, C. and Paulheim, H. (2015). Towards automatic topical classification of LOD datasets. In , LDOW 2015 : Proceedings of the Workshop on Linked Data on the Web ; co-located with the 24th International World Wide Web Conference (WWW 2015) ; Florence, Italy, May 19th, 2015 (S. Paper 03). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Meusel, R., Vigna, S., Lehmberg, O. and Bizer, C. (2015). The graph structure in the web – analyzed on different aggregation levels. The Journal of Web Science, 1, 33–47.
- Paulheim, H. (2015). Nobody wants to live in a cold city where no music has been recorded: analyzing statistics with Explain-a-LOD. In , The Semantic Web: ESWC 2012 Satellite Events : ESWC 2012 Satellite Events, Heraklion, Crete, Greece, May 27–31, 2012. Revised Selected Papers (S. 387–391). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Paulheim, H. (2015). What the Adoption of schema.org Tells About Linked Open Data. In , Joint Proceedings of the 5th International Workshop on Using the Web in the Age of Data (USEWOD '15) and the 2nd International Workshop on Dataset PROFIling and fEderated Search for Linked Data (PROFILES '15) ... 12th European Semantic Web Conference (S. 85–90). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Paulheim, H. and Gangemi, A. (2015). Serving DBpedia with DOLCE – more than just adding a cherry on top. In , The Semantic Web – ISWC 2015 : 14th International Semantic Web Conference, Bethlehem, PA, USA, October 11–15, 2015, Proceedings, Part I (S. 180–196). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Paulheim, H. and Meusel, R. (2015). A decomposition of the outlier detection problem into a set of supervised learning problems. Machine Learning, 100, 509–531.
- Paulheim, H., Schulz, A., Janssen, F., Ristoski, P. and Schweizer, I. (2015). Intelligente Datenauswertung mit Linked Open Data. In Corporate Semantic Web (S. 187–201). Berlin ; Heidelberg: Springer Vieweg.
- Ristoski, P. (2015). Towards Linked Open Data enabled data mining: strategies for feature generation, propositionalization, selection, and consolidation. In , The Semantic Web. Latest Advances and New Domains : 12th European Semantic Web Conference, ESWC 2015, Portoroz, Slovenia, May 31 – June 4, 2015. Proceedings (S. 772–782). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Ristoski, P., Bizer, C. and Paulheim, H. (2015). Mining the web of linked data with RapidMiner. Web Semantics, 35, 142–151.
- Ristoski, P. and Paulheim, H. (2015). Visual analysis of statistical data on maps using Linked Open Data. In , The Semantic Web: ESWC 2015 Satellite Events : ESWC 2015 Satellite Events, Portorož, Slovenia, May 31--June 4, 2015, Revised Selected Papers (S. 138–143). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Ristoski, P., Paulheim, H., Svátek, V. and Zeman, V. (2015). The Linked Data Mining Challenge 2015. In , Knowledge Discovery and Data Mining Meets Linked Open Data : Proceedings of the 4th Workshop on Knowledge Discovery and Data Mining Meets Linked Open Data co-located with 12th Extended Semantic Web Conference (ESWC 2015) Portoroz, Slovenia, May 31, 2015 (S. Paper 13). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Ristoski, P., Schuhmacher, M. and Paulheim, H. (2015). Using graph metrics for linked open data enabled recommender systems. In , E-Commerce and Web Technologies : 16th International Conference on Electronic Commerce and Web Technologies, EC-Web 2015, Valencia, Spain, September 2015, revised selected papers (S. 30–41). Lecture Notes in Business Information Processing : LNBIP, Springer: Berlin [u.a.].
- Ritze, D., Lehmberg, O. and Bizer, C. (2015). Matching HTML tables to DBpedia. In , Proceedings of the 5th International Conference on Web Intelligence, Mining and Semantics, WIMS 2015, Larnaca, Cyprus, July 13–15, 2015 (S. Paper 10, 1–6). , ACM: New York, NY.
- Schulz, A., Janssen, F., Ristoski, P. and Fürnkranz, J. (2015). Event-based clustering for reducing labeling costs of event-related microposts. In , ICWSM 15 : Proceedings of the 9th International AAAI Conference on Web and Social Media, Oxford, UK, May 26, 2015 – May 29, 2015 (S. 686–689). , AAAI Press: Palo Alto, Calif..
- Schulz, A., Ristoski, P., Fürnkranz, J. and Janssen, F. (2015). Event-based clustering for reducing labeling costs of incident-related microposts. In , MUD 2015 Mining Urban Data : Proceedings of the 2nd International Workshop on Mining Urban Data co-located with 32nd International Conference on Machine Learning (ICML 2015) Lille, France, July 11th, 2015 (S. 44–52). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Schäfer, B., Ristoski, P. and Paulheim, H. (2015). What is special about Bethlehem, Pennsylvania? Identifying unexpected facts about DBpedia entities. In , ISWC-P&D 2015 : Proceedings of the ISWC 2015 Posters & Demonstrations Track co-located with the 14th International Semantic Web Conference (ISWC-2015) Bethlehem, PA, USA, October 11, 2015 (S. Paper 46). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Xie, C., Ritze, D., Spahiu, B. and Hongming, C. (2015). Instance-based property matching in linked open data environment. In , OM 2015 : Proceedings of the 10th International Workshop on Ontology Matching collocated with the 14th International Semantic Web Conference (ISWC 2015) Bethlehem, PA, USA, October 12, 2015 (S. 222–223). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
2014
- Bizer, C., Heath, T., Auer, S. and Berners-Lee, T. (eds.) (2014). LDOW 2014 : Proceedings of the Workshop on Linked Data on the Web co-located with the 23rd International World Wide Web Conference (WWW 2014); Seoul, Korea, April 8, 2014. Aachen, Germany: RWTH Aachen.
- Bizer, C. (2014). Search Joins with the Web. In , Database Theory – ICDT 2014 : 17th International Conference on Database Theory, Athens, Greece, March 24–28, 2014; Proceedings (S. 3). , Univ. Konstanz, Univ. Library: Konstanz.
- Bizer, C. and Cyganiak, R. (2014). RDF 1.1 TriG – RDF Dataset Language – W3C Recommendation 25 February 2014.
- Bryl, V. and Bizer, C. (2014). Learning Conflict Resolution Strategies for Cross-language Wikipedia Data Fusion. In , 23rd International World Wide Web Conference, WWW '14, Seoul, Republic of Korea, April 7–11, 2014, Companion Volume (S. 1129-1134). , ACM: New York, NY.
- Bryl, V., Bizer, C., Isele, R., Verlic, M., Hong, S. G., Jang, S., Yi, M. Y. and Choi, K.-S. (2014). Interlinking and Knowledge Fusion. In Linked Open Data – Creating Knowledge Out of Interlinked Data : Results of the LOD2 Project (S. 70–89). Berlin [u.a.]: Springer.
- De Clercq, O., Hertling, S., Hoste, V., Ponzetto, S. P. and Paulheim, H. (2014). Identifying Disputed Topics in the News. In , LD4KG 2014 : Proceedings of the 1st Workshop on Linked Data for Knowledge Discovery co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2014); Nancy, France, Sept. 19th, 2014 (S. Paper 4). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- De Nies, T., Meusel, R., Ritze, D., Eckert, K., Dimou, A., De Vocht, L., Verborgh, R., Mannens, E. and Van de Walle, R. (2014). DEMO: A Lightweight Provenance Pingback and Query Service for Web Publications. In , IPAW 2014: 5th International Provenance and Annotation Workshop : June 9–13, 2014, Cologne, Germany (S. 1–6). A Lightweight Provenance Pingback and Query Service for Web Publications, UGent Institutional Repository: Gent.
- Dragisic, Z., Eckert, K., Euzenat, J., Faria, D., Ferrara, A., Granada, R., Ivanova, V., Jiménez-Ruiz, E., Kempf, A. O., Lambrix, P., Montanelli, S., Paulheim, H., Ritze, D., Shvaiko, P., Solimando, A., Trojahn, C., Zamazal, O. and Grau, B. C. (2014). Results of the Ontology Alignment Evaluation Initiative 2014. In , OM 2014 : Proceedings of the 9th International Workshop on Ontology Matching co-located with the 13th International Semantic Web Conference (ISWC 2014) ; Riva del Garda, Trentino, Italy, October 20, 2014 (S. 61–104). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Eckert, K., Ritze, D., Baierer, K. and Bizer, C. (2014). RESTful Open Workflows for Data Provenance and Reuse. In , 23rd International World Wide Web Conference, WWW '14, Seoul, Republic of Korea, April 7–11, 2014, Companion Volume (S. 259–260). Proceedings of the IW3C2 WWW 2014 Conference, ACM: New York, NY.
- Fleischhacker, D., Paulheim, H., Bryl, V., Völker, J. and Bizer, C. (2014). Detecting Errors in Numerical Linked Data Using Cross-Checked Outlier Detection. In , The Semantic Web – ISWC 2014 : 13th International Semantic Web Conference, Riva del Garda, Italy, October 19–23, 2014. Proceedings, Part I (S. 357–372). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Gabriel, A., Paulheim, H. and Janssen, F. (2014). Learning Semantically Coherent Rules. In , DMNLP 2014 : Proceedings of the 1st International Workshop on Interactions between Data Mining and Natural Language Processing co-located with the ECML PKDD 2014, Nancy, France, September 15, 2014 (S. 49–63). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Gentile, A. L. and Mazumdar, S. (2014). User driven information extraction with LODIE. In , ISWC-P&D 2014 : Proceedings of the ISWC 2014 Posters & Demonstrations Track a track within the 13th International Semantic Web Conference (ISWC 2014) Riva del Garda, Italy, October 21, 2014 (S. 385–388). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Kempf, A. O., Ritze, D., Eckert, K. and Zapilko, B. (2014). New Ways of Mapping Knowledge Organization Systems: Using a Semi-Automatic Matching Procedure for Building up Vocabulary Crosswalks. Knowledge Organization : KO, 41, 66–75.
- Lehmberg, O., Meusel, R. and Bizer, C. (2014). Graph structure in the Web – aggregated by Pay-Level Domain. In , WebSci 2014 : Proceedings of the 6th ACM Conference on Web Science, Bloomington, IND, USA, June 23 – 26, 2014 (S. 119–128). , ACM: New York, NY.
- Lehmberg, O., Ritze, D., Ristoski, P., Eckert, K., Paulheim, H. and Bizer, C. (2014). Extending Tables with Data from over a Million Websites.
- Meusel, R., Mika, P. and Blanco, R. (2014). Focused Crawling for Structured Data. In , CIKM 2014 : Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management (S. 1039-1048). , ACM: New York, NY.
- Meusel, R. and Paulheim, H. (2014). Linked Data for Information Extraction Challenge 2014 : Tasks and Results. In , LD4IE 2014 : Linked Data for Information Extraction : Proceedings of the Second International Workshop on Linked Data for Information Extraction (LD4IE 2014) co-located with the 13th International Semantic Web Conference (ISWC 2014) Riva del Garda, Italy (S. 3–8). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Meusel, R., Petrovski, P. and Bizer, C. (2014). The WebDataCommons Microdata, RDFa and Microformat Dataset Series. In , The Semantic Web – ISWC 2014 : 13th International Semantic Web Conference, Riva del Garda, Italy, October 19–23, 2014. Proceedings, Part I (S. 277–292). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Meusel, R., Vigna, S., Lehmberg, O. and Bizer, C. (2014). Graph Structure in the Web – Revisited. In , 23rd International World Wide Web Conference, WWW '14, Seoul, Republic of Korea, April 7–11, 2014, Companion Volume (S. 427–432). WWW WebSci '14, Internat. World Wide Web Conferences Steering Committee: Geneva, Switzerland.
- Paulheim, H. (2014). Identifying Wrong Links between Datasets by Multi-dimensional Outlier Detection.
In , WoDOOM 2014 : Debugging ontologies and ontology mappings : proceedings of the Third International Workshop on Debugging Ontologies and Ontology Mappings co-located with 11th Extended Semantic Web Conference, Anissaras/
Hersonissou, Greece, May 26, 2014 (S. 27–38). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany. - Paulheim, H. and Bizer, C. (2014). Improving the Quality of Linked Data Using Statistical Distributions. International Journal on Semantic Web and Information Systems : IJSWIS, 10, 63–86.
- Paulheim, H., Ristoski, P., Mitichkin, E. and Bizer, C. (2014). Data Mining with Background Knowledge from the Web. In , Proceedings of the 5th RapidMiner World (2014) (S. 1–14). , Shaker: Aachen.
- Petrovski, P., Bryl, V. and Bizer, C. (2014). Integrating Product Data from Websites offering Microdata Markup. In , 23rd International World Wide Web Conference, WWW '14, Seoul, Republic of Korea, April 7–11, 2014, Companion Volume (S. 1299-1304). , ACM: Geneva.
- Petrovski, P., Bryl, V. and Bizer, C. (2014). Learning regular expressions for the extraction of product attributes from E-commerce microdata. In , LD4IE 2014 : Proceedings of the Second International Workshop on Linked Data for Information Extraction (LD4IE 2014) co-located with the 13th International Semantic Web Conference (ISWC 2014), Riva del Garda, Italy, October 20, 2014 (S. 45–54). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Ristoski, P., Loza Mencía, E. and Paulheim, H. (2014). A Hybrid Multi-strategy Recommender System Using Linked Open Data. In Semantic Web Evaluation Challenge : SemWebEval 2014 at ESWC 2014, Anissaras, Crete, Greece, May 25–29, 2014, Revised Selected Papers (S. 150–156). Cham: Springer Internat. Publ.
- Ristoski, P. and Paulheim, H. (2014). A Comparison of Propositionalization Strategies for Creating Features from Linked Open Data. In , LD4KD 2014 : Proceedings of the 1st Workshop on Linked Data for Knowledge Discovery co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2014); Nancy, France, Sept. 19th, 2014 (S. Paper 1). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Ristoski, P. and Paulheim, H. (2014). Feature Selection in Hierarchical Feature Spaces. In , Discovery Science : 17th International Conference, DS 2014, Bled, Slovenia, October 8–10, 2014. Proceedings (S. 288–300). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Ritze, D. and Eckert, K. (2014). Data Enrichment in Discovery Systems Using Linked Data. In Data Analysis, Machine Learning and Knowledge Discovery (S. 455–462). Cham [u.a.]: Springer.
- Ritze, D., Zirn, C., Greenstreet, C., Eckert, K. and Ponzetto, S. P. (2014). Named Entities in Court: The MarineLives Corpus. In , Language Resources and Technologies for Processing and Linking Historical Documents and Archives – Deploying Linked Open Data in Cultural Heritage Workshop : associated with the LREC 2014 Conference, 26 – 30 May 2014, Reykjavik (S. 26–30). Language resources and technologies for processing and linking historical documents and archives- Deploying Linked Open Data in Cultural Heritage, LREC: Reykjavik.
- Schmachtenberg, M., Bizer, C. and Paulheim, H. (2014). Adoption of the Linked Data Best Practices in Different Topical Domains. In , The Semantic Web – ISWC 2014 : 13th International Semantic Web Conference, Riva del Garda, Italy, October 19–23, 2014. Proceedings, Part I (S. 245–260). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
- Schmachtenberg, M., Strufe, T. and Paulheim, H. (2014). Enhancing a Location-based Recommendation System by Enrichment with Structured Data from the Web. In , Proceedings of the 4th International Conference on Web Intelligence, Mining and Semantics (WIMS14) (S. Article No. 17,1–12). , ACM: New York, NY.
- Schäfer, B. (2014). Exploiting DBpedia for graph-based entity linking to Wikipedia. Thesis, . Mannheim
- Svátek, V., Mynarz, J. and Paulheim, H. (2014). The Linked Data Mining Challenge 2014: Results and experiences. In , Knowledge Discovery and Data Mining Meets Linked Open Data : Proceedings of the 3rd Workshop on Knowledge Discovery and Data Mining Meets Linked Open Data co-located with 11th Extended Semantic Web Conference (ESWC 2014) Crete, Greece, May 25, 2014 (S. Paper 6). CEUR Workshop Proceedings, RWTH Aachen: Aachen, Germany.
- Trojahn, C., Fu, B., Zamazal, O. and Ritze, D. (2014). State-of-the-Art in Multilingual and Cross-Lingual Ontology Matching. In Towards the Multilingual Semantic Web (S. 119–135). Berlin [u.a.]: Springer.
- Wienand, D. and Paulheim, H. (2014). Detecting Incorrect Numerical Data in DBpedia. In , The Semantic Web: Trends and Challenges : 11th International Conference, ESWC 2014, Anissaras, Crete, Greece, May 25–29, 2014. Proceedings (S. 504–518). Lecture Notes in Computer Science, Springer: Berlin [u.a.].