Photo credit: Anna Logue

Focus Group: Web Data Mining

(Prof. Paulheim)

The Web Data Mining group focuses on the curation, refinement, and use of Web-scale knowledge graphs.

Knowledge graphs provide general, cross-domain knowledge about the world in a machine interpretable form. In the Web Data Mining group, we contribute to open source knowledge graphs such as DBpedia by developing refinement operators, e.g., for completing missing information or identifying errors. Furthermore, we develop new knowledge graphs, such as WebIsALOD and DBkWik, which are designed to be complementary to existing ones, and methods for using those knowledge graphs in practical knowledge intensive tasks, such as the RapidMiner Linked Open Data Extension and RDF2vec.

People

External PhD Students

Former Members

  • André Melo
  • Dr. Anna Lisa Gentile
  • Dr. Petar Ristoski

Projects

Data and Software

Software

Datasets

Publications

  • Heist, N. and Paulheim, H. (2019). Uncovering the semantics of Wikipedia categories. In , ISWC 2019 : The 18th International Semantic Web Conference, Knowledge Graphs, Linked Data, Linked Schemas and AI on the Web : October 26 - 30, 2019 The University of Auckland, New Zealand (S. tba). , Springer: Cham.
  • Algergawy, A., Cheatham, M., Faria, D., Ferrara, A., Fundulaki, I., Harrow, I., Hertling, S., Jiménez-Ruiz, E., Karam, N., Khiat, A., Lambrix, P., Li, H., Montanelli, S., Paulheim, H., Pesquita, C., Saveta, T., Schmidt, D., Shvaiko, P., Splendiani, A., Thiéblin, E., Trojahn, C., Vataščinová, J., Zamazal, O. and Zhou, L. (2018). Results of the Ontology Alignment Evaluation Initiative 2018. In Shvaiko, P., OM 2018 : Proceedings of the 13th International Workshop on Ontology Matching co-located with the 17th International Semantic Web Conference (ISWC 2018) Monterey, CA, USA, October 8, 2018 (S. 76-116). CEUR Workshop Proceedings, RWTH: Aachen.
  • Heist, N. (2018). Towards knowledge graph construction from entity co-occurrence. In Hollink, L., EKAW-DC 2018 : Proceedings of the EKAW Doctoral Consortium 2018 co-located with the 21st International Conference on Knowledge Engineering and Knowledge Management (EKAW 2018) Nancy, France, November 13, 2018 (S. 1-8). CEUR Workshop Proceedings, RWTH: Aachen.
  • Helmstetter, S. and Paulheim, H. (2018). Weakly supervised learning for fake news detection on Twitter. In Day, M., ASONAM 2018 : 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, Barcelona, Spain, 28-31 August, 2018 (S. 274-277). , IEEE Computer Society: Washington, DC.
  • Hertling, S. and Paulheim, H. (2018). DBkWik: A consolidated knowledge graph from thousands of Wikis. In Wu, X., 9th IEEE International Conference on Big Knowledge, ICBK 2018, Singapore, November 17-18, 2018 : proceedings (S. 17-24). , IEEE Computer Society: Piscataway, NJ [u.a.].
  • Hertling, S. and Paulheim, H. (2018). DOME results for OAEI 2018. In Shvaiko, P., OM 2018 : Proceedings of the 13th International Workshop on Ontology Matching co-located with the 17th International Semantic Web Conference (ISWC 2018) Monterey, CA, USA, October 8, 2018 (S. 144-151). CEUR Workshop Proceedings, RWTH: Aachen.
  • Hertling, S. and Paulheim, H. (2018). Provision and usage of provenance data in the WebIsALOD Knowledge Graph. In Capadisli, S., CKGSemStats 2018 : Joint Proceedings of the International Workshops on Contextualized Knowledge Graphs, and Semantic Statistics co-located with 17th International Semantic Web Conference (ISWC 2018), Monterey, USA, October 8th, 2018 (S. Article 6). CEUR Workshop Proceedings, RWTH: Aachen.
  • Jiménez-Ruiz, E., Saveta, T., Zamazal, O., Hertling, S., Röder, M., Fundulaki, I., Ngonga Ngomo, A.-C., Sherif, M. A., Annane, A., Bellahsene, Z., Ben Yahia, S., Diallo, G., Faria, D., Kachroudi, M., Khiat, A., Lambrix, P., Li, H., Mackeprang, M., Mohammadi, M., Rybinski, M., Balasubramani, B. S. and Trojahn, C. (2018). Introducing the HOBBIT platform into the ontology alignment evaluation campaign. In Shvaiko, P., OM 2018 : Proceedings of the 13th International Workshop on Ontology Matching co-located with the 17th International Semantic Web Conference (ISWC 2018) Monterey, CA, USA, October 8, 2018 (S. 49-60). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. (2018). How much is a triple? Estimating the cost of knowledge graph creation. In Erp, M., ISWC-P&D-Industry-BlueSky 2018 : Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks co-located with 17th International Semantic Web Conference (ISWC 2018) Monterey, USA, October 8th to 12th, 2018 (S. Paper 10). , RWTH: Aachen.
  • Paulheim, H. (2018). Machine learning with and for semantic web knowledge graphs. In d'Amato, C., Reasoning Web: Learning, Uncertainty, Streaming, and Scalability : 14th International Summer School 2018 Esch-sur-Alzette, Luxembourg, September 22 – 26, 2018 Tutorial Lectures (S. 110-141). Lecture Notes in Computer Science, Springer: Cham.
  • Paulheim, H. (2018). Make embeddings semantic again!. In Erp, M., ISWC-P&D-Industry-BlueSky 2018 : Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks co-located with 17th International Semantic Web Conference (ISWC 2018) Monterey, USA, October 8th to 12th, 2018 (S. 4). CEUR Workshop Proceedings, RWTH: Aachen.
  • Portisch, J. and Paulheim, H. (2018). ALOD2Vec matcher. In Shvaiko, P., OM 2018 : Proceedings of the 13th International Workshop on Ontology Matching co-located with the 17th International Semantic Web Conference (ISWC 2018) Monterey, CA, USA, October 8, 2018 (S. 132-137). CEUR Workshop Proceedings, RWTH: Aachen.
  • Rico, M., Mihindukulasooriya, N., Kontokostas, D., Paulheim, H., Hellmann, S. and Gómez-Pérez, A. (2018). Predicting incorrect mappings : a data-driven approach applied to DBpedia. In , SAC '18 : the 33rd ACM/SIGAPP Symposium On Applied Computing, Pau, France, April 9 - 13, 2018, proceedings (S. 323-330). , ACM: New York, NY.
  • Cochez, M., Ristoski, P., Ponzetto, S. P. and Paulheim, H. (2017). Biased graph walks for RDF graph embeddings. In Akerkar, R., WIMS '17 Proceedings of the 7th International Conference on Web Intelligence, Mining and Semantics : Amantea, Italy, June 19 - 22, 2017 (S. Article 21). , ACM: New York, NY.
  • Cochez, M., Ristoski, P., Ponzetto, S. P. and Paulheim, H. (2017). Global RDF vector space embeddings. In d'Amato, C., The Semantic Web – ISWC 2017 : 16th International Semantic Web Conference, Vienna, Austria, October 21–25, 2017, proceedings, part I (S. 190-207). Lecture Notes in Computer Science, Springer: Cham.
  • Gentile, A. L., Ristoski, P., Eckel, S., Ritze, D. and Paulheim, H. (2017). Entity matching on web tables: a table embeddings approach for blocking. In Markl, V., Advances in Database Technology - EDBT 2017 : 20th International Conference on Extending Database Technology, Venice, Italy, March 21–24, 2017, Proceedings (S. 510-513). , OpenProceedings: Konstanz.
  • Heist, N. and Paulheim, H. (2017). Language-agnostic relation extraction from Wikipedia abstracts. In d'Amato, C., The Semantic Web – ISWC 2017 : 16th International Semantic Web Conference, Vienna, Austria, October 21–25, 2017, proceedings, part I (S. 383-399). Lecture Notes in Computer Science, Springer International Publishing: Cham.
  • Hertling, S. and Paulheim, H. (2017). WebIsALOD: providing hypernymy relations extracted from the web as linked open data. In d'Amato, C., The Semantic Web – ISWC 2017 : 16th International Semantic Web Conference, Vienna, Austria, October 21-25, 2017, proceedings, part II (S. 111-119). Lecture Notes in Computer Science, Springer: Cham.
  • Hofmann, A., Perchani, S., Portisch, J., Hertling, S. and Paulheim, H. (2017). DBkWik: towards knowledge graph creation from thousands of wikis. In Nikitina, N., ISWC-P&D-Industry 2017 : Proceedings of the ISWC 2017 Posters & Demonstrations and Industry Tracks co-located with 16th International Semantic Web Conference (ISWC 2017) Vienna, Austria, October 23rd to 25th, 2017 (S. Paper 540). CEUR Workshop Proceedings, RWTH: Aachen.
  • Krstanovic, S. and Paulheim, H. (2017). Ensembles of recurrent neural networks for robust time series forecasting. In Bramer, M., Artificial Intelligence XXXIV : 37th SGAI International Conference on Artificial Intelligence, AI 2017, Cambridge, UK, December 12-14, 2017, proceedings (S. 34-46). Lecture Notes in Computer Science, Springer: Cham.
  • Meilicke, C., Ruffinelli, D., Nolle, A., Paulheim, H. and Stuckenschmidt, H. (2017). Fast ABox consistency checking using incomplete reasoning and caching. In Costantini, S., Rules and Reasoning : International Joint Conference : RuleML+RR 2017, London, UK, July 12-15, 2017, Proceedings (S. 168-183). Lecture Notes in Computer Science, Springer International Publishing: Cham.
  • Melo, A. and Paulheim, H. (2017). An approach to correction of erroneous links in knowledge graphs. In Tiddi, I., K-CAPSAT-2017 : Proceedings of Workshops and Tutorials of the 9th International Conference on Knowledge Capture (K-CAP2017) Austin, Texas, December 4th, 2017 (S. 54-57). CEUR Workshop Proceedings, RWTH: Aachen.
  • Melo, A. and Paulheim, H. (2017). Detection of relation assertion errors in knowledge graphs. In Corcho, O., Proceedings of the Knowledge Capture Conference, K-CAP 2017, Austin, TX, USA, December 4-6, 2017 (S. Article 22,1-8). , ACM: New York, NY, USA.
  • Paulheim, H. (2017). A robust number parser based on conditional random fields. In Kern-Isberner, G., KI 2017: Advances in Artificial Intelligence : 40th Annual German Conference on AI, Dortmund, Germany, September 25–29, 2017, proceedings (S. 337-343). Lecture Notes in Computer Science, Springer: Cham.
  • Paulheim, H. (2017). Data-driven joint debugging of the DBpedia mappings and ontology. In Blomqvist, E., The Semantic Web : 14th International Conference, ESWC 2017, Portorož, Slovenia, May 28 - June 1, 2017, Proceedings, Part I (S. 404-418). Lecture Notes in Computer Science, Springer: Cham.
  • Paulheim, H. (2017). Towards profiling knowledge graphs. In Demidova, E., Profiles 2017 : Proceedings of the 4th International Workshop on Dataset PROFIling and fEderated Search for Web Data (PROFILES 2017) co-located with The 16th International Semantic Web Conference (ISWC 2017) Vienna, Austria, October 22, 2017 (S. Paper 1). CEUR-WS, CEUR Workshop Proceedings: Aachen.
  • Ringler, D. and Paulheim, H. (2017). One knowledge graph to rule them all? Analyzing the differences between DBpedia, YAGO, Wikidata & co.. In Kern-Isberner, G., KI 2017: Advances in Artificial Intelligence : 40th Annual German Conference on AI, Dortmund, Germany, September 25–29, 2017, proceedings (S. 366-372). Lecture Notes in Computer Science, Springer: Cham.
  • Ristoski, P., Faralli, S., Ponzetto, S. P. and Paulheim, H. (2017). Large-scale taxonomy induction using entity and word embeddings. In Sheth, A., WI 2017 : proceedings of the International Conference on Web Intelligence, Leipzig, Germany, August 23-26, 2017 (S. 81-87). , ACM: New York, NY.
  • Bryl, V., Bizer, C. and Paulheim, H. (2016). Gathering alternative surface forms for DBpedia entities. In Paulheim, H., NLP & DBpedia 2015 : Proceedings of the Third NLP&DBpedia Workshop (NLP & DBpedia 2015) co-located with the 14th International Semantic Web Conference 2015 (ISWC 2015) Bethlehem, Pennsylvania, USA, October 11, 2015 (S. 13-24). CEUR Workshop Proceedings, RWTH: Aachen.
  • van Erp, M., Mendes, P. N., Paulheim, H., Ilievski, F., Plu, J., Rizzo, G. and Waitelonis, J. (2016). Evaluating entity linking: an analysis of current benchmark datasets and a roadmap for doing a better job. In Calzolari, N., Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) : May 23-28, 2016, Portorož, Slovenia (S. 4373-4379). , European Language Resources Association, ELRA-ELDA: Paris.
  • Gentile, A. L., Kirstein, S., Paulheim, H. and Bizer, C. (2016). Extending RapidMiner with data search and integration capabilities. In Sack, H., The Semantic Web : ESWC 2016 Satellite Events, Heraklion, Crete, Greece, May 29 – June 2, 2016, Revised Selected Papers (S. 161-171). Lecture Notes in Computer Science, Springer: Cham.
  • Huelss, J. and Paulheim, H. (2016). What SPARQL query logs tell and do not tell about semantic relatedness in LOD : or: the unsuccessful attempt to improve the browsing experience of DBpedia by exploiting query logs. In Gandon, F., The Semantic Web: ESWC 2015 Satellite Events : ESWC 2015 Satellite Events Portorož, Slovenia, May 31 – June 4, 2015, Revised Selected Papers (S. 297-308). Lecture Notes in Computer Science, Springer: Cham.
  • Melo, A., Paulheim, H. and Völker, J. (2016). Type prediction in RDF knowledge bases using hierarchical multilabel classification. In Akerkar, R., Proceedings of the 6th International Conference on Web Intelligence, Mining and Semantics, WIMS 2016, Nîmes, France, June 13-15, 2016 (S. Article 14, 1-10). , ACM: New York, NY.
  • Müller, A. C. and Paulheim, H. (2016). Towards combining ontology matchers via anomaly detection. In Shvaiko, P., OM 2015 : Proceedings of the 10th International Workshop on Ontology Matching collocated with the 14th International Semantic Web Conference (ISWC 2015) Bethlehem, PA, USA, October 12, 2015 (S. 40-44). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. and Stuckenschmidt, H. (2016). Fast approximate A-box consistency checking using machine learning. In Sack, H., The Semantic Web. Latest Advances and New Domains : 13th International Conference, ESWC 2016, Heraklion, Crete, Greece, May 29 - June 2, 2016, Proceedings (S. 135-150). Lecture Notes in Computer Science, Springer International Publishing: Cham.
  • Paulheim, H. and Unger, C. (2016). Can predicate lexicalizations help in named entity disambiguation? In Paulheim, H., NLP & DBpedia 2015 : Proceedings of the Third NLP&DBpedia Workshop (NLP & DBpedia 2015) co-located with the 14th International Semantic Web Conference 2015 (ISWC 2015) Bethlehem, Pennsylvania, USA, October 11, 2015 (S. 92-97). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P. and Paulheim, H. (2016). Analyzing statistics with background knowledge from Linked Open Data. In Capadisli, S., SemStats 2013 : Proceedings of the 1st International Workshop on Semantic Statistics co-located with 13th International Semantic Web Conference (ISWC 2013), Sydney, Australia, October 11th, 2013 (S. Article 12). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P. and Paulheim, H. (2016). RDF2Vec: RDF graph embeddings for data mining. In Groth, P., The Semantic Web - ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17-21, 2016, Proceedings, Part I (S. 498-514). Lecture Notes in Computer Science, Springer International Publishing: Cham.
  • Ristoski, P., Paulheim, H., Svatek, V. and Zeman, V. (2016). The Linked Data Mining Challenge 2016. In Paulheim, H., Know@LOD&CoDeS 2016 : Joint Proceedings of the 5th Workshop on Data Mining and Knowledge Discovery meets Linked Open Data and 1st International Workshop on Completing and Debugging the Semantic Web ...with 13th ESWC 2016, Heraklion, Greece, May 30th 2016 (S. 1-8). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P., de Vries, G. and Paulheim, H. (2016). A collection of benchmark datasets for systematic evaluations of machine learning on the Semantic Web. In Groth, P., The Semantic Web - ISWC 2016 : 15th International Semantic Web Conference, Kobe, Japan, October 17-21, 2016, Proceedings, Part II (S. 186-194). Lecture Notes in Computer Science, Springer: Cham.
  • Rosati, J., Ristoski, P., Di Noia, T., de Leone, R. and Paulheim, H. (2016). RDF graph embeddings for content-based recommender systems. In Bogers, T., CBRecSys 2016 : Proceedings of the 3rd Workshop on New Trends in Content-Based Recommender Systems co-located with ACM Conference on Recommender Systems (RecSys 2016) Boston, MA, USA, September 16, 2016 (S. 23-30). CEUR Workshop Proceedings, RWTH: Aachen.
  • Seitner, J., Bizer, C., Eckert, K., Faralli, S., Meusel, R., Paulheim, H. and Ponzetto, S. P. (2016). A large DataBase of hypernymy relations extracted from the Web. In Calzolari, N., Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016) : May 23-28, 2016, Portorož, Slovenia (S. 360-367). , European Language Resources Association, ELRA-ELDA: Paris.
  • Meusel, R., Bizer, C. and Paulheim, H. (2015). A web-scale study of the adoption and evolution of the schema.org vocabulary over time. In Akerkar, R., Proceedings of the 5th International Conference on Web Intelligence, Mining and Semantics, WIMS 2015, Larnaca, Cyprus, July 13-15, 2015 (S. Article 15, 1-11). , ACM: New York, NY.
  • Meusel, R. and Paulheim, H. (2015). Creating large-scale training and test corpora for extracting structured data from the web. In Gentile, A., Linked Data for Information Extraction : Proceedings of the Third International Workshop on Linked Data for Information Extraction (LD4IE2015) co-loc. with the 14th International Semantic Web Conference (ISWC 2015) ; Bethlehem, PA, USA, Oct. 12, 2015 (S. 2-6). CEUR Workshop Proceedings, RWTH: Aachen.
  • Meusel, R. and Paulheim, H. (2015). Heuristics for fixing common errors in deployed schema.org microdata. In Gandon, F., The Semantic Web: Research and Applications : 12th International Conference, ESWC 2015, Portoroz, Slovenia, May 30 - June 4, 2015. Proceedings (S. 152-168). Lecture Notes in Computer Science, Springer International Publishing: Cham.
  • Meusel, R., Primpeli, A., Meilicke, C., Paulheim, H. and Bizer, C. (2015). Exploiting microdata annotations to consistently categorize product offers at web scale. In Stuckenschmidt, H., E-Commerce and Web Technologies : 16th International Conference on Electronic Commerce and Web Technologies, EC-Web 2015, Valencia, Spain, September 2015, revised selected papers (S. 83-99). Lecture Notes in Business Information Processing, Springer International Publishing : Cham.
  • Meusel, R., Spahiu, B., Bizer, C. and Paulheim, H. (2015). Towards automatic topical classification of LOD datasets. In Bizer, C., LDOW 2015 : Proceedings of the Workshop on Linked Data on the Web ; co-located with the 24th International World Wide Web Conference (WWW 2015) ; Florence, Italy, May 19th, 2015 (S. Paper 03). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. (2015). Nobody wants to live in a cold city where no music has been recorded: analyzing statistics with Explain-a-LOD. In Simperl, E., The Semantic Web: ESWC 2012 Satellite Events : ESWC 2012 Satellite Events, Heraklion, Crete, Greece, May 27-31, 2012. Revised Selected Papers (S. 387-391). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
  • Paulheim, H. (2015). What the Adoption of schema.org Tells About Linked Open Data. In Berendt, B., Joint Proceedings of the 5th International Workshop on Using the Web in the Age of Data (USEWOD '15) and the 2nd International Workshop on Dataset PROFIling and fEderated Search for Linked Data (PROFILES '15) ... 12th European Semantic Web Conference (S. 85-90). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. and Gangemi, A. (2015). Serving DBpedia with DOLCE - more than just adding a cherry on top. In Arenas, M., The Semantic Web - ISWC 2015 : 14th International Semantic Web Conference, Bethlehem, PA, USA, October 11-15, 2015, Proceedings, Part I (S. 180-196). Lecture Notes in Computer Science, Springer: Cham [u.a.].
  • Ristoski, P. and Paulheim, H. (2015). Visual analysis of statistical data on maps using Linked Open Data. In Gandon, F., The Semantic Web: ESWC 2015 Satellite Events : ESWC 2015 Satellite Events, Portorož, Slovenia, May 31--June 4, 2015, Revised Selected Papers (S. 138-143). Lecture Notes in Computer Science, Springer: Cham.
  • Ristoski, P., Paulheim, H., Svatek, V. and Zeman, V. (2015). The Linked Data Mining Challenge 2015. In Völker, J., Knowledge Discovery and Data Mining Meets Linked Open Data : Proceedings of the 4th Workshop on Knowledge Discovery and Data Mining Meets Linked Open Data co-located with 12th Extended Semantic Web Conference (ESWC 2015) Portoroz, Slovenia, May 31, 2015 (S. Paper 13). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P., Schuhmacher, M. and Paulheim, H. (2015). Using graph metrics for linked open data enabled recommender systems. In Stuckenschmidt, H., E-Commerce and Web Technologies : 16th International Conference on Electronic Commerce and Web Technologies, EC-Web 2015, Valencia, Spain, September 2015, revised selected papers (S. 30-41). Lecture Notes in Business Information Processing, Springer International Publishing: Cham.
  • Schäfer, B., Ristoski, P. and Paulheim, H. (2015). What is special about Bethlehem, Pennsylvania? Identifying unexpected facts about DBpedia entities. In Villata, S., ISWC-P&D 2015 : Proceedings of the ISWC 2015 Posters & Demonstrations Track co-located with the 14th International Semantic Web Conference (ISWC-2015) Bethlehem, PA, USA, October 11, 2015 (S. Paper 46). CEUR Workshop Proceedings, RWTH: Aachen.
  • De Clercq, O., Hertling, S., Hoste, V., Ponzetto, S. P. and Paulheim, H. (2014). Identifying Disputed Topics in the News. In Tiddi, I., LD4KG 2014 : Proceedings of the 1st Workshop on Linked Data for Knowledge Discovery co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2014); Nancy, France, Sept. 19th, 2014 (S. Paper 4). CEUR Workshop Proceedings, RWTH: Aachen.
  • Dragisic, Z., Eckert, K., Euzenat, J., Faria, D., Ferrara, A., Granada, R., Ivanova, V., Jiménez-Ruiz, E., Kempf, A. O., Lambrix, P., Montanelli, S., Paulheim, H., Ritze, D., Shvaiko, P., Solimando, A., Trojahn, C., Zamazal, O. and Grau, B. C. (2014). Results of the Ontology Alignment Evaluation Initiative 2014. In Shvaiko, P., OM 2014 : Proceedings of the 9th International Workshop on Ontology Matching co-located with the 13th International Semantic Web Conference (ISWC 2014) ; Riva del Garda, Trentino, Italy, October 20, 2014 (S. 61-104). CEUR Workshop Proceedings, RWTH: Aachen.
  • Fleischhacker, D., Paulheim, H., Bryl, V., Völker, J. and Bizer, C. (2014). Detecting Errors in Numerical Linked Data Using Cross-Checked Outlier Detection. In , The Semantic Web – ISWC 2014 : 13th International Semantic Web Conference, Riva del Garda, Italy, October 19-23, 2014. Proceedings, Part I (S. 357-372). Lecture Notes in Computer Science, Springer Internat. Publ.: Cham.
  • Gabriel, A., Paulheim, H. and Janssen, F. (2014). Learning Semantically Coherent Rules. In Cellier, P., DMNLP 2014 : Proceedings of the 1st International Workshop on Interactions between Data Mining and Natural Language Processing co-located with the ECML PKDD 2014, Nancy, France, September 15, 2014 (S. 49-63). CEUR Workshop Proceedings, RWTH: Aachen.
  • Meusel, R. and Paulheim, H. (2014). Linked Data for Information Extraction Challenge 2014 : Tasks and Results. In Gentile, A., LD4IE 2014 : Linked Data for Information Extraction : Proceedings of the Second International Workshop on Linked Data for Information Extraction (LD4IE 2014) co-located with the 13th International Semantic Web Conference (ISWC 2014) Riva del Garda, Italy (S. 3-8). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. (2014). Identifying Wrong Links between Datasets by Multi-dimensional Outlier Detection. In Lambrix, P., WoDOOM 2014 : Debugging ontologies and ontology mappings : proceedings of the Third International Workshop on Debugging Ontologies and Ontology Mappings co-located with 11th Extended Semantic Web Conference, Anissaras/Hersonissou, Greece, May 26, 2014 (S. 27-38). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H., Ristoski, P., Mitichkin, E. and Bizer, C. (2014). Data Mining with Background Knowledge from the Web. In Fischer, S., Proceedings of the 5th RapidMiner World (2014) (S. 1-14). , Shaker: Aachen.
  • Ristoski, P. and Paulheim, H. (2014). A Comparison of Propositionalization Strategies for Creating Features from Linked Open Data. In Tiddi, I., LD4KD 2014 : Proceedings of the 1st Workshop on Linked Data for Knowledge Discovery co-located with European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD 2014); Nancy, France, Sept. 19th, 2014 (S. Paper 1). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ristoski, P. and Paulheim, H. (2014). Feature Selection in Hierarchical Feature Spaces. In Dzeroski, S., Discovery Science : 17th International Conference, DS 2014, Bled, Slovenia, October 8-10, 2014. Proceedings (S. 288-300). Lecture Notes in Computer Science, Springer Internat. Publ.: Cham.
  • Schmachtenberg, M., Bizer, C. and Paulheim, H. (2014). Adoption of the Linked Data Best Practices in Different Topical Domains. In Mika, P., The Semantic Web – ISWC 2014 : 13th International Semantic Web Conference, Riva del Garda, Italy, October 19-23, 2014. Proceedings, Part I (S. 245-260). Lecture Notes in Computer Science, Springer Internat. Publ.: Cham.
  • Schmachtenberg, M., Strufe, T. and Paulheim, H. (2014). Enhancing a Location-based Recommendation System by Enrichment with Structured Data from the Web. In Akerkar, R., Proceedings of the 4th International Conference on Web Intelligence, Mining and Semantics (WIMS14) (S. Article No. 17). , ACM: New York, NY.
  • Svatek, V., Mynarz, J. and Paulheim, H. (2014). The Linked Data Mining Challenge 2014: Results and experiences. In Völker, J., Knowledge Discovery and Data Mining Meets Linked Open Data : Proceedings of the 3rd Workshop on Knowledge Discovery and Data Mining Meets Linked Open Data co-located with 11th Extended Semantic Web Conference (ESWC 2014) Crete, Greece, May 25, 2014 (S. Paper 6). CEUR Workshop Proceedings, RWTH: Aachen.
  • Wienand, D. and Paulheim, H. (2014). Detecting Incorrect Numerical Data in DBpedia. In Presutti, V., The Semantic Web: Trends and Challenges : 11th International Conference, ESWC 2014, Anissaras, Crete, Greece, May 25-29, 2014. Proceedings (S. 504-518). Lecture Notes in Computer Science, Springer: Cham ; Heidelberg [u.a.].
  • Cucena Grau, B., Dragisic, Z., Eckert, K., Euzenat, J., Ferrara, A., Granada, R., Ivanova, V., Jiménez-Ruiz, E., Kempf, A. O., Lambrix, P., Nikolov, A., Paulheim, H., Ritze, D., Scharffe, F., Shvaiko, P., Trojahn, C. and Zamazal, O. (2013). Results of the Ontology Alignment Evaluation Initiative 2013. In Shvaiko, P., Proceedings of the 8th International Workshop on Ontology Matching co-located with the 12th International Semantic Web Conference (ISWC 2013) Sydney, Australia, October 21, 2013 (S. 61-100). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. (2013). DBpediaNYD - A Silver Standard Benchmark Dataset for Semantic Relatedness in DBpedia. In Hellmann, S., NLP-DBPEDIA 2013 : Proceedings of the NLP & DBpedia workshop co-located with the 12th International Semantic Web Conference (ISWC 2013) Sydney, Australia, October 22, 2013 (S. 80-84). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. (2013). Exploiting Linked Open Data as Background Knowledge in Data Mining. In D'Amato, C., DMoLD 2013 : Proceedings of the International Workshop on Data Mining on Linked Data, with Linked Data Mining Challenge collocated with the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD) (S. 1-10). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. and Bizer, C. (2013). Type Inference on Noisy RDF Data. In Alani, H., The Semantic Web - ISWC 2013 : 12th International Semantic Web Conference, Sydney, NSW, Australia, October 21-25, 2013, Proceedings, Part I (S. 510-525). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
  • Paulheim, H. and Hertling, S. (2013). Discoverability of SPARQL Endpoints in Linked Open Data. In Blomqvist, E., Proceedings of the ISWC 2013 Posters & Demonstrations Track : track within the 12th International Semantic Web Conference (ISWC 2013) (S. 245-248). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. and Hertling, S. (2013). WeSeE-Match results for OAEI 2013. In Shvaiko, P., Proceedings of the 8th International Workshop on Ontology Matching co-located with the 12th International Semantic Web Conference (ISWC 2013) Sydney, Australia, October 21, 2013 (S. 197-202). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H., Hertling, S. and Ritze, D. (2013). Towards Evaluating Interactive Ontology Matching Tools. In Cimiano, P., The Semantic Web : semantics and big data ; 10th International Conference ; proceedings / ESWC 2013 (S. 31-45). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
  • Paulheim, H. and Ponzetto, S. P. (2013). Extending DBpedia with Wikipedia List Pages. In Hellmann, S., NLP-DBPEDIA 2013 : Proceedings of the NLP & DBpedia workshop co-located with the 12th International Semantic Web Conference (ISWC 2013) Sydney, Australia, October 22, 2013 (S. 1-6). CEUR Workshop Proceedings, RWTH: Aachen.
  • Ritze, D., Paulheim, H. and Eckert, K. (2013). Evaluation measures for ontology matchers in supervised matching scenarios. In Alani, H., The Semantic Web – ISWC 2013 : 12th International Semantic Web Conference, Sydney, NSW, Australia, October 21-25, 2013, Proceedings, Part II (S. 392-407). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
  • Schulz, A., Hadjakos, A., Paulheim, H., Nachtwey, J. and Mühlhäuser, M. (2013). A Multi-Indicator Approach for Geolocalization of Tweets. In Kiciman, E., Proceedings of the Seventh International AAAI Conference on Weblogs and Social Media, Cambridge, Massachusetts, USA, July 8–11, 2013 (S. [1-10]). , AAAI Press: Palo Alto, Calif..
  • Schulz, A., Ristoski, P. and Paulheim, H. (2013). I See a Car Crash: Real-time Detection of Small Scale Incidents in Microblogs. In Cimiano, P., The Semantic Web: ESWC 2013 Satellite Events : ESWC 2013, Satellite Events, Montpellier, France, May 26-30, 2013, Revised Selected Papers (S. 22-33). Lecture Notes in Computer Science, Springer: Berlin [u.a.].
  • Schulz, A., Thanh, T. D., Paulheim, H. and Schweizer, I. (2013). A Fine-Grained Sentiment Analysis Approach for Detecting Crisis Related Microposts. In Comes, T., Komplexe Notsituationen schnell meistern - Die ISCRAM Konferenz 2013 zum Krisenmangement (S. 846-851, ID 249). , KIT-Bibliothek Süd: Karlsruhe.