Photo credit: Anna Logue

Focus Group: Web Data Mining

(Prof. Paulheim)

The Web Data Mining group focuses on the curation, refinement, and use of Web-scale knowledge graphs.

Knowledge graphs provide general, cross-domain knowledge about the world in a machine interpretable form. In the Web Data Mining group, we contribute to open source knowledge graphs such as DBpedia by developing refinement operators, e.g., for completing missing information or identifying errors. Furthermore, we develop new knowledge graphs, such as WebIsALOD and DBkWik, which are designed to be complementary to existing ones, and methods for using those knowledge graphs in practical knowledge intensive tasks, such as the RapidMiner Linked Open Data Extension and RDF2vec.

People

External PhD Students

Former Members

  • André Melo
  • Dr. Anna Lisa Gentile
  • Dr. Petar Ristoski

Projects

Data and Software

Software

Datasets

Publications

  • Algergawy, A., Cheatham, M., Faria, D., Ferrara, A., Fundulaki, I., Harrow, I., Hertling, S., Jiménez-Ruiz, E., Karam, N., Khiat, A., Lambrix, P., Li, H., Montanelli, S., Paulheim, H., Pesquita, C., Saveta, T., Schmidt, D., Shvaiko, P., Splendiani, A., Thiéblin, E., Trojahn, C., Vataščinová, J., Zamazal, O. and Zhou, L. (2018). Results of the Ontology Alignment Evaluation Initiative 2018. In Shvaiko, P., OM 2018 : Proceedings of the 13th International Workshop on Ontology Matching co-located with the 17th International Semantic Web Conference (ISWC 2018) Monterey, CA, USA, October 8, 2018 (S. 76-116). CEUR Workshop Proceedings, RWTH: Aachen.
  • Heist, N. (2018). Towards knowledge graph construction from entity co-occurrence. In Hollink, L., EKAW-DC 2018 : Proceedings of the EKAW Doctoral Consortium 2018 co-located with the 21st International Conference on Knowledge Engineering and Knowledge Management (EKAW 2018) Nancy, France, November 13, 2018 (S. 1-8). CEUR Workshop Proceedings, RWTH: Aachen.
  • Helmstetter, S. and Paulheim, H. (2018). Weakly supervised learning for fake news detection on Twitter. In Day, M., ASONAM 2018 : 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, Barcelona, Spain, 28-31 August, 2018 (S. 274-277). , IEEE Computer Society: Washington, DC.
  • Hertling, S. and Paulheim, H. (2018). DBkWik: A consolidated knowledge graph from thousands of Wikis. In Wu, X., 9th IEEE International Conference on Big Knowledge, ICBK 2018, Singapore, November 17-18, 2018 : proceedings (S. 17-24). , IEEE Computer Society: Piscataway, NJ [u.a.].
  • Hertling, S. and Paulheim, H. (2018). DOME results for OAEI 2018. In Shvaiko, P., OM 2018 : Proceedings of the 13th International Workshop on Ontology Matching co-located with the 17th International Semantic Web Conference (ISWC 2018) Monterey, CA, USA, October 8, 2018 (S. 144-151). CEUR Workshop Proceedings, RWTH: Aachen.
  • Hertling, S. and Paulheim, H. (2018). Provision and usage of provenance data in the WebIsALOD Knowledge Graph. In Capadisli, S., CKGSemStats 2018 : Joint Proceedings of the International Workshops on Contextualized Knowledge Graphs, and Semantic Statistics co-located with 17th International Semantic Web Conference (ISWC 2018), Monterey, USA, October 8th, 2018 (S. Article 6). CEUR Workshop Proceedings, RWTH: Aachen.
  • Jiménez-Ruiz, E., Saveta, T., Zamazal, O., Hertling, S., Röder, M., Fundulaki, I., Ngonga Ngomo, A.-C., Sherif, M. A., Annane, A., Bellahsene, Z., Ben Yahia, S., Diallo, G., Faria, D., Kachroudi, M., Khiat, A., Lambrix, P., Li, H., Mackeprang, M., Mohammadi, M., Rybinski, M., Balasubramani, B. S. and Trojahn, C. (2018). Introducing the HOBBIT platform into the ontology alignment evaluation campaign. In Shvaiko, P., OM 2018 : Proceedings of the 13th International Workshop on Ontology Matching co-located with the 17th International Semantic Web Conference (ISWC 2018) Monterey, CA, USA, October 8, 2018 (S. 49-60). CEUR Workshop Proceedings, RWTH: Aachen.
  • Paulheim, H. (2018). How much is a triple? Estimating the cost of knowledge graph creation. In Erp, M., ISWC-P&D-Industry-BlueSky 2018 : Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks co-located with 17th International Semantic Web Conference (ISWC 2018) Monterey, USA, October 8th to 12th, 2018 (S. Paper 10). , RWTH: Aachen.
  • Paulheim, H. (2018). Machine learning with and for semantic web knowledge graphs. In d'Amato, C., Reasoning Web: Learning, Uncertainty, Streaming, and Scalability : 14th International Summer School 2018 Esch-sur-Alzette, Luxembourg, September 22 – 26, 2018 Tutorial Lectures (S. 110-141). Lecture Notes in Computer Science, Springer: Cham.
  • Paulheim, H. (2018). Make embeddings semantic again!. In Erp, M., ISWC-P&D-Industry-BlueSky 2018 : Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks co-located with 17th International Semantic Web Conference (ISWC 2018) Monterey, USA, October 8th to 12th, 2018 (S. 4). CEUR Workshop Proceedings, RWTH: Aachen.
  • Portisch, J. and Paulheim, H. (2018). ALOD2Vec matcher. In Shvaiko, P., OM 2018 : Proceedings of the 13th International Workshop on Ontology Matching co-located with the 17th International Semantic Web Conference (ISWC 2018) Monterey, CA, USA, October 8, 2018 (S. 132-137). CEUR Workshop Proceedings, RWTH: Aachen.
  • Rico, M., Mihindukulasooriya, N., Kontokostas, D., Paulheim, H., Hellmann, S. and Gómez-Pérez, A. (2018). Predicting incorrect mappings : a data-driven approach applied to DBpedia. In , SAC '18 : the 33rd ACM/SIGAPP Symposium On Applied Computing, Pau, France, April 9 - 13, 2018, proceedings (S. 323-330). , ACM: New York, NY.