Aaron Steiner

Researcher

B6, 26, Room C 1.04
68159 Mannheim, Germany

E-Mail: aaron.steineruni-mannheim.de
Phone: +49 (0) 621 / 181-2566

Research Interests

  • End-to-end data integration with LLMs across schema matching, value normalization, entity matching, and data fusion
  • Large language models for entity matching — prompting, fine-tuning, and label-efficient training
  • Web agents and their evaluation, including benchmark design for multi-shop and multi-step reasoning tasks
  • Agent–environment interfaces — how the choice of interface (MCP, RAG, structured web, raw HTML) shapes agent autonomy and effectiveness

Teaching

  • Large Language Models and Agents (FSS2026) – Course Details
  • Data Mining (FSS2026) – Course Details
  • Seminar CS715: Solving Complex Tasks using Large Language Models (FSS2026) – Course Details
  • Web Data Integration (HWS2025) – Course Details
  • Team project: LLM-Agents in Data Integration Pipelines (HWS2025) - Course Details
  • Large Language Models and Agents (FSS2025) – Course Details
  • Data Mining (FSS2025) – Course Details
  • Seminar CS715: Solving Complex Tasks using Large Language Models (FSS2025) – Course Details 

Software and Benchmarks

  • PyDI – Open-source Python framework for end-to-end data integration. Combines traditional string-based methods with embedding- and LLM-based techniques across schema matching, value normalization, entity matching, and data fusion.
  • WebMall – Multi-shop benchmark for evaluating web agents. 91 tasks across four simulated e-shops with over 4,000 heterogeneous product offers, covering comparison-shopping workflows from product search through checkout.

Publications