Generating a Question Answering Dataset About Geographic Changes in a Knowledge Graph

verfasst von
Michalis Mitsios, Dharmen Punjani, Sara Abdollahi, Simon Gottschalk, Eleni Tsalapati, Elena Demidova, Manolis Koubarakis
Abstract

Most studies on semantic question answering (QA) are predominantly focused on encyclopedic knowledge graphs like DBpedia and Wikidata. These studies cover, if at all, the spatial and temporal characteristics of geospatial entities in isolation, not addressing them simultaneously. In this paper, we introduce a pipeline for creating question answering datasets for evaluating the reasoning capabilities of QA models in the context of geographic changes over time. This pipeline generates questions, GeoSPARQL queries, and corresponding answers by leveraging subgraph and query template extraction techniques. We exemplify this pipeline with the creation of the GeoChangesQA dataset with questions over a knowledge graph of US counties and states and their changes from 1629 to 2000. By evaluating GeoChangesQA using a Transformer-based model, we demonstrate that historical geospatial questions pose a substantial challenge for semantic question answering.

Organisationseinheit(en)
Forschungszentrum L3S
Externe Organisation(en)
University of Athens
Université Jean Monnet Saint-Étienne
Deutsche Akademie der Technikwissenschaften (acatech)
Athens Technology Center S.A.
Rheinische Friedrich-Wilhelms-Universität Bonn
Lamarr-Institut für Maschinelles Lernen und Künstliche Intelligenz
Archimedes/Athena RC
Typ
Aufsatz in Konferenzband
Seiten
471-489
Anzahl der Seiten
19
Publikationsdatum
2025
Publikationsstatus
Veröffentlicht
Peer-reviewed
Ja
ASJC Scopus Sachgebiete
Theoretische Informatik, Allgemeine Computerwissenschaft
Elektronische Version(en)
https://doi.org/10.1007/978-3-031-77792-9_28 (Zugang: Geschlossen)