Generating a Question Answering Dataset About Geographic Changes in a Knowledge Graph
- authored by
- Michalis Mitsios, Dharmen Punjani, Sara Abdollahi, Simon Gottschalk, Eleni Tsalapati, Elena Demidova, Manolis Koubarakis
- Abstract
Most studies on semantic question answering (QA) are predominantly focused on encyclopedic knowledge graphs like DBpedia and Wikidata. These studies cover, if at all, the spatial and temporal characteristics of geospatial entities in isolation, not addressing them simultaneously. In this paper, we introduce a pipeline for creating question answering datasets for evaluating the reasoning capabilities of QA models in the context of geographic changes over time. This pipeline generates questions, GeoSPARQL queries, and corresponding answers by leveraging subgraph and query template extraction techniques. We exemplify this pipeline with the creation of the GeoChangesQA dataset with questions over a knowledge graph of US counties and states and their changes from 1629 to 2000. By evaluating GeoChangesQA using a Transformer-based model, we demonstrate that historical geospatial questions pose a substantial challenge for semantic question answering.
- Organisation(s)
-
L3S Research Centre
- External Organisation(s)
-
University of Athens
Université Jean Monnet Saint-Étienne
Deutsche Akademie der Technikwissenschaften (acatech)
Athens Technology Center
University of Bonn
Lamarr Institute for Machine Learning and Artificial Intelligence
Archimedes/Athena RC
- Type
- Conference contribution
- Pages
- 471-489
- No. of pages
- 19
- Publication date
- 2025
- Publication status
- Published
- Peer reviewed
- Yes
- ASJC Scopus subject areas
- Theoretical Computer Science, General Computer Science
- Electronic version(s)
-
https://doi.org/10.1007/978-3-031-77792-9_28 (Access:
Unknown)