ORKG-Leaderboards

a systematic workflow for mining leaderboards as a knowledge graph

verfasst von
Salomon Kabongo, Jennifer D’Souza, Sören Auer
Abstract

The purpose of this work is to describe the orkg-Leaderboard software designed to extract leaderboards defined as task–dataset–metric tuples automatically from large collections of empirical research papers in artificial intelligence (AI). The software can support both the main workflows of scholarly publishing, viz. as LaTeX files or as PDF files. Furthermore, the system is integrated with the open research knowledge graph (ORKG) platform, which fosters the machine-actionable publishing of scholarly findings. Thus, the systemsss output, when integrated within the ORKG’s supported Semantic Web infrastructure of representing machine-actionable ‘resources’ on the Web, enables: (1) broadly, the integration of empirical results of researchers across the world, thus enabling transparency in empirical research with the potential to also being complete contingent on the underlying data source(s) of publications; and (2) specifically, enables researchers to track the progress in AI with an overview of the state-of-the-art across the most common AI tasks and their corresponding datasets via dynamic ORKG frontend views leveraging tables and visualization charts over the machine-actionable data. Our best model achieves performances above 90% F1 on the leaderboard extraction task, thus proving orkg-Leaderboards a practically viable tool for real-world usage. Going forward, in a sense, orkg-Leaderboards transforms the leaderboard extraction task to an automated digitalization task, which has been, for a long time in the community, a crowdsourced endeavor.

Organisationseinheit(en)
Forschungszentrum L3S
Externe Organisation(en)
Technische Informationsbibliothek (TIB) Leibniz-Informationszentrum Technik und Naturwissenschaften und Universitätsbibliothek
Typ
Artikel
Journal
International Journal on Digital Libraries
Band
25
Seiten
41-54
Anzahl der Seiten
14
ISSN
1432-5012
Publikationsdatum
03.2024
Publikationsstatus
Veröffentlicht
Peer-reviewed
Ja
ASJC Scopus Sachgebiete
Bibliotheks- und Informationswissenschaften
Elektronische Version(en)
https://doi.org/10.48550/arXiv.2305.11068 (Zugang: Offen)
https://doi.org/10.1007/s00799-023-00366-1 (Zugang: Offen)
https://doi.org/10.1007/s00799-024-00405-5 (Zugang: Offen)