Embedding and Clustering Multi-Entity Sequences
- verfasst von
- Connor Heaton, Prasenjit Mitra
- Abstract
Core to much of modern deep learning is the notion of representation learning, learning representations of things that are useful for performing some task(s) related to those things. Encoder-only language models, for example, learn representations of language useful for performing language-related tasks, often classification. While fruitful in many applications, inherent is the assumption that only one classification is to be made for a particular input. This poses challenges when multiple classifications are to be made about different portions of a single record, such as emotion recognition in conversation (ERC) where the objective is to classify the emotion in each utterance of a dialog. Existing methods for this task typically either involve redundant computation, non-trivial post-processing outside of the core language model backbone, or both. To address this, we generalize recent work for deriving player-specific embeddings from multi-player sequences of events in sport for domain-agnostic application while also enabling it to leverage inter-entity relationships. Seeing the efficacy of the method in regression and classification tasks, we explore how it can be used to cluster player representations, proposing a novel approach for distribution-aware deep-clustering in the absence of labels. We demonstrate how the proposed methods yield state-of-the-art performance on the disparate tasks of ERC in Natural Language Processing (NLP), long-tail partial-label-learning (LT-PLL) in Computer Vision (CV), and player form clustering in sports analytics.
- Organisationseinheit(en)
-
Forschungszentrum L3S
- Externe Organisation(en)
-
Pennsylvania State University
- Typ
- Artikel
- Journal
- IEEE ACCESS
- Band
- 12
- Seiten
- 57492-57503
- Anzahl der Seiten
- 12
- ISSN
- 2169-3536
- Publikationsdatum
- 22.04.2024
- Publikationsstatus
- Veröffentlicht
- Peer-reviewed
- Ja
- ASJC Scopus Sachgebiete
- Allgemeine Computerwissenschaft, Allgemeine Materialwissenschaften, Allgemeiner Maschinenbau
- Elektronische Version(en)
-
https://doi.org/10.1109/ACCESS.2024.3391820 (Zugang:
Offen)