Aiming beyond the Obvious: Identifying Non-Obvious Cases in Semantic Similarity Datasets

Peinelt, Nicole; Liakata, Maria; Nguyen, Dong

doi:https://doi.org/10.18653/v1/P19-1268

Aiming beyond the Obvious: Identifying Non-Obvious Cases in Semantic Similarity Datasets

DSpace/Manakin Repository

Aiming beyond the Obvious: Identifying Non-Obvious Cases in Semantic Similarity Datasets

Peinelt, Nicole; Liakata, Maria; Nguyen, Dong

(2019) Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pp. 2792 - 2798

(Part of book)

Abstract

Existing datasets for scoring text pairs in terms of semantic similarity contain instances whose resolution differs according to the degree of difficulty. This paper proposes to distinguish obvious from non-obvious text pairs based on superficial lexical overlap and ground-truth labels. We characterise existing datasets in terms of containing difficult cases ... read more

Download/Full Text

Open Access version via Utrecht University Repository

Publisher version

DOI: https://doi.org/10.18653/v1/P19-1268

Publisher: Association for Computational Linguistics

(Peer reviewed)

See more statistics about this item