VALSE: A Task-independent benchmark for Vision and Language models centered on linguistic phenomena

Parcalabescu, L; Cafagna, M; Muradjan, L; Frank, A; Calixto, I; Gatt, A

doi:https://doi.org/10.18653/v1/2022.acl-long.567

VALSE: A Task-independent benchmark for Vision and Language models centered on linguistic phenomena

DSpace/Manakin Repository

VALSE: A Task-independent benchmark for Vision and Language models centered on linguistic phenomena

Parcalabescu, L; Cafagna, M; Muradjan, L; Frank, A; Calixto, I; Gatt, A

(2022) Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL'22), pp.

(Part of book)

Abstract

We propose VALSE (Vision And Language Structured Evaluation), a novel benchmark designed for testing general-purpose pretrained vision and language (V&L) models for their visio-linguistic grounding capabilities on specific linguistic phenomena. VALSE offers a suite of six tests covering various linguistic constructs. Solving these requires models to ground linguistic phenomena in ... read more

Download/Full Text

Open Access version via Utrecht University Repository

Publisher version

DOI: https://doi.org/10.18653/v1/2022.acl-long.567

Publisher: Association for Computational Linguistics

(Peer reviewed)

See more statistics about this item