Gradations of Error Severity in Automatic Image Descriptions

van Miltenburg, Emiel; Lu, Wei-Ting; Krahmer, Emiel; Gatt, Albert; Chen, Guanyi; Li, Lin; van Deemter, Kees

Gradations of Error Severity in Automatic Image Descriptions

DSpace/Manakin Repository

Gradations of Error Severity in Automatic Image Descriptions

van Miltenburg, Emiel; Lu, Wei-Ting; Krahmer, Emiel; Gatt, Albert; Chen, Guanyi; Li, Lin; van Deemter, Kees

(2020) Proceedings of the 13th International Conference on Natural Language Generation, pp. 398 - 411

(Part of book)

Abstract

Earlier research has shown that evaluation metrics based on textual similarity (e.g., BLEU, CIDEr, Meteor) do not correlate well with human evaluation scores for automatically generated text. We carried out an experiment with Chinese speakers, where we systematically manipulated image descriptions to contain different kinds of errors. Because our manipulated ... read more

Download/Full Text

Open Access version via Utrecht University Repository

Publisher version

Keywords: Taverne

Publisher: Association for Computational Linguistics (ACL)

(Peer reviewed)

See more statistics about this item