A Versatile Adaptive Curriculum Learning Framework for Task-oriented Dialogue Policy Learning

Zhao, Yangyang; Qin, Hua; Zhenyu, Wang; Zhu, Changxi; Wang, Shihan

doi:https://doi.org/10.18653/v1/2022.findings-naacl.54

(2022) Findings of the Association for Computational Linguistics: NAACL 2022, pp. 711-723

(Part of book)

Abstract

Training a deep reinforcement learning-based dialogue policy with brute-force random sampling is costly. A new training paradigm was proposed to improve learning performance and efficiency by combining curriculum learning. However, attempts in the field of dialogue policy are very limited due to the lack of reliable evaluation of difficulty scores ...

DOI: https://doi.org/10.18653/v1/2022.findings-naacl.54

ISBN: 9781955917766

Publisher: Association for Computational Linguistics

Note: Funding Information: We would like to thank the reviewers for their comments and efforts towards improving our paper. We would also like to acknowledge the volunteers of the South China University of Technology who helped us with the human experiments. This work was supported by the Key-Area Research and Development Program of Guangdong Province, China (Grant No. 2019B0101540042) and the Natural Science Foundation of Guangdong Province, China (Grant No. 2019A1515011792). Publisher Copyright: © Findings of the Association for Computational Linguistics: NAACL 2022 - Findings.

(Peer reviewed)
