Decomposed Deep Q-Network for Coherent Task-Oriented Dialogue Policy Learning

Zhao, Yangyang; Yin, Kai; Wang, Zhenyu; Dastani, Mehdi; Wang, Shihan

doi:https://doi.org/10.1109/TASLP.2024.3357038

(2024) IEEE/ACM Transactions on Audio, Speech, and Language Processing, volume 32, pp. 1380 - 1391

(Article)

Abstract

Reinforcement learning (RL) has emerged as a key technique for designing dialogue policies. However, action space inflation in dialogue tasks has led to a heavy decision burden and incoherence problems for dialogue policies. In this paper, we propose a novel decomposed deep Q-network (D2Q) that exploits the natural structure of ...

Keywords: action space inflation, dialogue policy, incoherence problem, Reinforcement learning, Taverne, Computer Science (miscellaneous), Computational Mathematics, Electrical and Electronic Engineering, Acoustics and Ultrasonics

(Peer reviewed)
