Retrospective and Prospective Mixture-of-Generators for Task-oriented Dialogue Response Generation

Pei, Jiahuan; Ren, Pengjie; Monz, Christof; de Rijke, Maarten

Computer Science > Computation and Language

arXiv:1911.08151 (cs)

[Submitted on 19 Nov 2019 (v1), last revised 19 Feb 2020 (this version, v2)]

Title:Retrospective and Prospective Mixture-of-Generators for Task-oriented Dialogue Response Generation

Authors:Jiahuan Pei, Pengjie Ren, Christof Monz, Maarten de Rijke

View PDF

Abstract:Dialogue response generation (DRG) is a critical component of task-oriented dialogue systems (TDSs). Its purpose is to generate proper natural language responses given some context, e.g., historical utterances, system states, etc. State-of-the-art work focuses on how to better tackle DRG in an end-to-end way. Typically, such studies assume that each token is drawn from a single distribution over the output vocabulary, which may not always be optimal. Responses vary greatly with different intents, e.g., domains, system actions.
We propose a novel mixture-of-generators network (MoGNet) for DRG, where we assume that each token of a response is drawn from a mixture of distributions. MoGNet consists of a chair generator and several expert generators. Each expert is specialized for DRG w.r.t. a particular intent. The chair coordinates multiple experts and combines the output they have generated to produce more appropriate responses. We propose two strategies to help the chair make better decisions, namely, a retrospective mixture-of-generators (RMoG) and prospective mixture-of-generators (PMoG). The former only considers the historical expert-generated responses until the current time step while the latter also considers possible expert-generated responses in the future by encouraging exploration. In order to differentiate experts, we also devise a global-and-local (GL) learning scheme that forces each expert to be specialized towards a particular intent using a local loss and trains the chair and all experts to coordinate using a global loss.
We carry out extensive experiments on the MultiWOZ benchmark dataset. MoGNet significantly outperforms state-of-the-art methods in terms of both automatic and human evaluations, demonstrating its effectiveness for DRG.

Comments:	The paper is accepted by 24th European Conference on Artificial Intelligence
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
Cite as:	arXiv:1911.08151 [cs.CL]
	(or arXiv:1911.08151v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1911.08151

Submission history

From: Jiahuan Pei [view email]
[v1] Tue, 19 Nov 2019 08:20:45 UTC (529 KB)
[v2] Wed, 19 Feb 2020 11:04:28 UTC (530 KB)

Computer Science > Computation and Language

Title:Retrospective and Prospective Mixture-of-Generators for Task-oriented Dialogue Response Generation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Retrospective and Prospective Mixture-of-Generators for Task-oriented Dialogue Response Generation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators