
Multi-level curriculum learning for multi-turn dialogue generation - 2023


Research Paper On Multi-level curriculum learning for multi-turn dialogue generation

Research Area:  Machine Learning

Abstract:

Since deep learning is the dominant paradigm in the multi-turn dialogue generation task, large-scale training data is the key factor affecting model performance. To make full use of the training data, existing work directly applies curriculum learning (CL) to the multi-turn dialogue generation task, training the model in an “easy-to-hard” way. However, the design of the current methodology does not consider dialogue-specific features. To close this gap, we propose a Multi-Level Curriculum Learning (MLCL) method for multi-turn dialogue generation that considers the word-level linguistic features and utterance-level semantic relations in a dialogue. The motivation is that word-level knowledge is beneficial to understanding the complex utterance-level dependencies of a dialogue. Thus, we design two difficulty measurements and a self-adaptive curriculum scheduler, making the model gradually shift its learning focus from word-level to utterance-level information during the training process. We also verify the independence and complementarity of the two measurements at different levels. We evaluate the performance on two widely used multi-turn dialogue datasets, and the results demonstrate that our proposed method outperforms strong baselines and existing CL methods in terms of automated metrics and human evaluation. We will release the code files upon acceptance.
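
The abstract does not give the difficulty measurements or the scheduler, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes each training dialogue already has a word-level and an utterance-level difficulty score, blends them with a weight that moves linearly from word-level to utterance-level over training, and admits examples whose blended difficulty falls under a square-root competence threshold. The function names, the linear blend, the fixed (non-adaptive) schedule, and the `c0` starting competence are all illustrative assumptions.

```python
import math


def competence(step, total_steps, c0=0.25):
    """Assumed square-root competence schedule, rising from c0 to 1.0."""
    t = min(step / total_steps, 1.0)
    if t >= 1.0:
        return 1.0  # guard against floating-point rounding at the last step
    return min(1.0, math.sqrt(t * (1.0 - c0 ** 2) + c0 ** 2))


def rank_normalize(scores):
    """Map raw difficulty scores to (0, 1] by rank, easiest first."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    for position, index in enumerate(order):
        ranks[index] = (position + 1) / len(scores)
    return ranks


def select_examples(word_scores, utt_scores, step, total_steps, c0=0.25):
    """Return indices of dialogues that are 'easy enough' at this step.

    Hypothetical blend: the weight on utterance-level difficulty grows
    linearly with training progress, so early selection is driven by
    word-level difficulty and late selection by utterance-level difficulty.
    """
    word_ranks = rank_normalize(word_scores)
    utt_ranks = rank_normalize(utt_scores)
    lam = min(step / total_steps, 1.0)
    threshold = competence(step, total_steps, c0)
    selected = []
    for i, (w, u) in enumerate(zip(word_ranks, utt_ranks)):
        blended = (1.0 - lam) * w + lam * u
        if blended <= threshold:
            selected.append(i)
    return selected


# Toy scores for four dialogues (made-up values for illustration only).
word_scores = [0.1, 0.5, 0.9, 0.3]
utt_scores = [0.8, 0.2, 0.9, 0.4]
early = select_examples(word_scores, utt_scores, step=0, total_steps=100)
late = select_examples(word_scores, utt_scores, step=100, total_steps=100)
```

In this toy run, the first step admits only the dialogue that is easiest at the word level, while the final step admits every dialogue. A self-adaptive scheduler such as the one the abstract mentions would presumably adjust the pace from training signals rather than follow a fixed schedule like this one.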

Keywords:  

Author(s) Name:  Guanhua Chen, Runzhe Zhan, Derek F. Wong, Lidia S. Chao

Journal name:  IEEE/ACM Transactions on Audio, Speech, and Language Processing

Conference name:  

Publisher name:  IEEE

DOI:  10.1109/TASLP.2023.3322583

Volume Information:  Volume: 31, Pages: 3958-3967, (2023)