Latest Research Topic in Multimodal Hierarchical Reinforcement Learning Policy

Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog - 2018

multimodal-hierarchical-reinforcement-learning-policy-for-task-oriented-visual-dialog.jpg

Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog | S-Logix

Research Area: Machine Learning

Abstract:

Creating an intelligent conversational system that understands vision and language is one of the ultimate goals in Artificial Intelligence. Extensive research has focused on vision-to-language generation, however, limited research has touched on combining these two modalities in a goal-driven dialog context. We propose a multimodal hierarchical reinforcement learning framework that dynamically integrates vision and language for task-oriented visual dialog. The framework jointly learns the multimodal dialog state representation and the hierarchical dialog policy to improve both dialog task success and efficiency. We also propose a new technique, state adaptation, to integrate context awareness in the dialog state representation. We evaluate the proposed framework and the state adaptation technique in an image guessing game and achieve promising results.

Keywords:
Reinforcement learning
Multimodal
Conversational system
Dialog State Representation
Visual Dialog

Author(s) Name: Jiaping Zhang, Tiancheng Zhao, Zhou Yu

Journal name: Computation and Language

Conferrence name:

Publisher name: arXiv:1805.03257

DOI: https://doi.org/10.48550/arXiv.1805.03257

Volume Information: v1

Paper Link: https://arxiv.org/abs/1805.03257

Office Address

Social List

Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog - 2018

Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog | S-Logix

Abstract:

S-Logix (OPC) Private Limited

Office Address

Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog - 2018

Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog | S-Logix

Abstract:

Related Papers