Amazing technological breakthrough possible @S-Logix pro@slogix.in

Office Address

  • #5, First Floor, 4th Street Dr. Subbarayan Nagar Kodambakkam, Chennai-600 024 Landmark : Samiyar Madam
  • pro@slogix.in
  • +91- 81240 01111

Social List

Gated Multimodal Units for Information Fusion - 2017

Gated Multimodal Units For Information Fusion

Research Area:  Machine Learning

Abstract:

This paper presents a novel model for multimodal learning based on gated neural networks. The Gated Multimodal Unit (GMU) model is intended to be used as an internal unit in a neural network architecture whose purpose is to find an intermediate representation based on a combination of data from different modalities. The GMU learns to decide how modalities influence the activation of the unit using multiplicative gates. It was evaluated on a multilabel scenario for genre classification of movies using the plot and the poster. The GMU improved the macro f-score performance of single-modality approaches and outperformed other fusion strategies, including mixture of experts models. Along with this work, the MM-IMDb dataset is released which, to the best of our knowledge, is the largest publicly available multimodal dataset for genre prediction on movies.

Keywords:  

Author(s) Name:  John Arevalo, Thamar Solorio, Manuel Montes-y-Gómez, Fabio A. González

Journal name:  Statistics

Conferrence name:  

Publisher name:  arXiv:1702.01992

DOI:  10.48550/arXiv.1702.01992

Volume Information: