Amazing technological breakthrough possible @S-Logix pro@slogix.in

Office Address

  • #5, First Floor, 4th Street Dr. Subbarayan Nagar Kodambakkam, Chennai-600 024 Landmark : Samiyar Madam
  • pro@slogix.in
  • +91- 81240 01111

Social List

Visual question answering: a state-of-the-art review - 2020

Visual question answering: a state-of-the-art review

Survey paper on Visual question answering: a state-of-the-art review

Research Area:  Machine Learning

Abstract:

Visual question answering (VQA) is a task that has received immense consideration from two major research communities: computer vision and natural language processing. Recently it has been widely accepted as an AI-complete task which can be used as an alternative to visual turing test. In its most common form, it is a multi-modal challenging task where a computer is required to provide the correct answer for a natural language question asked about an input image. It attracts many deep learning researchers after their remarkable achievements in text, voice and vision technologies. This review extensively and critically examines the current status of VQA research in terms of step by step solution methodologies, datasets and evaluation metrics. Finally, this paper also discusses future research directions for all the above-mentioned aspects of VQA separately.

Keywords:  
Visual question answering
natural language processing
Deep learning
AI

Author(s) Name:  Sruthy Manmadhan & Binsu C. Kovoor

Journal name:  Artificial Intelligence Review

Conferrence name:  

Publisher name:  Springer

DOI:  10.1007/s10462-020-09832-7

Volume Information:  volume 53, pages: 5705–5745 (2020)