Amazing technological breakthrough possible @S-Logix pro@slogix.in

Office Address

  • #5, First Floor, 4th Street Dr. Subbarayan Nagar Kodambakkam, Chennai-600 024 Landmark : Samiyar Madam
  • pro@slogix.in
  • +91- 81240 01111

Social List

A survey on multi-modal summarization - 2023


A Survey on Multi-modal Summarization | S-Logix

Research Area:  Machine Learning

Abstract:

The new era of technology has brought us to the point where it is convenient for people to share their opinions over an abundance of platforms. These platforms have a provision for the users to express themselves in multiple forms of representations, including text, images, videos, and audio. This, however, makes it difficult for users to obtain all the key information about a topic, making the task of automatic multi-modal summarization (MMS) essential. In this article, we present a comprehensive survey of the existing research in the area of MMS, covering various modalities such as text, image, audio, and video. Apart from highlighting the different evaluation metrics and datasets used for the MMS task, our work also discusses the current challenges and future directions in this field.

Keywords:  
text
images
videos
audio
utomatic multi-modal summarization
evaluation metrics
datasets

Author(s) Name:  Anubhav Jangra, Sourajit Mukherjee, Adam Jatowt, Sriparna Saha, Mohammad Hasanuzzaman

Journal name:  ACM Computing Surveys

Conferrence name:  

Publisher name:  ACM

DOI:  https://doi.org/10.1145/3584700

Volume Information:  Volume 55