Research Area:  Machine Learning
With the rapid development of information technology, Internet data is growing exponentially, which makes it difficult for users to extract key information from massive Internet data. Data compression technology represented by text summary has gradually attracted extensive attention from academia and industry. As an extension of text summarization, multimodal summarization task can effectively reduce users’ information burden and improve users’ information acquisition speed by integrating visual and auditory modal information and using the mutual supplement and verification of different modal data. It has high research value in the fields of information retrieval, public opinion analysis, content review and so on. This paper combs the related research of multimodal summarization in recent years, summarizes the existing technologies and related data sets for multimodal summarization tasks, and summarizes the development direction of future research in this field.
Keywords:  
Author(s) Name:  Qiduo Lu, Chenhao Zhu, Xia Ye
Journal name:  
Conferrence name:  2022 IEEE International Conference on Advances in Electrical Engineering and Computer Applications (AEECA)
Publisher name:  IEEE
DOI:  https://doi.org/10.1109/AEECA55500.2022.9919012
Volume Information:  -
Paper Link:   https://ieeexplore.ieee.org/abstract/document/9919012