Amazing technological breakthrough possible @S-Logix pro@slogix.in

Office Address

  • #5, First Floor, 4th Street Dr. Subbarayan Nagar Kodambakkam, Chennai-600 024 Landmark : Samiyar Madam
  • pro@slogix.in
  • +91- 81240 01111

Social List

A Comprehensive Survey on Topic Modeling in Text Summarization - 2022

A Comprehensive Survey on Topic Modeling in Text Summarization

Survey paper on Topic Modeling in Text Summarization

Research Area:  Machine Learning

Abstract:

Topic modeling is the statistical model for discovering hidden topics or keywords in a collection of documents. Topic modeling is also considered a probabilistic model for learning, analyzing, and discovering topics from the document collection. The most popular techniques for topic modeling are latent semantic analysis (LSA), probabilistic latent semantic analysis (pLSA), latent Dirichlet allocation (LDA), and the recent deep learning-based lda2vec. LDA is most commonly used in extractive multi-document summarization to determine whether the extracted sentence reflects the concept of the input document. In this paper, we will try to explore various multi-document summarization techniques that use LDA as a topic modeling method for improving final summary coverage and to reduce redundancy. Finally, we compared LDA and LSA using the Genism toolkit, and our experiment results show that LDA outperforms LSA if we increase the number of features considered for sentence selection.

Keywords:  
Topic modeling
Text summarization
LDA
LSA

Author(s) Name:  G. Bharathi Mohan & R. Prasanna Kumar

Journal name:  

Conferrence name:  International Conference on Micro-Electronics and Telecommunication Engineering

Publisher name:  Springer

DOI:  10.1007/978-981-16-8721-1_22

Volume Information: