Amazing technological breakthrough possible @S-Logix pro@slogix.in

Office Address

  • #5, First Floor, 4th Street Dr. Subbarayan Nagar Kodambakkam, Chennai-600 024 Landmark : Samiyar Madam
  • pro@slogix.in
  • +91- 81240 01111

Social List

Keyphrase Generation Beyond the Boundaries of Title and Abstract - 2021

Keyphrase Generation Beyond The Boundaries Of Title And Abstract

Research Paper on Keyphrase Generation Beyond The Boundaries Of Title And Abstract

Research Area:  Machine Learning

Abstract:

Keyphrase generation aims at generating phrases (keyphrases) that best describe a given document. In scholarly domains, current approaches to this task are neural approaches and have largely worked with only the title and abstract of the articles. In this work, we explore whether the integration of additional data from semantically similar articles or from the full text of the given article can be helpful for a neural keyphrase generation model. We discover that adding sentences from the full text particularly in the form of summary of the article can significantly improve the generation of both types of keyphrases that are either present or absent from the title and abstract. The experimental results on the three acclaimed models along with one of the latest transformer models suitable for longer documents, Longformer Encoder-Decoder (LED) validate the observation. We also present a new large-scale scholarly dataset FullTextKP for keyphrase generation, which we use for our experiments. Unlike prior large-scale datasets, FullTextKP includes the full text of the articles alongside title and abstract. We will release the source code to stimulate research on the proposed ideas.

Keywords:  
Keyphrase Generation
Longformer Encoder-Decoder (LED)
Deep Learning
Machine Learning

Author(s) Name:  Krishna Garg, Jishnu Ray Chowdhury, Cornelia Caragea

Journal name:  Computer Science

Conferrence name:  

Publisher name:  arXiv:2112.06776

DOI:  10.48550/arXiv.2112.06776

Volume Information: