Document Specific Supervised Keyphrase Extraction

Document Specific Supervised Keyphrase Extraction With Strong Semantic Relations - 2019

Research Paper on Document Specific Supervised Keyphrase Extraction With Strong Semantic Relations

Research Area: Machine Learning

Abstract:

Keyphrase extraction is the task of automatically extracting descriptive phrases or concepts that represent the main topics in a document. Finding good keyphrases in a document can quickly summarize knowledge for information retrieval and decision making. Existing keyphrase extraction methods cannot be customized to each specific document, and cannot capture flexible semantic relations. In this paper, a keyphrase extraction algorithm using maximum sequential pattern mining with one-off and general gaps condition, called Ke-MSMING, is presented. Ke_MSMING first searches all keyphrase candidates from a document using sequential patterns mining and the topic model, and then adopts supervised machine learning to classify each keyphrase candidate as a keyphrase or not. Finally, Ke_MSMING selects top-N keyphrases as the final keyphrases. Ke_MSMING not only uses baseline features and pattern features but also uses centrality features obtained from the cooccurrence semantic network, and the cooccurrence networks can yield powerful semantic relations for keyphrase extraction. Experimental results on two datasets demonstrate that Ke_MSMING has better performance than other state-of-the-art keyphrase extraction approaches.

Keywords:
Supervised
Keyphrase Extraction
Machine Learning
Deep Learning

Author(s) Name: Huiting Liu; Lili Wang; Peng Zhao; Xindong Wu

Journal name: IEEE Access

Conferrence name:

Publisher name: IEEE

DOI: Page(s): 167507 - 167520

Volume Information: ( Volume: 7)

Paper Link: https://ieeexplore.ieee.org/abstract/document/8879476

Office Address

Social List