Research Area:  Machine Learning
Keyphrase extraction is the task of automatically extracting descriptive phrases or concepts that represent the main topics in a document. Finding good keyphrases in a document can quickly summarize knowledge for information retrieval and decision making. Existing keyphrase extraction methods cannot be customized to each specific document, and cannot capture flexible semantic relations. In this paper, a keyphrase extraction algorithm using maximum sequential pattern mining with one-off and general gaps condition, called Ke-MSMING, is presented. Ke_MSMING first searches all keyphrase candidates from a document using sequential patterns mining and the topic model, and then adopts supervised machine learning to classify each keyphrase candidate as a keyphrase or not. Finally, Ke_MSMING selects top-N keyphrases as the final keyphrases. Ke_MSMING not only uses baseline features and pattern features but also uses centrality features obtained from the cooccurrence semantic network, and the cooccurrence networks can yield powerful semantic relations for keyphrase extraction. Experimental results on two datasets demonstrate that Ke_MSMING has better performance than other state-of-the-art keyphrase extraction approaches.
Keywords:  
Supervised
Keyphrase Extraction
Machine Learning
Deep Learning
Author(s) Name:  Huiting Liu; Lili Wang; Peng Zhao; Xindong Wu
Journal name:  IEEE Access
Conferrence name:  
Publisher name:  IEEE
DOI:  Page(s): 167507 - 167520
Volume Information:  ( Volume: 7)
Paper Link:   https://ieeexplore.ieee.org/abstract/document/8879476