The challenge for text classification in the open world

Overcoming the challenge for text classification in the open world - 2017

Research Paper on Overcoming The Challenge For Text Classification In The Open World

Research Area: Machine Learning

Abstract:

Classification is often referred to as the task of discriminating one class from others in a given set of classes. Traditionally, classifiers work well assuming that a priori knowledge of all classes are given. Unfortunately, a presenting of unknown class during testing can lead to poor performance of even state-of-the-art classifiers due to observed classes being incorrectly identified to other classes. Recent proposed open world recognition framework provides a promising venue for tackling this challenge. While the majority of works in this relative new field is in computer vision, the rare work in Natural Language Processing shows its instability in its performance and is not based on the open world recognition framework. To tackle this problem, we represent our Nearest Centroid Class (NCC) which is incremental learning and able to detect unknown class during testing. Our model yields promising results in a document classification on text classification domains among current state-of-the-art models.

Keywords:
Text Classification
Natural Language Processing
Nearest Centroid Class
document classification
Machine Learning
Deep Learning

Author(s) Name: Tri Doan; Jugal Kalita

Journal name:

Conferrence name: IEEE 7th Annual Computing and Communication Workshop and Conference (CCWC)

Publisher name: IEEE

DOI: 10.1109/CCWC.2017.7868366

Volume Information:

Paper Link: https://ieeexplore.ieee.org/abstract/document/7868366

Office Address

Social List

Overcoming the challenge for text classification in the open world - 2017

Research Paper on Overcoming The Challenge For Text Classification In The Open World

Abstract:

S-Logix (OPC) Private Limited

Office Address

Overcoming the challenge for text classification in the open world - 2017

Research Paper on Overcoming The Challenge For Text Classification In The Open World

Abstract:

Related Papers