Research Area:  Machine Learning
The automated news classification concerns the assignment of news to one or more predefined categories. The automated classified news helps the search engines to mine and categorize the type of news that the user asks for. Most of the researchers focused on the classification of English news and ignore the Arabic news due to the complexity of the Arabic morphology. This article presents a novel methodology to classify the Arabic news. It relies on the use of features extraction and the application of machine learning classifiers which are the Naive Bayes (NB), the Logistic Regression (LR), the Random Forest (RF), the Xtreme Gradient Boosting (XGB), the K-Nearest Neighbors (KNN), the Stochastic Gradient Descent (SGD), the Decision Tree (DT), and the Multi-Layer Perceptron (MLP). The methodology is applied to the Arabic news dataset provided by Mendeley. The accuracy of the classification is more than 95%.
Keywords:  
Author(s) Name:  Marco Alfonse, Mariam Gawich
Journal name:  WIREs Data Mining and Knowledge Discovery
Conferrence name:  
Publisher name:  Wiley
DOI:  10.1002/widm.1440
Volume Information:  Volume 2021
Paper Link:   https://wires.onlinelibrary.wiley.com/doi/abs/10.1002/widm.1440