Research Area:  Machine Learning
A model can be easily influenced by unseen factors in nonstationary environments and fail to fit dynamic data distribution. In a classification scenario, this is known as a concept drift. For instance, the shopping preference of customers may change after they move from one city to another. Therefore, a shopping website or application should alter recommendations based on its poorer predictions of such user patterns. In this article, we propose a novel approach called the multiscale drift detection test (MDDT) that efficiently localizes abrupt drift points when feature values fluctuate, meaning that the current model needs immediate adaption. MDDT is based on a resampling scheme and a paired student t-test. It applies a detection procedure on two different scales. Initially, the detection is performed on a broad scale to check if recently gathered drift indicators remain stationary. If a drift is claimed, a narrow scale detection is performed to trace the refined change time. This multiscale structure reduces the massive time of constantly checking and filters noises in drift indicators. Experiments are performed to compare the proposed method with several algorithms via synthetic and real-world datasets. The results indicate that it outperforms others when abrupt shift datasets are handled, and achieves the highest recall score in localizing drift points.
Author(s) Name:  XueSong Wang; Qi Kang; MengChu Zhou; Le Pan; Abdullah Abusorrah
Journal name:  IEEE Transactions on Cybernetics
Publisher name:  IEEE
Volume Information:  ( Volume: 51, Issue: 7, July 2021) Page(s): 3483 - 3495
Paper Link:   https://ieeexplore.ieee.org/abstract/document/9119144