Amazing technological breakthrough possible @S-Logix pro@slogix.in

Office Address

  • #5, First Floor, 4th Street Dr. Subbarayan Nagar Kodambakkam, Chennai-600 024 Landmark : Samiyar Madam
  • pro@slogix.in
  • +91- 81240 01111

Social List

Convex Surrogates for Unbiased Loss Functions in Extreme Classification With Missing Labels - 2021

Convex Surrogates For Unbiased Loss Functions In Extreme Classification With Missing Labels

Research Area:  Machine Learning

Abstract:

Extreme Classification (XC) refers to supervised learning where each training/test instance is labeled with small subset of relevant labels that are chosen from a large set of possible target labels. The framework of XC has been widely employed in web applications such as automatic labeling of web-encyclopedia, prediction of related searches, and recommendation systems. While most state-of-the-art models in XC achieve high overall accuracy by performing well on the frequently occurring labels, they perform poorly on a large number of infrequent (tail) labels. This arises from two statistical challenges, (i) missing labels, as it is virtually impossible to manually assign every relevant label to an instance, and (ii) highly imbalanced data distribution where a large fraction of labels are tail labels. In this work, we consider common loss functions that decompose over labels, and calculate unbiased estimates that compensate missing labels according to Natarajan et al. [26]. This turns out to be disadvantageous from an optimization perspective, as important properties such as convexity and lower-boundedness are lost. To circumvent this problem, we use the fact that typical loss functions in XC are convex surrogates of the 0-1 loss, and thus propose to switch to convex surrogates of its unbiased version. These surrogates are further adapted to the label imbalance by combining with label-frequency-based rebalancing.

Keywords:  

Author(s) Name:  Mohammadreza Qaraei, Erik Schultheis, Priyanshu Gupta, and Rohit Babbar

Journal name:  

Conferrence name:  WWW-21: Proceedings of the Web Conference April 2021

Publisher name:  ACM

DOI:  https://doi.org/10.1145/3442381.3450139

Volume Information:  Pages 3711–3720