
A Sparse Nonlinear Classifier Design Using AUC Optimization

Kakkar, Vishal and Shevade, Shirish and Sundararajan, S and Garg, Dinesh (2017) A Sparse Nonlinear Classifier Design Using AUC Optimization. In: 17th SIAM International Conference on Data Mining, SDM 2017, 27 - 29 April 2017, Houston, pp. 291-299.

SIAM_SDM 2017_291-299_2017 .pdf - Published Version

Official URL: https://doi.org/10.1137/1.9781611974973.33


AUC (area under the ROC curve) is an important performance measure for applications where the data is highly imbalanced. Efficient AUC optimization is a challenging research problem because the objective function is non-decomposable and discontinuous. Using a max-margin surrogate loss function, the AUC optimization problem can be approximated as a pairwise RankSVM learning problem. Batch learning algorithms for solving the kernelized version of this problem suffer from scalability issues. Therefore, recent years have witnessed increased interest in online or single-pass algorithms that design a nonlinear classifier by maximizing AUC performance. However, on many real-world datasets, the AUC performance of these classifiers was observed to be inferior to that of classifiers designed using batch learning algorithms. Further, many practical imbalanced data classification problems demand fast inference, which underlines the need for sparse nonlinear classifiers. Motivated by these observations, we design a scalable algorithm that maximizes AUC performance by greedily adding the required number of basis functions to the classifier model. The resulting sparse classifier performs faster inference, and its AUC performance is comparable to that of the classifier designed in batch mode. Our experimental results show that the level of sparsity achievable can be an order of magnitude larger than that achieved by the Kernel RankSVM model without significantly affecting the AUC performance.
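
To make the pairwise view in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It computes the empirical AUC as the fraction of correctly ordered (positive, negative) pairs and a pairwise hinge loss of the kind used in RankSVM-style surrogates. The function names, the `margin` parameter, and the toy scores are assumptions introduced here for illustration only.

```python
import numpy as np

def _pairwise_differences(scores, labels):
    """Score differences f(x_pos) - f(x_neg) for every (positive, negative) pair."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    pos = scores[labels == 1]   # assumption: label 1 marks the positive class
    neg = scores[labels != 1]   # every other label is treated as negative
    return pos[:, None] - neg[None, :]

def empirical_auc(scores, labels):
    """Fraction of pairs ranked correctly; tied pairs count as one half."""
    diff = _pairwise_differences(scores, labels)
    return float(np.mean((diff > 0) + 0.5 * (diff == 0)))

def pairwise_hinge_loss(scores, labels, margin=1.0):
    """Mean hinge loss over pairs: a continuous, convex stand-in for 1 - AUC."""
    diff = _pairwise_differences(scores, labels)
    return float(np.mean(np.maximum(0.0, margin - diff)))

# Toy data (hypothetical): two positives and two negatives.
scores = np.array([2.0, 0.5, 0.4, -1.0])
labels = np.array([1, 1, 0, 0])

auc = empirical_auc(scores, labels)
loss = pairwise_hinge_loss(scores, labels)
print(auc, loss)
```

In this toy example every positive outscores every negative, so the empirical AUC is 1, yet the hinge loss is still positive because one pair is separated by less than the margin. Both quantities are sums over pairs rather than over individual examples, which is what "non-decomposable" refers to, and the number of pairs grows with the product of the class sizes, which hints at why batch solvers scale poorly.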

Item Type: Conference Paper
Publisher: Proceedings of the 17th SIAM International Conference on Data Mining, SDM 2017
Additional Information: The copyright for this article belongs to the Authors.
Department/Centre: Division of Electrical Sciences > Computer Science & Automation
Date Deposited: 14 Jun 2022 05:46
Last Modified: 14 Jun 2022 05:46
URI: https://eprints.iisc.ac.in/id/eprint/73450
