
Evaluating outlier probabilities: assessing sharpness, refinement, and calibration using stratified and weighted measures

Authors: Philipp Röchner, Henrique O. Marques, Ricardo J. G. B. Campello, Arthur Zimek

Data Mining and Knowledge Discovery, Volume 38, Issue 6, Pages 3719–3757

https://doi.org/10.1007/s10618-024-01056-5

Published: 19 July 2024

Abstract

An outlier probability is the probability that an observation is an outlier. Typically, outlier detection algorithms calculate real-valued outlier scores to identify outliers. Converting outlier scores into outlier probabilities increases the interpretability of outlier scores for domain experts and makes outlier scores from different outlier detection algorithms comparable. Although several transformations to convert outlier scores to outlier probabilities have been proposed in the literature, there is no common understanding of good outlier probabilities and no standard approach to evaluate outlier probabilities. We require that good outlier probabilities be sharp, refined, and calibrated. To evaluate these properties, we adapt and propose novel measures that use ground-truth labels indicating which observation is an outlier or an inlier. The refinement and calibration measures partition the outlier probabilities into bins or use kernel smoothing. Compared to the evaluation of probability in supervised learning, several aspects are relevant when evaluating outlier probabilities, mainly due to the imbalanced and often unsupervised nature of outlier detection. First, stratified and weighted measures are necessary to evaluate the probabilities of outliers well. Second, the joint use of the sharpness, refinement, and calibration errors makes it possible to independently measure the corresponding characteristics of outlier probabilities. Third, equiareal bins, where the product of observations per bin times bin length is constant, balance the number of observations per bin and bin length, allowing accurate evaluation of different outlier probability ranges. Finally, we show that good outlier probabilities, according to the proposed measures, improve the performance of the follow-up task of converting outlier probabilities into labels for outliers and inliers.
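To make the binned evaluation more concrete, the following is a minimal Python sketch, not the authors' implementation. It computes a generic binned calibration error (the weighted mean gap between the observed outlier fraction and the mean predicted probability per bin) and the per-bin product of observation count and bin length, which is the quantity that equiareal bins are described as holding constant. The function names `binned_calibration_error` and `bin_areas`, the equal-width bin edges, and the synthetic data are assumptions made for illustration. The stratified and weighted measures proposed in the article are not reproduced here.

```python
import numpy as np


def binned_calibration_error(probs, labels, edges):
    """Weighted mean of |observed outlier fraction - mean probability| over bins.

    A generic binned calibration error, used here as an illustrative stand-in;
    it is not the stratified or weighted measure proposed in the article.
    """
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels, dtype=float)  # 1 = outlier, 0 = inlier
    edges = np.asarray(edges, dtype=float)
    n_bins = len(edges) - 1
    # Assign each probability to a bin; the right-most edge is inclusive.
    idx = np.clip(np.searchsorted(edges, probs, side="right") - 1, 0, n_bins - 1)
    error = 0.0
    for k in range(n_bins):
        mask = idx == k
        if mask.any():
            gap = abs(labels[mask].mean() - probs[mask].mean())
            error += mask.mean() * gap  # weight by the share of observations
    return error


def bin_areas(probs, edges):
    """Number of observations per bin times bin length, for each bin."""
    edges = np.asarray(edges, dtype=float)
    counts, _ = np.histogram(np.asarray(probs, dtype=float), bins=edges)
    return counts * np.diff(edges)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic, imbalanced toy data (assumption): most probabilities are near 0.
    probs = rng.beta(0.5, 5.0, size=1000)
    # Labels are drawn so that the probabilities are calibrated by construction.
    labels = rng.random(1000) < probs
    equal_width = np.linspace(0.0, 1.0, 11)
    print("calibration error:", binned_calibration_error(probs, labels, equal_width))
    print("count x length per bin:", bin_areas(probs, equal_width))
```

With equal-width bins on imbalanced data, the count-times-length product typically differs strongly between bins near 0 and bins near 1. Equiareal bins, as described in the abstract, would instead choose the edges so that this product is the same in every bin.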

