A Weighted Rough Set Approach for Cost-Sensitive Learning

Jinfu Liu²⁴ &
Daren Yu²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4482))

Included in the following conference series:

International Workshop on Rough Sets, Fuzzy Sets, Data Mining, and Granular-Soft Computing

1552 Accesses

Abstract

In many real-world applications, the costs of different errors are often unequal. Therefore, the inclusion of costs into learning, also named cost-sensitive learning, has been regarded as one of the most relevant topics of future machine learning research. Rough set theory is a powerful mathematic tool dealing with inconsistent information for attribute dependence analysis, knowledge reduction and decision rule extraction. However, it is insensitive to the costs of misclassification due to the absence of a mechanism of considering the subjective knowledge. This paper discusses problems connected with introducing the subjective knowledge into rough set learning and proposes a weighted rough set approach for cost-sensitive learning. In this method, weights are employed to represent the subjective knowledge of costs and a weighted information system is defined firstly. With the introduction of weights, weighted attribute dependence analysis is carried out and an index of weighted approximate quality is given. Furthermore, weighted attribute reduction algorithm and weighted rule extraction algorithm are designed to find the reducts and rules with the consideration of weights. Based on the proposed weighted rough set, a series of comparing experimentations with several familiar general techniques on cost-sensitive learning are constructed. The results show that the approach of weighted rough set produces averagely the minimum misclassification costs and the lowest high cost errors.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

A Decision-Theoretic Rough Set Approach to Multi-class Cost-Sensitive Classification

Uncertainty Optimization Based Rough Set and its Applications

Article 18 March 2023

Attribute reduction based on interval-set rough sets

Article 08 January 2024

References

Domingos, P.: MetaCost: A General Method for Making Classifiers Cost-Sensitive. In: Proc. Fifth ACM SIGKDD Int’l Conf. Knowledge Discovery and Data Mining, pp. 155–164 (1999)
Google Scholar
Ting, K.M.: An Instance-Weighting Method to Induce Cost-Sensitive Trees. IEEE Trans. Knowledge and Data Eng. 14(3), 659–665 (2002)
Article MathSciNet Google Scholar
Zhou, Z.-H., Liu, X.-Y.: Training Cost-Sensitive Neural Networks with Methods Addressing the Class Imbalance Problem. IEEE Trans. Knowledge and Data Eng. 18(1), 63–77 (2006)
Article MathSciNet Google Scholar
Wysotzki, F., Geibel, P., Brefeld, U.: Support Vector Machines with Example Dependent Costs. In: Lavrač, N., et al. (eds.) ECML 2003. LNCS (LNAI), vol. 2837, pp. 23–34. Springer, Heidelberg (2003)
Google Scholar
Pawlak, Z.: Rough Sets. International Journal of Computer and Information Sciences 11, 341–356 (1982)
Article MathSciNet MATH Google Scholar
Min, F., Xu, C.: Weighted Reduction for Decision Tables. In: Wang, L., et al. (eds.) FSKD 2006. LNCS (LNAI), vol. 4223, pp. 246–255. Springer, Heidelberg (2006)
Chapter Google Scholar
Ma, T.-H., Tang, M.-L.: Weighted Rough Set Model. In: Sixth International Conference on Intelligent Systems Design and Applications, pp. 481–485 (2006)
Google Scholar
Hu, Q.-H., et al.: Fuzzy Probabilistic Approximation Spaces and Their Information Measures. IEEE Transactions on Fuzzy Systems 14(2), 191–201 (2006)
Article Google Scholar
Batista, G., Prati, R.C., Monard, M.C.: A Study of the Behavior of Several Methods for Balancing Machine Learning Training Data. SIGKDD Explorations 6(1), 20–29 (2004)
Article Google Scholar
Michie, D., Spiegelhalter, D.J., Taylor, C.C.: Machine Learning, Neural and Statistical Classification. Ellis Horwood Limited, New York (1994)
MATH Google Scholar
Grzymala-Busse, J.W.: LERS - A System for Learning From Examples Based on Rough Sets. In: Slowinski, R. (ed.) Intelligent Decision Support, pp. 3–18. Kluwer Academic Publishers, Dordrecht (1992)
Chapter Google Scholar
Blake, C., Keogh, E., Merz, C.J.: UCI Repository of Machine Learning Databases. Dept. of Information and Computer Science, Univ. of California, Irvine (1998), http://www.ics.uci.edu/~mlearn/MLRepository.html
Fayyad, U., Irani, K.: Discretizing Continuous Attributes While Learning Bayesian Networks. In: Proc. Thirteenth International Conference on Machine Learning, pp. 157–165. Morgan Kaufmann, San Francisco (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

Harbin Institute of Technology, 150001 Harbin, China
Jinfu Liu & Daren Yu

Authors

Jinfu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Daren Yu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Computer Science and Engineering, York University, M3J 1P3, Toronto, Ontario, Canada
Aijun An
Institute of Computing Sciences, Poznań University of Technology, ul. Piotrowo 2, 60–965, Poznań, Poland
Jerzy Stefanowski
Department of Applied Computer Science, University of Winnipeg, R3B 2E9, Winnipeg, Manitoba, Canada
Sheela Ramanna
Department of Computer Science, University of Regina, S4S 0A2, Regina, Saskatchewan, Canada
Cory J. Butz
Department of Electrical and Computer Engineering, University of Alberta, T6G 2V4, Edmonton, Alberta, Canada
Witold Pedrycz
Institute of Compuer Science and Technology, Chongqing University of Posts and Telecommunications, 40065, Chongqing, P.R. China
Guoyin Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Liu, J., Yu, D. (2007). A Weighted Rough Set Approach for Cost-Sensitive Learning. In: An, A., Stefanowski, J., Ramanna, S., Butz, C.J., Pedrycz, W., Wang, G. (eds) Rough Sets, Fuzzy Sets, Data Mining and Granular Computing. RSFDGrC 2007. Lecture Notes in Computer Science(), vol 4482. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72530-5_42

Download citation

DOI: https://doi.org/10.1007/978-3-540-72530-5_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72529-9
Online ISBN: 978-3-540-72530-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics