Abstract
Rough Set theory and its extensions are found to be very effective in Knowledge Discovery in Data Bases. A number of tools are available in RST to solve Data Mining tasks such as clustering, rule mining, handling missing values and elimination of redundant data. In Data Mining handling of information tables with missing data values plays a very important role as missing values reduces the quality of information extracted. In this paper we discuss three different approaches to handle missing values. First method is called RSFit approach which predicts the missing attribute values based on a distance function.Second method, called Characteristic set based approach, provides decision rules from incomplete information systems. Finally a novel approach is introduced for constructing decision rules. This is based on a similarity relation. Experiment with small data set shows that the new approach is slightly better than the second method.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Pal, S.K., Skowron, A.(eds.): Rough Fuzzy Hybridization: A New Trend in Decision Making. Springer, Singapore (1999)
Lin, T.Y., Yao, Y.Y., Zadeh, L.A.(eds.): Data Mining, Rough Sets and Granular Computing. Physica-Verlag (2002)
Ziarko, W.: Rough Sets as a Methodology for Data Mining, Rough Sets in Knowledge discovery. Methodology and Applications, pp. 554–576. Physica-Verlag (1998)
Magnani, M.: Technical Report on Rough Set Theory for Knowledge Discovery in Data Bases (2003), http://magnanim.web.cs.unibo.it/data/pdf/roughkdd.pdf
Duntsch, I., Gediga, G., Nguyen, H.S.: Rough set data analysis in the KDD process, citeseer.nj.nec.com/387773.html
Li, J., Cercone, N.: Assigning Missing Attribute Values Based on Rough Sets Theory. In: IEEE Int. Conf. on Granular. Computing, pp. 607–610 (2006)
Grzymala-Busse, J.W., Siddhaye, S.: Rough Set Approaches to Rule Induction from Incomplete Data. In: Proc. of IPMU, Italy, vol. 2, pp. 923–930 (2004)
Rady, E.A., Abd El-Monsef, M.M.E., Abd El-Latif, W.A.: A Modified Rough Set Approach to Incomplete Information Systems. J. of Applied Mathematics and Decision Sciences (2007)
Rough Sets, http://en.wikipedia.org/wiki/Rough_set
Pawlak, Z., Grzymala-Busse, J., Slowinski, R., Ziarko, W.: Rough Sets. Communications of the ACM 38(11), 89–95 (1995)
Li, J., Cercone, N.: Discovering and Ranking Important Rules. In: Proc. of IEEE International Conf. on Granular Computing, Canada, vol. 2, pp. 506–511 (2005)
Grzymala-Busse, J.W.: Rough Set Theory with Applications to Data Mining. Real World Applications of Computational Intelligence. Springer, Heidelberg (2004)
Little, R., Rubin, D.: Statistical Analysis with Missing data, 2nd edn. Wiley, Chichester (2002)
Shalabi, L.A., Najjar, M., Kayed, A.A.: A Frame Work to deal with Missing Data in Data sets. Journal of Computer Science 2(9), 740–745 (2007)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sabu, M.K., Raju, G. (2008). Rough Set Approaches for Mining Incomplete Information Systems. In: Huang, DS., Wunsch, D.C., Levine, D.S., Jo, KH. (eds) Advanced Intelligent Computing Theories and Applications. With Aspects of Artificial Intelligence. ICIC 2008. Lecture Notes in Computer Science(), vol 5227. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85984-0_110
Download citation
DOI: https://doi.org/10.1007/978-3-540-85984-0_110
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85983-3
Online ISBN: 978-3-540-85984-0
eBook Packages: Computer ScienceComputer Science (R0)