[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Accurate and fast prototype selection based on the notion of relevant and border prototypes

Published: 01 January 2018 Publication History

Abstract

In supervised classification, a training set is given to a classifier to learn a decision rule for classifying unseen cases. When large training sets are processed, the training stage becomes slow especially for instance-based learning. However, not all information in a training set is useful for classification because it could contain either redundant or noisy prototypes. Therefore a process for discarding useless prototypes is required; this process is known as prototype selection. In this work, we present some methods for selecting prototypes based on prototype relevance, which are accurate and fast for large datasets; in addition, our methods can be applied over datasets described by nominal features. We report experimental results showing the effectiveness of our methods as well as a comparison against other successful prototype selection methods.

References

[1]
Kuncheva L.I. and Bezdek J.C., Nearest prototype classification: Clustering. Genetic algorithms. Or random search? IEEE Transactions on Systems. Man and Cybernetics 28-1(Part C) (1998), 160–164.
[2]
Olvera-López J.A., Carrasco-Ochoa J.A., Martínez-Trinidad J.F. Prototype Selection Via Prototype Relevance Ruiz-Shulcloer J. and Kropatsch W.G. CIARP LNCS 153–160.
[3]
Cover T. and Hart P., Nearest neighbor pattern classification, IEEE Transactions on Information Theory 13 (1967), 21–27.
[4]
Atkeson C.G., Moorel A.W. and Schaal S., Locally weighted learning, Artificial Intelligence Review 11(1-5) (1997), 11–73.
[5]
Vapnik V., The Nature of Statistical Learning Theory, Springer-Verlag New York 1995.
[6]
Quinlan J.R., C4.5: Programs for Machine Learning Morgan Kaufmann 1993.
[7]
Hart P.E., The condensed nearest neighbor rule, IEEE Transactions on Information Theory 14 (1968), 515–516.
[8]
Chien-Hsing C., Bo-Han K. and Fu C., The Generalized Condensed Nearest Neighbor Rule as A Data Reduction Method, in: 18th International Conference on Pattern Recognition (2006),2, 556–559.
[9]
Wilson D.L., Asymptotic properties of nearest neighbor rules using edited data, IEEE Transactions on Systems, Man and Cybernetics 2 (1972), 408–421.
[10]
Devijver P.A. and Kittler J., On the edited nearest neighbor rule, In Proceedings of the 5th International Conference on Pattern Recognition (1980), 72–80.
[11]
Wilson D.R. and Martínez T.R., Reduction techniques for instance-based learning algorithms, Machine Learning 38 (2000), 257–286.
[12]
Brighton H. and Mellish C., Advances in instance selection for instance-based learning algorithms, Data Mining and Knowledge Discovery 6(2) (2002), 153–172.
[13]
Yuangui L., Zhonhui H., Yunze C., Weidong Z., Support Vector Based Prototype Selection Method for Nearest Neighbor Rules, Wang L. et al., ICNIC 2005, LNCS 3610 (2005), 528–535.
[14]
Garain U., Prototype reduction using an artificial immune model, Pattern Analysis and Applications 11 (2008), 353–363.
[15]
García S., Cano J.R. and Herera F., A memetic algorithm for evolutionary prototype selection: A scaling up approach, Pattern Recognition 41(8) (2008), 2693–2709.
[16]
García-Pedrajas N. and Romero del Castillo J.A., and Ortíz-Boyer D., A cooperative coevolutionary algorithm for instance selection for instance-based learning, Machine Learning 78(3) (2010), 381–420.
[17]
Vallejo C.G., Troyano J.A. and Ortega J., InstanceRank: Bringing order datasets, Pattern Recognition Letters 31 (2010), 133–142.
[18]
Bezdek J.C. and Kuncheva L.I., Nearest prototype classifier designs: An experimental study, International Journal of Intelligent Systems 16(12) (2001), 1445–1473.
[19]
Liu H. and Motoda H., On issues of instance selection, Data Mining and Knowledge Discovery 6 (2002), 115–130.
[20]
Spillmann B., Neuhaus M., Bunke H., Pękalska E., Duin R.P.W., Transforming Strings to Vector Spaces Using Prototype Selection, Yeung D.-Y., et al. SSPR&SPR 2006 LNCS 4109 (2006), 287–296.
[21]
Caises Y., González A., Leyva E., Pérez R. SCIS: Combining Instance Selection Methods to Increase Their Effectiveness over a Wide Range of Domains, Corchado E. and Yin H., IDEAL 2009 LNCS 5788 (2009), 17–24.
[22]
Lumini A. and Nanni L., A clustering method for automatic biometric template selection, Pattern Recognition 39(3) (2006), 495–497.
[23]
Olvera-López J.A., Carrasco-Ochoa J.A. and Martínez-Trinidad J.F., A new Fast prototype selection method based on clustering, Pattern Analysis and Applications 13(2) (2010), 131–141.
[24]
Leyva E., González A. and Pérez R., Three new instance selection methods based on local sets: A comparative study with several approaches from a bi-objective perspective, Pattern Recognition 48(4) (2015), 1523–1537.
[25]
Hamidzadeh J., Monsefi R. and Yazdi H.S., IRAHC: Instance reduction algorithm using hyperrectangle clustering, Pattern Recognition 48(5) (2015), 1878–1889.
[26]
Ougiaroglou S. and Evangelidis G., Fast and accurate k-nearest neighbor classification using prototype selection by clustering, In: 16th IEEE Panhellenic Conference on Informatics (2012), pp 168–173.
[27]
Hernandez-Leal P., Carrasco-Ochoa J.A., Martínez-Trinidad J.F. and Olvera-Lopez J.A., InstanceRank based on borders for instance selection, Pattern Recognition 46(1) (2013), 365–375.
[28]
De Haro-García A. and García-Pedrajas N., A divide-and-conquer approach for scaling up instance selection algorithm, Data Mining and Knowledge Discovery 18(3) (2009), 392–418.
[29]
De Haro-García A., García-Pedrajas N. and del Castillo J.A., Large scale instance selection by means of federal instance selection, Data & Knowledge Engineering 75 (2012), 58–77.
[30]
Swonger C.W., Sample set condensation for a condensed nearest neighbour decision rule for pattern recognition, Watanabe S., In Frontiers of Pattern Recognition (1972), 511–519.
[31]
Bache K. and Lichman M., UCI Machine Learning Repository, Irvine CA University of California, School of Information and Computer Science (2013) http://archive.ics.uci.edu/ml.
[32]
Wilson D.R. and Martínez T.R., Improved heterogeneous distance functions, Journal of Artificial Intelligence Research 6(1) (1997), 1–34.
[33]
Hall M., Frank E., Holmes G., Pfahringer B., Reutemann P. and Witten I.H., The WEKA data mining software: An update, ACM SIGKDD Explorations News Letter 11(1) (2009), 10–18.
[34]
Demšar J., Statistical comparisons of classifiers over multiple data sets, Journal of Machine Learning Research 7 (2006), 1–30.
[35]
Zhai J., Li T. and Wnag X., A cross-selection instance algorithm, Journal of Intelligent & Fuzzy Systems 30 (2016), 717–728.
[36]
Liu C., Wang W., Lv F. and Konan M., An efficient instance selection algorithm to reconstruct training sets for support vector machine, Knowledge-Based Systems 116 (2017), 58–73.

Cited By

View all
  • (2023)Class Representatives Selection in Non-metric Spaces for Nearest Prototype ClassificationSimilarity Search and Applications10.1007/978-3-031-46994-7_10(111-124)Online publication date: 9-Oct-2023

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology  Volume 34, Issue 5
Intelligent and Fuzzy Systems applied to Language & Knowledge Engineering
2018
528 pages

Publisher

IOS Press

Netherlands

Publication History

Published: 01 January 2018

Author Tags

  1. Prototype selection
  2. prototype relevance
  3. border prototypes

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Class Representatives Selection in Non-metric Spaces for Nearest Prototype ClassificationSimilarity Search and Applications10.1007/978-3-031-46994-7_10(111-124)Online publication date: 9-Oct-2023

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media