[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Prototype Selection for Nearest Neighbor Classification: Taxonomy and Empirical Study

Published: 01 March 2012 Publication History

Abstract

The nearest neighbor classifier is one of the most used and well-known techniques for performing recognition tasks. It has also demonstrated itself to be one of the most useful algorithms in data mining in spite of its simplicity. However, the nearest neighbor classifier suffers from several drawbacks such as high storage requirements, low efficiency in classification response, and low noise tolerance. These weaknesses have been the subject of study for many researchers and many solutions have been proposed. Among them, one of the most promising solutions consists of reducing the data used for establishing a classification rule (training data) by means of selecting relevant prototypes. Many prototype selection methods exist in the literature and the research in this area is still advancing. Different properties could be observed in the definition of them, but no formal categorization has been established yet. This paper provides a survey of the prototype selection methods proposed in the literature from a theoretical and empirical point of view. Considering a theoretical point of view, we propose a taxonomy based on the main characteristics presented in prototype selection and we analyze their advantages and drawbacks. Empirically, we conduct an experimental study involving different sizes of data sets for measuring their performance in terms of accuracy, reduction capabilities, and runtime. The results obtained by all the methods studied have been verified by nonparametric statistical tests. Several remarks, guidelines, and recommendations are made for the use of prototype selection for nearest neighbor classification.

Cited By

View all
  • (2025)SpISInformation Sciences: an International Journal10.1016/j.ins.2024.121738695:COnline publication date: 1-Mar-2025
  • (2024)Weighted distance nearest neighbor condensingProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692713(16153-16166)Online publication date: 21-Jul-2024
  • (2024)A selective LVQ algorithm for improving instance reduction techniques and its application for text classificationJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-23529046:5-6(11353-11366)Online publication date: 24-Oct-2024
  • Show More Cited By
  1. Prototype Selection for Nearest Neighbor Classification: Taxonomy and Empirical Study

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image IEEE Transactions on Pattern Analysis and Machine Intelligence
      IEEE Transactions on Pattern Analysis and Machine Intelligence  Volume 34, Issue 3
      March 2012
      205 pages

      Publisher

      IEEE Computer Society

      United States

      Publication History

      Published: 01 March 2012

      Author Tags

      1. Prototype selection
      2. classification.
      3. condensation
      4. edition
      5. nearest neighbor
      6. taxonomy

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 05 Mar 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2025)SpISInformation Sciences: an International Journal10.1016/j.ins.2024.121738695:COnline publication date: 1-Mar-2025
      • (2024)Weighted distance nearest neighbor condensingProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692713(16153-16166)Online publication date: 21-Jul-2024
      • (2024)A selective LVQ algorithm for improving instance reduction techniques and its application for text classificationJournal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology10.3233/JIFS-23529046:5-6(11353-11366)Online publication date: 24-Oct-2024
      • (2024)Prototype-Based Semantic SegmentationIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/TPAMI.2024.338711646:10(6858-6872)Online publication date: 1-Oct-2024
      • (2024)Instance Selection via Voronoi Neighbors for Binary Classification TasksIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2023.332895236:8(3921-3933)Online publication date: 1-Aug-2024
      • (2024)Boundary-Aware Prototype in Semi-Supervised Medical Image SegmentationIEEE Transactions on Image Processing10.1109/TIP.2024.346341233(5456-5467)Online publication date: 1-Jan-2024
      • (2024)Reduction Through Homogeneous Clustering: Variations for Categorical Data and Fast Data ReductionSN Computer Science10.1007/s42979-024-03007-95:6Online publication date: 25-Jun-2024
      • (2024)IDS-FRNN: an intrusion detection system with optimized fuzziness-based sample selection techniqueNeural Computing and Applications10.1007/s00521-024-10333-936:36(22789-22803)Online publication date: 1-Dec-2024
      • (2024)Enhancing dynamic ensemble selection: combining self-generating prototypes and meta-classifier for data classificationNeural Computing and Applications10.1007/s00521-024-10237-836:32(20295-20320)Online publication date: 1-Nov-2024
      • (2024)Fault distance estimation for transmission lines with dynamic regressor selectionNeural Computing and Applications10.1007/s00521-023-09155-y36:4(1741-1759)Online publication date: 1-Feb-2024
      • Show More Cited By

      View Options

      View options

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media