[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2490257.2490260acmotherconferencesArticle/Chapter ViewAbstractPublication PagesbciConference Proceedingsconference-collections
research-article

AIB2: an abstraction data reduction technique based on IB2

Published: 19 September 2013 Publication History

Abstract

Data reduction improves the efficiency of k-NN classifier on large datasets since it accelerates the classification process and reduces storage requirements for the training data. IB2 is an effective data reduction technique that selects some training items form the initial dataset and uses them as representatives (prototypes). Contrary to many other data reduction techniques, IB2 is a very fast, one-pass method that builds its reduced (condensing) set in an incremental manner. New training data can update the condensing set without the need of the "old" removed items. This paper proposes AIB2, a variation of IB2, which generates new prototypes instead of selecting them. AIB2 attempts to improve the efficiency of IB2 by positioning the prototypes in the center of the data areas they represent. The experimental study shows that AIB2 performs better than IB2.

References

[1]
C. Aggarwal. Data Streams: Models and Algorithms. Advances in Database Systems Series. Springer Science+Business Media, LLC, 2007.
[2]
D. W. Aha. Tolerating noisy, irrelevant and novel attributes in instance-based learning algorithms. Int. J. Man-Mach. Stud., 36(2):267--287, Feb. 1992.
[3]
D. W. Aha, D. Kibler, and M. K. Albert. Instance-based learning algorithms. Mach. Learn., 6(1):37--66, Jan. 1991.
[4]
J. Alcalá-Fdez, A. Fernández, J. Luengo, J. Derrac, and S. García. Keel data-mining software tool: Data set repository, integration of algorithms and experimental analysis framework. Multiple-Valued Logic and Soft Computing, 17(2-3):255--287, 2011.
[5]
J. Beringer and E. Hüllermeier. Efficient instance-based learning on data streams. Intell. Data Anal., 11(6):627--650, Dec. 2007.
[6]
B. V. Dasarathy. Nearest neighbor (NN) norms: NN pattern classification techniques. IEEE Computer Society Press, 1991.
[7]
J. Demšar. Statistical comparisons of classifiers over multiple data sets. J. Mach. Learn. Res., 7:1--30, Dec. 2006.
[8]
S. Garcia, J. Derrac, J. Cano, and F. Herrera. Prototype selection for nearest neighbor classification: Taxonomy and empirical study. IEEE Trans. Pattern Anal. Mach. Intell., 34(3):417--435, Mar. 2012.
[9]
M. Grochowski and N. Jankowski. Comparison of instance selection algorithms ii. results and comments. In Artificial Intelligence and Soft Computing - ICAISC 2004, volume 3070 of LNCS, pages 580--585. Springer Berlin/Heidelberg, 2004.
[10]
P. E. Hart. The condensed nearest neighbor rule. IEEE Transactions on Information Theory, 14(3):515--516, 1968.
[11]
N. Jankowski and M. Grochowski. Comparison of instances seletion algorithms i. algorithms survey. In Artificial Intelligence and Soft Computing - ICAISC 2004, volume 3070 of LNCS, pages 598--603. Springer Berlin/Heidelberg, 2004.
[12]
J. A. Olvera-López, J. A. Carrasco-Ochoa, J. F. Martínez-Trinidad, and J. Kittler. A review of instance selection methods. Artif. Intell. Rev., 34(2):133--143, Aug. 2010.
[13]
H. Samet. Foundations of multidimensional and metric data structures. The Morgan Kaufmann series in computer graphics. Elsevier/Morgan Kaufmann, 2006.
[14]
I. Triguero, J. Derrac, and S. G. andFrancisco Herrera. A taxonomy and experimental study on prototype generation for nearest neighbor classification. IEEE Transactions on Systems, Man, and Cybernetics, Part C, 42(1):86--100, 2012.
[15]
A. Tsymbal. The problem of concept drift: definitions and related work. Technical Report TCD-CS-2004-15, The University of Dublin, Trinity College, Department of Computer Science, Dublin, Ireland, 2004.

Cited By

View all
  • (2021)Prototype Selection and Generation with Minority Classes Preservation2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)10.1109/IISA52424.2021.9555514(1-8)Online publication date: 12-Jul-2021
  • (2015)Applying Prototype Selection and Abstraction Algorithms for Efficient Time-Series ClassificationArtificial Neural Networks10.1007/978-3-319-09903-3_16(333-348)Online publication date: 2015

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
BCI '13: Proceedings of the 6th Balkan Conference in Informatics
September 2013
293 pages
ISBN:9781450318518
DOI:10.1145/2490257
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

  • University of Macedonia
  • Aristotle University of Thessaloniki
  • The University of Sheffield: The University of Sheffield
  • Greek Com Soc: Greek Computer Society
  • SEERC: South-East European Research Centre
  • Alexander TEI of Thessaloniki

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 September 2013

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. k-NN classification
  2. data reduction
  3. prototypes

Qualifiers

  • Research-article

Conference

BCI '13
Sponsor:
  • The University of Sheffield
  • Greek Com Soc
  • SEERC
BCI '13: Balkan Conference in Informatics
September 19 - 21, 2013
Thessaloniki, Greece

Acceptance Rates

BCI '13 Paper Acceptance Rate 41 of 103 submissions, 40%;
Overall Acceptance Rate 97 of 250 submissions, 39%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2021)Prototype Selection and Generation with Minority Classes Preservation2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA)10.1109/IISA52424.2021.9555514(1-8)Online publication date: 12-Jul-2021
  • (2015)Applying Prototype Selection and Abstraction Algorithms for Efficient Time-Series ClassificationArtificial Neural Networks10.1007/978-3-319-09903-3_16(333-348)Online publication date: 2015

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media