[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ Skip to main content
Log in

A Binary Integer Program to Maximize the Agreement Between Partitions

  • Published:
Journal of Classification Aims and scope Submit manuscript

Abstract

This research note focuses on a problem where the cluster sizes for two partitions of the same object set are assumed known; however, the actual assignments of objects to clusters are unknown for one or both partitions. The objective is to find a contingency table that produces maximum possible agreement between the two partitions, subject to constraints that the row and column marginal frequencies for the table correspond exactly to the cluster sizes for the partitions. This problem was described by H. Messatfa (Journal of Classification, 1992, pp. 5–15), who provided a heuristic procedure based on the linear transportation problem. We present an exact solution procedure using binary integer programming. We demonstrate that our proposed method efficiently obtains optimal solutions for problems of practical size.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Similar content being viewed by others

References

  • ALBATINEH, A.N., NIEWIADORMSKA-BUGAJ, M., and MIHALKO, D. (2006), “On Similarity Indices and Correction for Chance Agreement,” Journal of Classification, 23, 301–313.

    Article  MathSciNet  Google Scholar 

  • COHEN, J. (1960), “A Coefficient of Agreement for Nominal Scales,” Educational and Psychological Measurement, 20, 37–46.

    Article  Google Scholar 

  • COSARES, S., and HOCHBAUM, D.S. (1994), “Strongly Polynomial Algorithms for the Quadratic Transportation Problem with a Fixed Number of Sources,” Mathematics of Operations Research, 19, 94–111.

    Article  MATH  MathSciNet  Google Scholar 

  • FOWLKES. E.B., and MALLOWS, C.L. (1983), “A Method for Comparing Two Hierarchical Clusterings,” Journal of the American Statistical Association, 78, 553–569.

    Article  MATH  Google Scholar 

  • GOODMAN, L., and KRUSKAL, W. (1979), Measures of Association for Cross Classification, New York: Springer-Verlag.

    Google Scholar 

  • HOCHBAUM, D.S., SHAMIR, R., and SHANTHIKUMAR, J.G. (1992), “A Polynomial Algorithm for an Integer Quadratic Non-Separable Transportation Problem,” Mathematical Programming, 55, 359–371.

    Article  MATH  MathSciNet  Google Scholar 

  • HOFFMAN, A.J., and WIELANDT, H.W. (1953), “The Variation of the Spectrum of a Normal Matrix,” Duke Mathematical Journal, 20, 37–39.

    Article  MATH  MathSciNet  Google Scholar 

  • HUBERT, L.J. (1987), Assignment Methods in Combinatorial Data Analysis, New York: Marcel Dekker.

    MATH  Google Scholar 

  • HUBERT, L., and ARABIE, P. (1985), “Comparing Partitions,” Journal of Classification, 2, 193–218.

    Article  Google Scholar 

  • HUBERT, L.J., and BAKER, F.J. (1978),. “Evaluating the Conformity of Sociometric Measurements,” Psychometrika, 43, 31–41.

    Article  MathSciNet  Google Scholar 

  • ILOG CPLEX 6.5 (1999), User’s Manual, Mountain View, CA: ILOG, Inc.

    Google Scholar 

  • JACCARD, P. (1912), “The Distribution of the Flora in the Alpine Zone,” The New Phytologist, 11, 37–50.

    Article  Google Scholar 

  • KATZ, L., and POWELL, J.H. (1953), “A Proposed Index of the Conformity of One Sociometric Measurement to Another,” Psychometrika, 18, 249–256.

    Article  MATH  Google Scholar 

  • LIGHT, R.J. (1971), “Measures of Response Agreement for Categorical Data: Some Generalizations and Alternatives,” Psychological Bulletin, 66, 376–390.

    MathSciNet  Google Scholar 

  • MESSATFA, H. (1992), “An Algorithm to Maximize the Agreement Between Partitions,” Journal of Classification, 9, 5–15.

    Article  MATH  MathSciNet  Google Scholar 

  • RAND, W.M. (1971), “Objective Criteria for the Evaluation of Clustering Methods,” Journal of the American Statistical Association, 66, 846–850.

    Article  Google Scholar 

  • STEINLEY, D. (2004), “Properties of the Hubert-Arabie Adjusted Rand Index,” Psychological Methods, 9, 386–396.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michael J. Brusco.

Additional information

We would like to thank the Editor, Willem Heiser, and an anonymous reviewer for helpful comments that resulted in improvements of this article.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Brusco, M.J., Steinley, D. A Binary Integer Program to Maximize the Agreement Between Partitions. J Classif 25, 185–193 (2008). https://doi.org/10.1007/s00357-008-9013-9

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00357-008-9013-9

Keywords

Navigation