Visualising Departures from Symmetry and Bowker’s X2 Statistic
<p>Correspondence plot that visually examines the departure from symmetry for <a href="#symmetry-14-01103-t001" class="html-table">Table 1</a>; <math display="inline"><semantics> <mrow> <mi>C</mi> <mo>=</mo> <mn>50</mn> </mrow> </semantics></math>.</p> "> Figure 2
<p>Correspondence plot that visually examines the departure from symmetry for <a href="#symmetry-14-01103-t001" class="html-table">Table 1</a>; <math display="inline"><semantics> <mrow> <mi>C</mi> <mo>=</mo> <mn>75</mn> </mrow> </semantics></math>.</p> "> Figure 3
<p>Correspondence plot that visually examines the departure from symmetry for <a href="#symmetry-14-01103-t001" class="html-table">Table 1</a>; <math display="inline"><semantics> <mrow> <mi>C</mi> <mo>=</mo> <mn>100</mn> </mrow> </semantics></math>.</p> "> Figure 4
<p>Correspondence plot that visually examines the departure from symmetry for <a href="#symmetry-14-01103-t001" class="html-table">Table 1</a>; <math display="inline"><semantics> <mrow> <mi>C</mi> <mo>=</mo> <mn>150</mn> </mrow> </semantics></math>.</p> "> Figure 5
<p>Correspondence plot that visually examines the departure from symmetry for <a href="#symmetry-14-01103-t001" class="html-table">Table 1</a>; <math display="inline"><semantics> <mrow> <mi>C</mi> <mo>=</mo> <mn>150</mn> </mrow> </semantics></math>.</p> ">
Abstract
:1. Introduction
2. On the Classical Approach
3. On Studying the Symmetry of a Categorical Variable
4. On Bowker’s Residuals and Departures from Symmetry
5. Correspondence Analysis and Bowker’s Statistic
5.1. On the SVD of the Matrix of the Bowker Residuals
5.2. The Total Inertia
5.3. The Principal Coordinates
5.4. On the Origin and Transition Formulae
5.5. Intra-Variable Distances
6. Example 1 Artificial Data
6.1. The Data
6.2. Preliminary Examination of the Departure from Symmetry
6.3. Features of Correspondence Analysis & Symmetry
7. Example 2 on the Purchase of Decaffeinated Coffee
- the purchase of the five coffee brands is different across the first and second purchases and so reflects the departure from symmetry that Bowker’s statistic shows,
- the greatest departure from symmetry is for the coffee brand “High Point” since “HP” and “hp” lie furthest from the origin than any of the four remaining brands. Thus, it is this brand that has undergone the greatest difference in purchasing preference over the two time periods,
- the coffee brands are ordered as follows based on the greatest to least departure from perfect symmetry: “High Point”, “Taster’s Choice”, ”Sanka”, “Nescafé” and “Brim”,
- therefore, “Brim” is the coffee brand that has the most similar purchasing pattern across the two time periods when the brands were purchased.
- the purchasing preferences of the brands “Sanka” and “Taster’s Choice” are very similar on their first purchase as well as on their second purchase. This can be seen because of the close proximity of “sa” and “tc” on the left of the plot, and “SA” and “TC” on the right of the plot,
- the purchasing preferences of the brands “High Point” and “Nescafé” are similar (although not as similar as “SA” and “TC”) within each of the two purchases.
8. Discussion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
SVD | Singular value decomposition |
Appendix A. The Singular Values of a 2 × 2 S Matrix
Appendix B. The Singular Values of a 3 × 3 S Matrix
Appendix C. R Code
- N—the two-way contingency table of size , where ,
- scaleplot—rescales the limit of the axes used to construct the two-dimensional correspondence plot. By default, scaleplot = 1.2,
- dim1—the first dimension of the correspondence plot. By default, dim1 = 1 so that the first dimension is depicted horizontally, and
- dim2—the second dimension of the correspondence plot. By default, dim2 = 2 so that the second dimension is depicted vertically
- the contingency table under investigation, N,
- the matrix of Bowker residuals, s, where the elements are defined by (7),
- Bowker’s chi-squared statistic defined by (6), Bowker.X2, and its p-value, P.Value, and
- the principal inertia value for each of the M dimensions, Principal.Inertia, the percentage of the total inertia accounted for by each of these dimensions, Perc.Inertia, and the cumulative percentage of the M principal inertia values, Cumm.Inertia.
References
- Agresti, A. Categorical Data Analysis, 2nd ed.; Wiley: New York, NY, USA, 2002. [Google Scholar]
- Anderson, E.B. The Statistical Analysis of Categorical Data; Springer: Berlin/Heidelberg, Germany, 1991. [Google Scholar]
- Bove, G. Asymmetrical multidimensional scaling and correspondence analysis for square tables. Stat. Appl. 1992, 4, 587–598. [Google Scholar]
- De Falguerolles, A.; van der Heijden, P.G.M. Reduced rank quasi-symmetry and quasi-skew symmetry: A generalized bi-linear model approach. Ann. Fac. Sci. Toulouse 2002, 11, 507–524. [Google Scholar] [CrossRef]
- Iki, K.; Yamamoto, K.; Tomizawa, S. Quasi-diagonal exponent symmetry model for square contingency tables with ordered categories. Stat. Probab. Lett. 2014, 92, 33–38. [Google Scholar] [CrossRef] [Green Version]
- Tomizawa, S. Two kinds of measures of departure from symmetry in square contingency tables having nominal categories. Stat. Sin. 1994, 4, 325–334. [Google Scholar]
- Yamamoto, H. A measure of departure from symmetry for multi-way contingency tables with nominal categories. Jpn. J. Biom. 2004, 25, 69–88. [Google Scholar] [CrossRef] [Green Version]
- Yamamoto, K.; Shimada, F.; Tomizawa, S. Measure of departure from symmetry for the analysis of collapsed square contingency tables with ordered categories. J. Appl. Stat. 2015, 42, 866–875. [Google Scholar] [CrossRef]
- Arellano-Valle, R.B.; Contreras-Reyes, J.E.; Stehlik, M. Generalized skew-normal negentropy and its application to fish condition factor time series. Entropy 2017, 19, 528. [Google Scholar] [CrossRef] [Green Version]
- Nishisato, S. Optimal Quantification and Symmetry; Springer: Singapore, 2022. [Google Scholar]
- Constantine, A.G.; Gower, J.C. Graphical representation of asymmetry. Appl. Stat. 1978, 27, 297–304. [Google Scholar] [CrossRef]
- Gower, J.C. The analysis of asymmetry and orthogonality. In Recent Developments in Statistics; Barra, J.R., Brodeau, F., Romer, G., van Cutsem, B., Eds.; North-Holland: Amsterdam, The Netherlands, 1977; pp. 109–123. [Google Scholar]
- Greenacre, M. Correspondence analysis of square asymmetric matrices. J. R. Stat. Soc. Ser. C Appl. Stat. 2000, 49, 297–310. [Google Scholar] [CrossRef]
- Bowker, A.H. A test for symmetry in contingency tables. J. Am. Stat. Assoc. 1948, 43, 572–598. [Google Scholar] [CrossRef] [PubMed]
- Grover, R.; Srinivasan, V. A simultaneous approach to market segmentation and market structuring. J. Mark. Res. 1987, 24, 129–153. [Google Scholar] [CrossRef]
- Beh, E.J.; Lombardo, R. Correspondence Analysis: Theory, Practice and New Strategies; Wiley: Chichester, UK, 2014. [Google Scholar]
- McNemar, Q. Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 1947, 12, 153–157. [Google Scholar] [CrossRef] [PubMed]
- Bishop, Y.M.; Fienberg, S.E.; Holland, P.W. Discrete Multivariate Analysis: Theory and Practice; MIT Press: Cambridge, MA, USA, 1975. [Google Scholar]
- Lancaster, H.O. The Chi-squared Distribution; Wiley: Sydney, Australia, 1969. [Google Scholar]
- Plackett, R.L. The Analysis of Categorical Data; Charles Griffin and Company, Limited: London, UK, 1974. [Google Scholar]
- Cressie, N.A.C.; Read, T.R.C. Multinomial goodness-of-fit tests. J. R. Stat. Soc. Ser. B 1984, 46, 440–464. [Google Scholar] [CrossRef]
- Beh, E.J.; Lombardo, R. Correspondence Analysis and the Cressie–Read Family of Divergence Statistics. National Institute for Applied Statistics Reasearch Australia (NIASRA) Working Paper Series. 2022. Available online: https://www.uow.edu.au/niasra/publications/ (accessed on 19 May 2022).
- Ward, R.C.; Gray, L.J. Eigensystem computation for skew-symmetric matrices and a class of symmetric matrices. Acm Trans. Math. Softw. 1978, 4, 278–285. [Google Scholar] [CrossRef]
- Murnaghan, F.D.; Wintner, A. A canonical form for real matrices under orthogonal transformations. Proc. Natl. Acad. Sci. USA 1931, 17, 417–420. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Paardekooper, M.H.C. An eigenvalue algorithm for skew-symmetric matrices. Numer. Math. 1971, 17, 189–202. [Google Scholar] [CrossRef]
- Agresti, A. An Introduction to Categorical Data Analysis, 3rd ed.; Wiley: New York, NY, USA, 2019. [Google Scholar]
- Freeman, M.F.; Tukey, J.W. Transformations related to the angular and square root. Ann. Math. Stat. 1950, 21, 607–611. [Google Scholar] [CrossRef]
- Hubert, L.; Arabie, P. Comparing partitions. J. Classif. 1985, 2, 193–218. [Google Scholar] [CrossRef]
C1 | C2 | C3 | C4 | |
---|---|---|---|---|
R1 | 10 | 20 | 30 | 40 |
R2 | 20 + C | 50 | 60 | 70 |
R3 | 30 | 60 | 20 | 40 |
R4 | 40 | 70 | 40 | 80 |
C | ||||
---|---|---|---|---|
Output | 50 | 75 | 100 | 150 |
27.778 | 48.913 | 71.429 | 118.421 | |
0.038 | 0.065 | 0.092 | 0.143 | |
0.138 | 0.180 | 0.214 | 0.267 | |
−0.333 | −0.422 | −0.488 | −0.582 | |
−0.248 | −0.321 | −0.378 | −0.464 | |
−0.333 | −0.422 | −0.488 | −0.582 | |
0.248 | 0.321 | 0.378 | 0.464 |
Second Purchase | ||||||
---|---|---|---|---|---|---|
First | High Pt | Taster’s | Sanka | Nescafé | Brim | Total |
Purchase | (hp) | (ta) | (sa) | (ne) | (br) | |
High Point (HP) | 93 | 17 | 44 | 7 | 10 | 171 |
Taster’s Choice (TC) | 9 | 46 | 11 | 0 | 9 | 75 |
Sanka (SA) | 17 | 11 | 155 | 9 | 12 | 204 |
Nescafé | 6 | 4 | 9 | 15 | 2 | 36 |
Brim | 10 | 4 | 12 | 2 | 27 | 55 |
Total | 135 | 82 | 231 | 33 | 60 | 541 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Beh, E.J.; Lombardo, R. Visualising Departures from Symmetry and Bowker’s X2 Statistic. Symmetry 2022, 14, 1103. https://doi.org/10.3390/sym14061103
Beh EJ, Lombardo R. Visualising Departures from Symmetry and Bowker’s X2 Statistic. Symmetry. 2022; 14(6):1103. https://doi.org/10.3390/sym14061103
Chicago/Turabian StyleBeh, Eric J., and Rosaria Lombardo. 2022. "Visualising Departures from Symmetry and Bowker’s X2 Statistic" Symmetry 14, no. 6: 1103. https://doi.org/10.3390/sym14061103
APA StyleBeh, E. J., & Lombardo, R. (2022). Visualising Departures from Symmetry and Bowker’s X2 Statistic. Symmetry, 14(6), 1103. https://doi.org/10.3390/sym14061103