
Formalization of the classification pattern: survey of classification modeling in information systems engineering

Published: 01 February 2018

Abstract

Formalization is becoming more common in all stages of the development of information systems, as a better understanding of its benefits emerges. Classification systems are ubiquitous, nowhere more so than in domain modeling. The classification pattern that underlies these systems provides a good case study of the move toward formalization, in part because it illustrates some of the barriers to formalization, including the formal complexity of the pattern and the ontological issues surrounding the "one and the many." Powersets are a way of characterizing the (complex) formal structure of the classification pattern, and their formalization has been extensively studied in mathematics since Cantor's work in the late nineteenth century. One can use this formalization to develop a useful benchmark. There are various communities within information systems engineering (ISE) that are gradually working toward a formalization of the classification pattern. However, for most of these communities, this work is incomplete, in that they have not yet arrived at a solution with the expressiveness of the powerset benchmark. This contrasts with the early, smooth adoption of powersets by other information systems communities to, for example, formalize relations. One way of understanding the varying rates of adoption is to recognize that the different communities carry different historical baggage. Many conceptual modeling communities emerged from work done on database design, and this creates hurdles to adopting the high level of expressiveness of powersets. Another relevant factor is that these communities often feel, particularly in the case of domain modeling, a responsibility to explain the semantics of whatever formal structures they adopt.
This paper aims to make sense of the formalization of the classification pattern in ISE and surveys its history through the literature, starting from the relevant theoretical works of the mathematical literature and gradually shifting focus to the ISE literature. The literature survey follows the evolution of ISE's understanding of how to formalize the classification pattern. The various proposals are assessed against the classical example of classification, the Linnaean taxonomy, which is formalized using powersets to serve as a benchmark for formal expressiveness. The broad conclusion of the survey is that (1) the ISE community is currently in the early stages of the process of understanding how to formalize the classification pattern, particularly in the requirements for expressiveness exemplified by powersets, and (2) there is an opportunity to intervene and speed up the process of adoption by clarifying this expressiveness. Given the central place that the classification pattern has in domain modeling, this intervention has the potential to lead to significant improvements.
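
The abstract does not spell out the powerset formalization, so the following is only a minimal sketch of the general idea, under assumptions of our own: a toy set of invented organisms, taxa modeled as sets of organisms (members of P(organisms)), and ranks such as Species and Genus modeled as sets of taxa (members of P(P(organisms))). The function names and the data are hypothetical and are not taken from the paper.

```python
from itertools import chain, combinations


def powerset(base):
    """Return the set of all subsets of `base`, each as a frozenset."""
    items = list(base)
    subsets = chain.from_iterable(
        combinations(items, size) for size in range(len(items) + 1)
    )
    return {frozenset(subset) for subset in subsets}


def in_iterated_powerset(candidate, base, depth):
    """Check membership in P^depth(base) without enumerating it.

    depth 0 tests membership in `base` itself; depth n >= 1 requires that
    `candidate` is a frozenset whose members all belong to P^(n-1)(base).
    """
    if depth == 0:
        return candidate in base
    return isinstance(candidate, frozenset) and all(
        in_iterated_powerset(member, base, depth - 1) for member in candidate
    )


# Level 0: hypothetical individual organisms (names invented for illustration).
organisms = frozenset({"fido", "rex", "wile", "tom", "sylvie"})

# Level 1: taxa are sets of organisms, i.e. members of P(organisms).
canis_lupus = frozenset({"fido", "rex"})
canis_latrans = frozenset({"wile"})
felis_catus = frozenset({"tom"})
felis_silvestris = frozenset({"sylvie"})
canis = canis_lupus | canis_latrans       # a genus is also a set of organisms
felis = felis_catus | felis_silvestris

# Level 2: ranks are sets of taxa, i.e. members of P(P(organisms)).
species_rank = frozenset(
    {canis_lupus, canis_latrans, felis_catus, felis_silvestris}
)
genus_rank = frozenset({canis, felis})

# Level 3: the collection of ranks is a member of P(P(P(organisms))).
ranks = frozenset({species_rank, genus_rank})

if __name__ == "__main__":
    print(len(powerset(organisms)))                           # size of P(organisms)
    print(canis_lupus in powerset(organisms))                 # a species is a subset
    print(canis_lupus < canis)                                # species is a proper subset of its genus
    print(in_iterated_powerset(species_rank, organisms, 2))   # a rank sits two levels up
    print(in_iterated_powerset(ranks, organisms, 3))          # the set of ranks sits three levels up
```

The recursive membership test is a deliberate design choice in this sketch: iterated powersets grow doubly exponentially (five organisms already give 2^32 members at the second level), so checking a candidate structurally is more practical than enumerating each level.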


Cited By

  • (2023) Sanity-Checking Multiple Levels of Classification. Conceptual Modeling, pp. 162-180. DOI: 10.1007/978-3-031-47262-6_9. Online publication date: 6-Nov-2023
  • (2022) A goal-oriented framework for ontology reuse. Applied Ontology 17(3), pp. 365-399. DOI: 10.3233/AO-220269. Online publication date: 1-Jan-2022
  • (2018) Exploring Potency. Proceedings of the 21th ACM/IEEE International Conference on Model Driven Engineering Languages and Systems, pp. 2-12. DOI: 10.1145/3239372.3239411. Online publication date: 14-Oct-2018
Published In

Software and Systems Modeling (SoSyM), Volume 17, Issue 1
February 2018
352 pages

Publisher

Springer-Verlag
Berlin, Heidelberg

Author Tags

1. Classification
2. Classification system
3. Powerset
4. Powertype
5. Set theory
