[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
10.1145/2024587.2024599acmconferencesArticle/Chapter ViewAbstractPublication PagesicseConference Proceedingsconference-collections
research-article

A process for assessing data quality

Published: 04 September 2011 Publication History

Abstract

Abstract: This industrial report stems from practical experience in assessing the quality of customer databases. The process it describes unites three automated audits, - an audit of the database schema, an audit of the database structure and an audit of the database content. The audit of the database schema checks for design smells and rule violations. The audit of the database structure measures the size, complexity and quality of the database model. The audit of the database content processes the data itself to uncover invalid data values, missing records and redundant records. The purpose of these audits is to assess the quality of the database and to determine whether a data reengineering or data clean-up project is required.

References

[1]
Belady, L., Lehman, M.: "A Model of Large Program Development", IBM Systems Journal, Vol. 15, Nr. 3, 1976
[2]
Blaha, M. Premerlani, W.: "Observed Idiosyncracies of relational Database Design", IEEE Proc. of 2nd WCRE, Toronto, July, 1995, p. 116
[3]
Premerleni, W. Blaha, M.: "An Approach for Reengineering of relational databases", Comm. Of ACM, Vol. 37, No. 5, May, 1994, p. 42
[4]
Tayi, G.-K. Ballou, D.: "Examining Data Quality", Comm. Of ACM, Vol. 41, No. 2, Feb. 1998
[5]
Date, C.J.: An Introduction to Database Systems, Addison-Wesley Pub., Reading Mass., 1975
[6]
Redman, T.C.: Data Quality for the Information Age, Artech House, Boston, 1996
[7]
CW: Computer Weekly, Nr. 26, June 1998, p. 1
[8]
Yong, J.K. Kishore, R. Sanders, G.L.: "From DQ to EQ -- Understanding Data Quality in the Context of E-Business Systems", Comm. Of ACM, Vol. 48, No. 10, Oct. 2005, p. 75
[9]
Blaha, M.: "A copper Bullet for Software Quality Improvement", IEEE Computer, Feb. 2004, p. 21
[10]
Wand, Y./Wang, R.: "Anchoring Data Quality Dimensions in Ontological Foundations", Comm. Of ACM, Vol. 39, No. 11, Nov. 1996, p. 86
[11]
Kaplan, D. Krishnan, R. Padman, R. Peters, J.: "Assessing Data Quality in Accounting Information Systems" in Comm. of ACM, Vol. 41, No. 2, Feb. 1998
[12]
DeMarco, T.: Controlling Software Projects -- Management, Measurement & Estimation, Yourdon Press, New York, 1982
[13]
Kan, S.H.: Metrics and Models in Software Quality Engineering, Addison-Wesley, Boston, 2001
[14]
International Function-Point User Group - IFPUG: Counting Practices Manual, Release 4.1. IFPUG, Westerville, Ohio, 1999
[15]
Sneed, H.: Die Data-Point Methode, Online, Zeitschrift für DV, Nr. 5, May 1990, S. 48
[16]
Card, D., Agresti, W.: "Measuring Software Design Complexity", Journal of Systems & Software, Vol. 8, 1988, S. 185
[17]
Hainaut, J-L.: "Strategies for Data Reengineering", IEEE Proc. of 9th WCRE, Richmond, Oct. 2002, p. 211
[18]
Aiken, P.: Data Reverse Engineering, McGraw Hill, New York, 1996
[19]
Blaha, M.: A Manager's Guide to Database Technology -- building and purchasing better Applications, Prentice-Hall, Englewood Cliffs, 2001
[20]
Orr, K.: "Data Quality and System Theory" in Comm. of ACM, Vol. 41, No. 2, Feb. 1998
[21]
Jackson, M.: Principles of Program Design, Academic Press, London, 1975
[22]
Aiken, P. Muntz,A. Richards,R: "DOD Legacy Systems Reverse Engineering Data Requirements", Comm of ACM, Vol. 37, No. 5, Feb. 1994, p. 26
[23]
Brathwaite, K.: Systems Design in a Database Environment, McGraw-Hill, New York, 1989, p. 106
[24]
Wang, R.: "Total Data Quality Management", in Comm. of ACM, Vol. 41, No. 2, Feb. 1998
[25]
ISO/IEC: "Software Product Evaluation - Quality Characteristics and Guidelines for their Use" ISO/IEC Standard ISO-9126, Geneva, 1994
[26]
Howden, W.: "The Theory and Practice of Functional Testing", IEEE Software, Vol. 2, No. 5, Sept. 1985, p. 6
[27]
Fewster, M. Graham, D.: Software Test Automation, Addison-Wesley, Harlow, G.B., 1999
[28]
Sneed, H.: "Bridging the Concept to Implementation Gap in System Testing", IEEE Proc. of TAICPART Workshop, Windsor, G.B., Sept. 2009, p. 172
[29]
Sneed, H.: "Reverse Engineering of Test Cases for Selective Regression Testing", Proc. of European Conference on Software Maintenance and Reengineering, CSMR-2004, IEEE Computer Society Press, Tampere, Finnland, March 2004, S. 69
[30]
Sneed, H.: "Testing a Datawarehouse -- an industrial challenge", IEEE Proc. of TAICPART Workshop, Windsor, G.B., August, 2006, p. 203
[31]
Sneed, H.: "Migrating from COBOL to Java -- a Report from the Field", IEEE Proc. of European Conference on Software Maintenance and Reengineering, CSMR2007, Oldenbourg, March 2011, p. 309
[32]
Sneed, H.: "Migration of a PowerBuilder Application to Java" in Proc. of GI Workshop on Software Reengineering- WRE, May, 2011, p. 55
[33]
Sneed, H. Baumgartner,M. Seidl,R.: Software in Zahlen, Hanser Verlag, München/Wien, 2010, S. 197
[34]
Basili, V., Caldiera, C., Rombach, H-D.: "Goal Question Metric Paradigm", Encyclopedia of Software Engineering, Vol 1, John Wiley & Sons, New York, 1994, S. 528

Cited By

View all

Index Terms

  1. A process for assessing data quality

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WoSQ '11: Proceedings of the 8th international workshop on Software quality
    September 2011
    64 pages
    ISBN:9781450308519
    DOI:10.1145/2024587
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 04 September 2011

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. data auditing
    2. data content validation
    3. data metrics
    4. data quality

    Qualifiers

    • Research-article

    Conference

    ESEC/FSE'11
    Sponsor:

    Acceptance Rates

    WoSQ '11 Paper Acceptance Rate 7 of 11 submissions, 64%;
    Overall Acceptance Rate 7 of 11 submissions, 64%

    Upcoming Conference

    ICSE 2025

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)13
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 12 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media