[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

Defining and measuring completeness of electronic health records for secondary use

Published: 01 October 2013 Publication History

Abstract

Graphical abstractDisplay Omitted The completeness of EHR data is dependent upon the definition of completeness being used.We present four definitions of EHR completeness: documentation, breadth, density, and predictive.Each definition results in a different set of complete patient records.Researchers reusing EHR data should report completeness limitations and findings. We demonstrate the importance of explicit definitions of electronic health record (EHR) data completeness and how different conceptualizations of completeness may impact findings from EHR-derived datasets. This study has important repercussions for researchers and clinicians engaged in the secondary use of EHR data. We describe four prototypical definitions of EHR completeness: documentation, breadth, density, and predictive completeness. Each definition dictates a different approach to the measurement of completeness. These measures were applied to representative data from NewYork-Presbyterian Hospital's clinical data warehouse. We found that according to any definition, the number of complete records in our clinical database is far lower than the nominal total. The proportion that meets criteria for completeness is heavily dependent on the definition of completeness used, and the different definitions generate different subsets of records. We conclude that the concept of completeness in EHR is contextual. We urge data consumers to be explicit in how they define a complete record and transparent about the limitations of their data.

References

[1]
Hersh, W.R., Adding value to the electronic health record through secondary use of data for quality assurance, research, and surveillance. Am J Manag Care. v13. 277-278.
[2]
Safran, C., Bloomrosen, M., Hammond, W.E., Labkoff, S., Markel-Fox, S. and Tang, P.C., Toward a national framework for the secondary use of health data: an American Medical Informatics Association White Paper. J Am Med Inform Assoc. v14. 1-9.
[3]
Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research. J Am Med Inform Assoc. v20. 144-151.
[4]
Thiru, K., Hassey, A. and Sullivan, F., Systematic review of scope and quality of electronic patient record data in primary care. BMJ. v326. 1070
[5]
Hogan, W.R. and Wagner, M.M., Accuracy of data in computer-based patient records. J Am Med Inform Assoc. v4. 342-355.
[6]
Chan, K.S., Fowles, J.B. and Weiner, J.P., Review: electronic health records and the reliability and validity of quality measures: a review of the literature. Med Care Res Rev. v67. 503-527.
[7]
Rubin, D., Inference and missing data. Biometrika. v63. 581-592.
[8]
Missing data: our view of the state of the art. Psychol Methods. v7. 147-177.
[9]
Blobel, B., Advanced and secure architectural EHR approaches. Int J Med Inform. v75. 185-190.
[10]
Botsis, T., Hartvigsen, G., Chen, F. and Weng, C., Secondary use of EHR: data quality issues and informatics opportunities. AMIA Summits Transl Sci Proc. v2010. 1-5.
[11]
Arts, D.G., De Keizer, N.F. and Scheffer, G.J., Defining and improving data quality in medical registries: a literature review, case study, and generic framework. J Am Med Inform Assoc. v9. 600-611.
[12]
Juran, J.M. and Gryna, F.M., Juran's quality control handbook. 1988. 4th ed. McGraw-Hill, New York.
[13]
Wang, R.Y. and Strong, D.M., Beyond accuracy: what data quality means to data consumers. J Manag Inform Syst. v12. 5-34.
[14]
Lewis, J.D. and Brensinger, C., Agreement between GPRD smoking data: a survey of general practitioners and a population-based survey. Pharmacoepidemiol Drug Saf. v13. 437-441.
[15]
Logan, J.R., Gorman, P.N. and Middleton, B., Measuring the quality of medical records: a method for comparing completeness and correctness of clinical encounter data. Proc AMIA Symp. 408-412.
[16]
Barrie, J.L. and Marsh, D.R., Quality of data in the Manchester orthopaedic database. BMJ (Clinical research ed.). v304. 159-162.
[17]
Hohnloser, J.H., Fischer, M.R., Konig, A. and Emmerich, B., Data quality in computerized patient records. Analysis of a haematology biopsy report database. Int J Clin Monit Comput. v11. 233-240.
[18]
Ricketts, D., Newey, M., Patterson, M., Hitchin, D. and Fowler, S., Markers of data quality in computer audit: the Manchester Orthopaedic Database. Ann R Coll Surg Engl. v75. 393-396.
[19]
Roukema, J., Los, R.K., Bleeker, S.E., van Ginneken, A.M., van der Lei, J. and Moll, H.A., Paper versus computer: feasibility of an electronic medical record in general pediatrics. Pediatrics. v117. 15-21.
[20]
Centers for Medicare & Medicaid Services. Quality measures.
[21]
Ritchie, M.D., Denny, J.C., Crawford, D.C., Ramirez, A.H., Weiner, J.B. and Pulley, J.M., Robust replication of genotype-phenotype associations across multiple diseases in an electronic medical record. Am J Hum Genet. v86. 560-572.
[22]
Thadani, S.R., Weng, C., Bigger, J.T., Ennever, J.F. and Wajngurt, D., Electronic screening improves efficiency in clinical trial recruitment. J Am Med Inform Assoc. v16. 869-873.
[23]
Weng, C., Batres, C., Borda, T., Weiskopf, N.G., Wilcox, A.B. and Bigger, J.T., A real-time screening alert improves patient recruitment efficiency. AMIA Annu Symp Proc. v2011. 1489-1498.
[24]
Agnew-Blais, J.C., Coblyn, J.S., Katz, J.N., Anderson, R.J., Mehta, J. and Solomon, D.H., Measuring quality of care for rheumatic diseases using an electronic medical record. Ann Rheum Dis. v68. 680-684.
[25]
Goulet, J.L., Erdos, J., Kancir, S., Levin, F.L., Wright, S.M. and Daniels, S.M., Measuring performance directly using the veterans health administration electronic medical record: a comparison with external peer review. Med Care. v45. 73-79.
[26]
Jensen, R.E., Chan, K.S., Weiner, J.P., Fowles, J.B. and Neale, S.M., Implementing electronic health record-based quality measures for developmental screening. Pediatrics. v124. e648-e654.
[27]
Linder, J.A., Kaleba, E.O. and Kmetik, K.S., Using electronic health records to measure physician performance for acute conditions in primary care: empirical evaluation of the community-acquired pneumonia clinical quality measure set. Med Care. v47. 208-216.
[28]
Asche, C., Said, Q., Joish, V., Hall, C.O. and Brixner, D., Assessment of COPD-related outcomes via a national electronic medical record database. Int J Chron Obstruct Pulm Dis. v3. 323-326.
[29]
Einbinder JS, Rury C, Safran C. Outcomes research using the electronic patient record: Beth Israel Hospital's experience with anticoagulation. In: Proceedings of the annual symposium on computer application {sic} in medical care; 1995. p. 819-23.
[30]
Sperrin, M., Thew, S., Weatherall, J., Dixon, W. and Buchan, I., Quantifying the longitudinal value of healthcare record collections for pharmacoepidemiology. AMIA Annu Symp Proc. v2011. 1318-1325.
[31]
Kho, A.N., Hayes, M.G., Rasmussen-Torvik, L., Pacheco, J.A., Thompson, W.K. and Armstrong, L.L., Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study. J Am Med Inform Assoc. v19. 212-218.
[32]
Denny, J.C., Crawford, D.C., Ritchie, M.D., Bielinski, S.J., Basford, M.A. and Bradford, Y., Variants near FOXE1 are associated with hypothyroidism and other thyroid conditions: using electronic medical records for genome- and phenome-wide studies. Am J Hum Genet. v89. 529-542.
[33]
Overby C, Pathak J, Haerian K, Perotte A, Murphy S, Bruce K, et al. A collaborative approach to develop an electronic health record phenotyping algorithm for drug-induced liver injury. J Am Med Inform Assoc; 2013, {published online} http//dx.doi.org/10.1136/amiajnl-2013-001930.
[34]
Boland, M.R., Tu, S.W., Carini, S., Sim, I. and Weng, C., EliXR-TIME: a temporal knowledge representation for clinical research eligibility criteria. AMIA Summits Transl Sci Proc. v2012. 71-80.
[35]
Hao T, Rusanov A, Weng C. Extracting and normalizing temporal expressions in clinical data requests from researchers. In: International health informatics conference, Beijing, China; 2013.
[36]
Zhao, D. and Weng, C., Combining knowledge and EHR data to develop a weighted bayesian network for pancreatic cancer prediction. J Biomed Inform. v44. 859-868.
[37]
A predictive model for progression of chronic kidney disease to kidney failure. JAMA. v305. 1553-1559.
[38]
Wu, J., Roy, J. and Stewart, W.F., Prediction modeling using EHR data: challenges, strategies, and a comparison of machine learning approaches. Med Care. v48. S106-S113.
[39]
Cholleti, S., Post, A., Gao, J., Lin, X., Bornstein, W. and Cantrell, D., Leveraging derived data elements in data analytic models for understanding and predicting hospital readmissions. AMIA Annu Symp Proc. v2012. 103-111.
[40]
Nijhawan, A.E., Clark, C., Kaplan, R., Moore, B., Halm, E.A. and Amarasingham, R., An electronic medical record-based model to predict 30-day risk of readmission and death among HIV-infected inpatients. J Acquir Immune Defic Syndr. v61. 349-358.
[41]
Wells, B.J., Jain, A., Arrigain, S., Yu, C., Rosenkrans Jr., W.A. and Kattan, M.W., Predicting 6-year mortality risk in patients with type 2 diabetes. Diabetes Care. v31. 2301-2306.
[42]
Kennedy, E.H., Wiitala, W.L., Hayward, R.A. and Sussman, J.B., Improved cardiovascular risk prediction using nonparametric regression and electronic health record data. Med Care. v51. 251-258.
[43]
Micallef L, Rodgers P. Drawing area-proportional venn-3 diagrams using ellipses. In: 12th Annual grace hopper celebration of women in computing, ACM student research competition and poster session, Baltimore, MD2012.

Cited By

View all
  • (2024)Towards the automatic calculation of the EQUAL Candida ScoreJournal of Biomedical Informatics10.1016/j.jbi.2024.104667156:COnline publication date: 1-Aug-2024
  • (2024)Converting OMOP CDM to phenopacketsJournal of Biomedical Informatics10.1016/j.jbi.2024.104659155:COnline publication date: 1-Jul-2024
  • (2023)Diagnostic rate estimation from Medicare recordsJournal of Biomedical Informatics10.1016/j.jbi.2023.104463145:COnline publication date: 1-Sep-2023
  • Show More Cited By
  1. Defining and measuring completeness of electronic health records for secondary use

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Journal of Biomedical Informatics
      Journal of Biomedical Informatics  Volume 46, Issue 5
      October, 2013
      193 pages

      Publisher

      Elsevier Science

      San Diego, CA, United States

      Publication History

      Published: 01 October 2013

      Author Tags

      1. Completeness
      2. Data quality
      3. Electronic health records
      4. Secondary use

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 01 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Towards the automatic calculation of the EQUAL Candida ScoreJournal of Biomedical Informatics10.1016/j.jbi.2024.104667156:COnline publication date: 1-Aug-2024
      • (2024)Converting OMOP CDM to phenopacketsJournal of Biomedical Informatics10.1016/j.jbi.2024.104659155:COnline publication date: 1-Jul-2024
      • (2023)Diagnostic rate estimation from Medicare recordsJournal of Biomedical Informatics10.1016/j.jbi.2023.104463145:COnline publication date: 1-Sep-2023
      • (2023)A data preparation framework for cleaning electronic health records and assessing cleaning outcomes for secondary analysisInformation Systems10.1016/j.is.2022.102130111:COnline publication date: 1-Jan-2023
      • (2023)Five-dimensional evaluation system and perceptron intelligent computing performance measurement methods based on medical heterogeneous equipment health dataNeural Computing and Applications10.1007/s00521-023-08316-335:35(24651-24664)Online publication date: 1-Dec-2023
      • (2022)The Secondary Use of Electronic Health Records for Data Mining: Data Characteristics and ChallengesACM Computing Surveys10.1145/349023455:2(1-40)Online publication date: 18-Jan-2022
      • (2022)Predicting pharmacotherapeutic outcomes for type 2 diabetesJournal of Biomedical Informatics10.1016/j.jbi.2022.104001129:COnline publication date: 1-May-2022
      • (2022)Information extraction from electronic medical documents: state of the art and future research directionsKnowledge and Information Systems10.1007/s10115-022-01779-165:2(463-516)Online publication date: 8-Nov-2022
      • (2021)Visualising Developing Nations Health Records: Opportunities, Challenges and Research AgendaProceedings of the 12th International Conference on Advances in Information Technology10.1145/3468784.3471607(1-9)Online publication date: 29-Jun-2021
      • (2021)Evaluation of the key active ingredients of ‘Radix Astragali and Rehmanniae Radix Mixture’ and related signaling pathways involved in ameliorating diabetic foot ulcers from the perspective of TCM-related theoriesJournal of Biomedical Informatics10.1016/j.jbi.2021.103904123:COnline publication date: 1-Nov-2021
      • Show More Cited By

      View Options

      View options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media