Abstract
The capability of eXtensible Markup Language (XML) for data representation has been widely accepted by research communities and industries. Even though it can be used for efficient data transfer, many industries look for a more promising language on which to rely when it comes to their important data. An ability to provide good XML data quality is necessary to make this data format more reliable and usable. To measure data quality, the current methods are largely driven by structural and technical factors and often assess data quality impartially, not accounting for contextual factors. It is well known that different data share common quality features: completeness, validity, accuracy and timeliness. Nevertheless, the measurement of quality features will be unique, based on the data format. The measurement of quality for XML documents cannot be generalised from quality measurement in other data formats. In this chapter, we describe the development of a user-defined quality metric for XML documents. For implementation, we develop a tool that enables users to control XML data quality. We use a case study in health informatics as the proof of concept.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ballou, D. P., and Tayi, G. K. (1999) Enhancing data quality in data warehouse environments, Communications of the ACM 42(1): 73–78.
Batini, C., and Scannapieco, M. (2006) Data Quality: Concepts, Methodologies and Techniques. Springer, Berlin.
Bierer, A. (2007) Methodological assistance for integrating data quality evaluations into case-based reasoning systems, Proceedings of the 7th International Conference on Case-Based Reasoning (ICCBR 2007), Belfast, Northern Ireland, UK, pp. 254–268.
Even, A., and Shankaranarayanan, G. (2007) Utility-driven configuration of data quality in data repositories, IJIQ 1(1): 22–40.
Paulson, L. D. (2000) Data quality: A rising e-business concern, IEEE IT Professionals 2(4): 10–14.
Serrano, M. A., Calero, C., and Piattini, M. (2005) Metrics for data warehouse quality, In: Khosrow-Pour, M. (Ed.) Encyclopedia of Information Science and Technology IV, Idea Group, Hershey, PA, pp. 1938–1844.
Shankaranarayanan, G., and Cai, Y. (2005) A web services application for the data quality management in the B2B networked environment, Proceedings of the 38th Hawaii International Conference on System Sciences (HICCS 2005), Hawaii, USA, pp. 166.
Welzer, T., Brumen, V., Golob, I., and Druzovec, M. (2002) Medical diagnostic and data quality, Proceedings of the IEEE Symposium on Computer-Based Medical Systems (CBMS 2002), Maribor, Slovenia, pp. 97–101.
Zhu, B., Shankar, G., and Cai, Y. (2007) Integrating data quality data into decision-making process: An information visualization approach, Proceedings of the 12th International Conference HCI International (HCII 2007) Part I, Beijing, China, pp. 366–369.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media, LLC
About this paper
Cite this paper
Pardede, E., Gaur, T. (2011). On the Development of a User-Defined Quality Measurement Tool for XML Documents. In: Song, W., et al. Information Systems Development. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-7355-9_18
Download citation
DOI: https://doi.org/10.1007/978-1-4419-7355-9_18
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-7205-7
Online ISBN: 978-1-4419-7355-9
eBook Packages: Computer ScienceComputer Science (R0)