Abstract
In lung cancer computer-aided diagnosis (CAD) systems, having an accurate and available ground truth is critical and time consuming. In this study, we have explored Lung Image Database Consortium (LIDC) database containing pulmonary computed tomography (CT) scans, and we have implemented content-based image retrieval (CBIR) approach to exploit the limited amount of diagnostically labeled data in order to annotate unlabeled images with diagnoses. By applying CBIR method iteratively and using pathologically confirmed cases, we expand the set of diagnosed data available for CAD systems from 17 nodules to 121 nodules. We evaluated the method by implementing a CAD system that uses various combinations of lung nodule sets as queries and retrieves similar nodules from the diagnostically labeled dataset. In calculating the precision of this system Diagnosed dataset and computer-predicted malignancy data are used as ground truth for the undiagnosed query nodules. Our results indicate that CBIR expansion is an effective method for labeling undiagnosed images in order to improve the performance of CAD systems. It also indicated that little knowledge of biopsy confirmed cases not only assist the physician’s as second opinion to mark the undiagnosed cases and avoid unnecessary biopsies too.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Wormanns, D., Fiebich, M., Saidi, M., Diederich, S., Heindel, W.: Automatic detection of pulmonary nodules at spiral CT: clinical application of a computer-aided diagnosis system. European Radiology 12, 1052–1057 (2002)
Blum, A., Mitchell, T.: Combining Labelled and Unlabelled Data with Co-Training. In: Proceedings of the 11th Annual Conference on Computational Learning Theory, COLT 1998, pp. 92–100 (1998)
McNitt-Gray, M.F., Armato, S.G., Meyer, C.R., Reeves, A.P., McLennan, G., Pais, R.C., et al.: The lung image database consortium (LIDC) data collection process for nodule detection and annotation. Academic Radiology 14(12), 1464–1474 (2007)
Armato III, S.G., McLennan, G., Bidaut, L., McNitt-Gray, M.F., Meyer, C.R., Reeves, A.P., Zhao, B., Aberle, D.R., Henschke, C.I., Hoffman, E.A., Kazerooni, E.A., MacMahon, H., van Beek, E.J.R., Yankelevitz, D., et al.: The Lung Image Database Consortium (LIDC) and Image Database Resources Initiative (IDRI): A completed reference database of lung nodules on CT scans. Medical Physics 38, 915–931 (2011)
Horsthemke, W.H., Raicu, D.S., Furst, J.D., Armato III, S.G.: Evaluation Challenges for Computer-Aided Diagnostic Characterization: Shape Disagreements in the Lung Image Database Consortium Pulmonary Nodule Dataset. In: Tan, J. (ed.) New Technologies for Advancing Healthcare and Clinical Practices, pp. 18–43. IGI Global, Hershey PA (2011)
Jabon, S.A., Raicu, D.S., Furst, J.D.: Content-based versus semantic-based similarity retrieval: a LIDC case study. In: SPIE Medical Imaging Conference, Orlando (February 2009)
McNitt-Gray, M.F., Hart, E.M., Wyckoff, N., Sayre, J.W., Goldin, J.G., Aberle, D.R.: A pattern classification approach to characterizing solitary pulmonary nodules imaged on high resolution CT: Preliminary results. Med. Phys. 26, 880–888 (1999)
Armato III, S.G., Altman, M.B., Wilkie, J., Sone, S., Li, F., Doi, K., Roy, A.S.: Automated lung nodule classification following automated nodule detection on CT: A serial approach. Med. Phys. 30, 1188–1197 (2003)
Takashima, S., Sone, S., Li, F., Maruyama, Y., Hasegawa, M., Kadoya, M.: Indeterminate solitary pulmonary nodules revealed at population-based CT screening of the lung: using first follow-up diagnostic CT to differentiate benign and malignant lesions. Am. J. Roentgenol. 180, 1255–1263 (2003)
Samuel, C.C., Saravanan, V., Vimala, D.M.R.: Lung nodule diagnosis from CT images using fuzzy logic. In: Proceedings of International Conference on Computational Intelligence and Multimedia Applications, Sivakasi, Tamilnadu, India, December 13-15, pp. 159–163 (2007)
Raicu, D.S., Varutbangkul, E., Furst, J.D.: Modelling semantics from image data: opportunities from LIDC. International Journal of Biomedical Engineering and Technology 3(1-2), 83–113 (2009)
Giuca, A.-M., Seitz Jr., K.A., Furst, J., Raicu, D.: Expanding diagnostically labeled datasets using content-based image retrieval. In: IEEE International Conference on Image Processing 2012, Lake Buena Vista, Florida, September 30-October 3 (2012)
Aggarwal, P., Vig, R., Sardana, H.K.: Largest Versus Smallest Nodules Marked by Different Radiologists in Chest CT Scans for Lung Cancer Detection. In: International Conference on Image Engineering, ICIE 2013 Organized by IAENG at Hong Kong (in press, 2013)
Zhou, Z.-H.: Learning with Unlabeled Data and Its Application to Image Retrieval. In: Yang, Q., Webb, G. (eds.) PRICAI 2006. LNCS (LNAI), vol. 4099, pp. 5–10. Springer, Heidelberg (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Aggarwal, P., Sardana, H.K., Vig, R. (2013). Correlation between Biopsy Confirmed Cases and Radiologist’s Annotations in the Detection of Lung Nodules by Expanding the Diagnostic Database Using Content Based Image Retrieval. In: Wilson, R., Hancock, E., Bors, A., Smith, W. (eds) Computer Analysis of Images and Patterns. CAIP 2013. Lecture Notes in Computer Science, vol 8047. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40261-6_64
Download citation
DOI: https://doi.org/10.1007/978-3-642-40261-6_64
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40260-9
Online ISBN: 978-3-642-40261-6
eBook Packages: Computer ScienceComputer Science (R0)