research-article

A method for extracting tumor events from clinical CT examination reports

Published: 01 June 2023

Abstract

Accurate and efficient extraction of key disease-related information from medical examination reports (for example, reports on X-ray, ultrasound, and CT examinations) is crucial for diagnosis and treatment. These reports provide a detailed record of a patient's health condition and are an important part of the clinical examination process. Organizing this information in a structured way lets doctors review and analyze the data more easily, leading to better patient care. In this paper, we introduce a new technique for extracting useful information from the unstructured text of clinical examination reports, which we treat as a medical event extraction (EE) task. Our approach is based on Machine Reading Comprehension (MRC) and involves two sub-tasks: Question Answerability Judgment (QAJ) and Span Selection (SS). For QAJ, we use BERT to build a question answerability discriminator (Judger) that determines whether a reading comprehension question can be answered, thereby avoiding the extraction of arguments for unanswerable questions. The SS sub-task first obtains the encoding of each word in the medical text from the final layer of BERT's Transformer, then uses an attention mechanism to identify the information in these word encodings that is relevant to the answer. This information is fed into a bidirectional LSTM (BiLSTM) module to obtain a global representation of the text, which is used, together with the softmax function, to predict the answer span (i.e., the start and end positions of the answer in the report). Using interpretability methods, we calculate the Jensen-Shannon Divergence (JSD) score between layers of the network and confirm that our model has strong word representation capabilities, enabling it to extract contextual information from medical reports effectively. Our experiments show that our method outperforms existing medical event extraction methods, achieving a state-of-the-art F1 score.
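The decoding logic described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the answerability threshold of 0.5, and the maximum span length are assumptions made for illustration, and the start and end logits are taken as given rather than produced by a BERT, attention, and BiLSTM stack. The sketch shows (a) skipping span extraction when the Judger deems a question unanswerable, (b) choosing the answer span from softmax-normalized start and end scores, and (c) the standard definition of the Jensen-Shannon Divergence between two probability distributions. How the paper applies JSD across network layers is not specified in the abstract.

```python
import math
from typing import List, Optional, Tuple


def softmax(logits: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def select_span(
    start_logits: List[float],
    end_logits: List[float],
    answerable_prob: float,
    threshold: float = 0.5,   # assumed cut-off for the answerability judgment
    max_len: int = 30,        # assumed maximum answer length in tokens
) -> Optional[Tuple[int, int]]:
    """Return (start, end) token indices, or None for unanswerable questions."""
    # Question Answerability Judgment: do not extract an argument
    # when the question is judged unanswerable.
    if answerable_prob < threshold:
        return None

    # Span Selection: softmax over positions, then pick the valid
    # (start <= end) pair with the highest joint probability.
    p_start = softmax(start_logits)
    p_end = softmax(end_logits)
    best, best_score = None, -1.0
    for i, ps in enumerate(p_start):
        for j in range(i, min(i + max_len, len(p_end))):
            score = ps * p_end[j]
            if score > best_score:
                best, best_score = (i, j), score
    return best


def jensen_shannon_divergence(p: List[float], q: List[float]) -> float:
    """JSD (base 2) between two discrete distributions; result lies in [0, 1]."""
    mix = [(a + b) / 2 for a, b in zip(p, q)]

    def kl(a: List[float], b: List[float]) -> float:
        # Terms with a zero numerator contribute nothing to the sum.
        return sum(x * math.log2(x / y) for x, y in zip(a, b) if x > 0)

    return 0.5 * kl(p, mix) + 0.5 * kl(q, mix)
```

For example, `select_span(start_logits, end_logits, answerable_prob=0.2)` returns `None` under the assumed threshold, so no argument is extracted for that question.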

Cited By

  • (2024) Clinical natural language processing for secondary uses. Journal of Biomedical Informatics 150:C. DOI: 10.1016/j.jbi.2024.104596. Online publication date: 25-Jun-2024

Published In

Journal of Biomedical Informatics, Volume 142, Issue C
Jun 2023
290 pages

Publisher

Elsevier Science

San Diego, CA, United States

Author Tags

  1. Event extraction
  2. Argument extraction
  3. Machine reading comprehension
  4. Clinical examination reports
