Quantifying Similarity: Text-Mining Approaches to Evaluate ChatGPT and Google Bard Content in Relation to BioMedical Literature

  • Conference paper
Computational Science – ICCS 2024 (ICCS 2024)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14836)

Abstract

Generative AI tools powered by Large Language Models (LLMs) have proven capable of generating content, and assessing the usefulness of that content has become an interesting research question. Using prompt engineering, we assess how similar such content is to real literature produced by scientists. In this exploratory analysis, we prompt-engineer ChatGPT and Google Bard to generate clinical content and compare it with biomedical literature. Our approach uses text-mining methods to compare documents and bigrams, and network analysis to assess term centrality. In our experiments, ChatGPT outperformed Google Bard across the similarity and term-network centrality measures used, though both tools achieved good results relative to the baseline.
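
The abstract names the families of techniques (document and bigram comparison, network centrality) without giving the exact measures. The following is a minimal sketch of how such a comparison could look, assuming Jaccard and cosine similarity over word bigrams and degree centrality over a term network whose edges are adjacent-word pairs. These measures, the tokenizer, and the toy sentences are illustrative assumptions, not the authors' confirmed pipeline.

```python
import math
import re
from collections import Counter, defaultdict


def bigrams(text):
    """Lowercase the text, keep alphanumeric tokens, return adjacent word pairs."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return list(zip(tokens, tokens[1:]))


def jaccard(a, b):
    """Jaccard similarity between the sets of bigrams of two documents."""
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0


def cosine(a, b):
    """Cosine similarity between the bigram count vectors of two documents."""
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[k] * cb[k] for k in ca.keys() & cb.keys())
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def degree_centrality(pairs):
    """Normalized degree centrality in an undirected term network built from bigrams."""
    neighbors = defaultdict(set)
    for left, right in pairs:
        if left != right:  # ignore self-loops such as ("very", "very")
            neighbors[left].add(right)
            neighbors[right].add(left)
    n = len(neighbors)
    if n < 2:
        return {}
    return {term: len(nbrs) / (n - 1) for term, nbrs in neighbors.items()}


# Hypothetical toy inputs standing in for generated text and a literature abstract.
generated = bigrams("Aspirin reduces fever and aspirin reduces pain")
reference = bigrams("Aspirin reduces pain and inflammation")

print("Jaccard:", jaccard(generated, reference))
print("Cosine:", cosine(generated, reference))
print("Degree centrality:", degree_centrality(generated))
```

In this sketch, higher similarity scores mean the generated text shares more bigrams with the reference, and terms with high degree centrality are those that co-occur with many distinct neighbours. A real study would run this over full corpora and likely use a graph library for betweenness or closeness centrality.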

Acknowledgements

This publication is partially supported by the European Union’s Horizon 2020 research and innovation programme under grant agreement Sano No. 857533 and was carried out within the International Research Agendas programme of the Foundation for Polish Science, co-financed by the European Union under the European Regional Development Fund. It was also created in part under the Ministry of Science and Higher Education’s initiative to support the activities of Excellence Centers established in Poland under the Horizon 2020 programme, based on agreement No. MEiN/2023/DIR/3796.

Author information

Correspondence to Jakub Klimczak or Ahmed Abdeen Hamed.

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Cite this paper

Klimczak, J., Abdeen Hamed, A. (2024). Quantifying Similarity: Text-Mining Approaches to Evaluate ChatGPT and Google Bard Content in Relation to BioMedical Literature. In: Franco, L., de Mulatier, C., Paszynski, M., Krzhizhanovskaya, V.V., Dongarra, J.J., Sloot, P.M.A. (eds) Computational Science – ICCS 2024. ICCS 2024. Lecture Notes in Computer Science, vol 14836. Springer, Cham. https://doi.org/10.1007/978-3-031-63775-9_18

  • DOI: https://doi.org/10.1007/978-3-031-63775-9_18

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-63774-2

  • Online ISBN: 978-3-031-63775-9

  • eBook Packages: Computer Science, Computer Science (R0)
