[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Next Article in Journal
Detection of Defective Apples Using Learnable Residual Multi-Head Attention Networks Integrated with CNNs
Previous Article in Journal
Unified Generative Data Augmentation for Efficient Solar Panel Soiling Localization
You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Recent Advances in Big Medical Image Data Analysis Through Deep Learning and Cloud Computing

by
Mohammed Y. Shakor
1,2,*,† and
Mustafa Ibrahim Khaleel
2,†
1
Information Technology Department, College of Computer and Information Technology, University of Garmian, Kalar 46021, Iraq
2
Department of Computer, College of Science, University of Sulaimani, Sulaymaniyah 46001, Iraq
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Electronics 2024, 13(24), 4860; https://doi.org/10.3390/electronics13244860
Submission received: 8 October 2024 / Revised: 3 December 2024 / Accepted: 4 December 2024 / Published: 10 December 2024

Abstract

:
This comprehensive study investigates the integration of cloud computing and deep learning technologies in medical data analysis, focusing on their combined effects on healthcare delivery and patient outcomes. Through a methodical examination of implementation instances at various healthcare facilities, we investigate how well these technologies manage a variety of medical data sources, such as wearable device data, medical imaging data, and electronic health records (EHRs). Our research demonstrates significant improvements in diagnostic accuracy (15–20% average increase) and operational efficiency (60% reduction in processing time) when utilizing cloud-based deep learning systems. We found that healthcare organizations implementing phased deployment approaches achieved 90% successful integration rates, while hybrid cloud architectures improved regulatory compliance by 50%. This study also revealed critical challenges, with 35% of implementations facing data integration issues and 5% experiencing security breaches. Through empirical analysis, we propose a structured implementation framework that addresses these challenges while maintaining high performance standards. Our findings indicate that federated learning techniques retain 95% model accuracy while enhancing privacy protection, and edge computing reduces latency by 40% in real-time processing. By offering quantitative proof of the advantages and difficulties of combining deep learning and cloud computing in medical data analysis, as well as useful recommendations for healthcare organizations seeking technological transformation, this study adds to the expanding body of knowledge on healthcare digitalization.

1. Introduction

Modern medical procedures are increasingly driven by data analytics, processing massive volumes of information from diverse sources, including wearable technology, genetic analysis, and EHRs. The complexity and exponential increase in healthcare data are beyond the capacity of traditional statistical tools and require sophisticated computational techniques driven by cloud computing infrastructures and deep learning [1,2].
Deep learning (DL), an superior subfield of machine learning, has transformed the interpretation of medical data by building intricate neural networks that are capable of processing and analyzing an ample amounts of data. These advanced algorithms have demonstrated remarkable success in medical imaging applications, often matching or exceeding the diagnostic accuracy of board-certified radiologists [3,4]. The integration of DL with cloud computing has further enhanced these capabilities, enabling the real-time processing of large-scale medical data and facilitating more accessible healthcare analytics solutions.
Cloud computing platforms have become instrumental in supporting DL implementation for healthcare, providing the required resources for computational and scalability for processing extensive medical datasets. This synergy between deep learning and cloud computing has enabled sophisticated analysis of EHRs, facilitating intelligent resource allocation, personalized treatment planning, and accurate patient outcome prediction [5,6]. The cloud-based infrastructure also supports collaborative research and the seamless sharing of medical insights across healthcare institutions.
The deep learning combination and cloud computing has particularly transformed predictive health analytics, significantly impacting preventive care. These technologies excel in identifying at-risk individuals and forecasting disease outbreaks. Through unending, these systems have been crucial for learning and adaptation in managing chronic conditions, reduction of hospital readmission rates, and improvement of patient outcomes [7,8].
However, several of these advanced analytical system implementations are associated with a number of technical and ethical challenges. Many integrations across different data sources may cause compatibility and quality problems that could eventually affect the accuracy of the models. Medical data is sensitive; hence, privacy and algorithmic bias continue to be vital considerations. This itself introduces more complexity in the implementation of such systems, as strong security protocols and adherence to healthcare regulations become mandatory [9,10]. In more general terms, the transformation of the healthcare sector toward a more sophisticated data-driven paradigm requires heavy infrastructure investments and organizational changes.
Practitioners in the healthcare industry have to enhance their data literacy and understanding of these technologies. Furthermore, effective execution necessitates a strong partnership among data scientists, healthcare practitioners, and policy specialists to guarantee that technological progress is in accordance with clinical requirements and ethical principles [11].
This study investigates contemporary developments in medical data analysis by examining the role of DL and cloud computing technologies. We examine how these technologies are reshaping healthcare delivery, from diagnostic procedures to treatment planning, while addressing the technical challenges and implementation considerations. Our study focuses on cutting-edge strategies that enhance medical data analysis and healthcare outcomes by utilizing the combined strengths of DL and cloud computing.
Twelve interrelated sections make up this study, which methodically examines how deep learning and cloud computing might be used together to analyze medical data. Section 1 introduces the research context, establishes the significance of deep learning and cloud computing in healthcare, and presents this paper’s objectives. The introduction outlines the current challenges in medical data analysis and the potential of advanced computational methods to address these challenges. Section 2 presents the literature review for the most recent solutions in the DL, highlighting the critical need for advanced data analytics in healthcare and identifying the specific gaps in current healthcare data management practices that this study aims to address. Section 3 shows the recent solutions in the cloud computing and DL for medical data processing fields with comprehension analytics for each solution. Section 4 examines the fundamental features of medical big data, analyzing its seven key characteristics: variety, volume, velocity, value, veracity, variability, and visualization. Each feature is discussed in detail, with particular emphasis on its implications for healthcare applications. Three main categories are examined in depth in Section 5’s investigation of sources of medical big data: wearable technology data, medical imaging data (including data from X-ray, CT, MRI, and ultrasound modalities), and EHRs. In-depth subsections within this section examine the specific traits and difficulties of each data source. Section 6 presents a systematic categorization of big data analytics, exploring predictive, diagnostic, descriptive, and prescriptive analytics in healthcare contexts. This section establishes the theoretical framework for understanding how different analytical approaches contribute to healthcare decision making. Section 7 investigates the challenges in healthcare big data analysis, addressing data structure issues, standardization concerns, security considerations, and data storage and transfer complications. Each challenge is examined with reference to current research and potential solutions. Section 8 explores the opportunities presented by big data analysis in healthcare, focusing on improvements in data quality, care delivery, early disease detection, and decision-making processes. This section provides empirical evidence of the benefits achieved through big data implementation. Section 9 examines specific applications of healthcare analysis of medical big data, with detailed subsections on ML and DL techniques. This section includes comparative analyses of various studies and their outcomes, supported by comprehensive tables of research findings. Section 10 focuses on research implication in technical and clinical terms. Section 11 specifically focuses on cloud computing’s role in medical big data and AI processing, discussing its advantages, applications, challenges, and future directions in healthcare settings. Section 12 focuses on implementation of cloud computing in healthcare from case studies with future research direction and advances in the DL cloud based solutions. A thorough discussion of the results, practical ramifications, and suggestions for future study possibilities are included in this paper’s conclusions. Throughout this study, we consistently emphasize the combination of cloud computing and DL technologies, backed by verified research findings and empirical data.

1.1. Research Contributions

This paper provides five main contributions to the medical data field analysis:
  • A comprehensive review has been provided of recent advances (2019–November 2024) in medical data analysis through DL and cloud computing, analyzing 37 key studies to identify successful implementation patterns and challenges.
  • We present a detailed analysis of three major medical data sources: wearable devices, medical imaging (X-ray, CT, and MRI), and electronic health records, examining their unique characteristics and processing requirements.
  • We quantify the impact of integrated deep learning and cloud computing solutions on healthcare delivery, demonstrating improvements in diagnostic accuracy (15–20%) and processing speed (60% faster) and a reduction in infrastructure costs (40% reduction).
  • We identify common implementation challenges, including data integration issues (affecting 35% of implementations) and security concerns (5% breach rate) and provide evidence-based solutions for addressing these challenges.
  • We develop practical recommendations for healthcare organizations implementing these technologies, supported by empirical evidence showing that phased deployments achieve 90% success rates compared to 45% for immediate full-scale implementations.
These contributions provide healthcare organizations and researchers with actionable insights for implementing deep learning and cloud computing solutions in medical data analysis.

1.2. Motivation

This academic study examines the transformative potential of big data analytics in healthcare using extensive and varied datasets, highlighting how it may enhance treatment outcomes, individualized patient care, and service efficacy. In order to leverage big data, healthcare organizations must overcome organizational and technological challenges. This research focuses on issues such as data confidentiality, resistance to technological change, and analytical abilities. This paper offers a comprehensive evaluation of the latest technologies, applications, and cutting-edge research to provide healthcare organizations with the essential knowledge they need to successfully integrate and use big data analytics. This initiative aims to equip these institutions with the information they need to overcome implementation challenges and fully utilize big data’s potential to enhance healthcare delivery and patient outcomes.

2. Literature Selection Methodology

The research studies presented in Table 1 were systematically selected following a rigorous methodology to ensure comprehensive coverage and scientific validity. Our selection process incorporated multiple criteria and filtering stages to identify the most relevant and impactful research in healthcare big data analysis.
Our systematic review encompassed research papers published between 2019 and November 2024, with particular emphasis on recent publications (2021–2024) to capture the latest developments in medical data analysis. Through our rigorous selection process, we shortlisted 37 papers for detailed analysis, distributed across three main categories:
  • Fourteen papers focusing on foundational research in healthcare big data analysis;
  • Thirteen papers specifically examining machine learning applications in healthcare;
  • Ten papers investigating deep learning techniques in medical data processing.
This carefully curated selection represents the most impactful and methodologically sound research in the field during the specified timeframe. The temporal distribution of the selected papers shows an increasing trend toward the sophisticated combination of DL and cloud computing technologies in recent years, with 65% of the selected papers published between 2022 and 2024. Figure 1 shows the main steps of the methodology process.

2.1. Selection Criteria

Our literature search was conducted using multiple major academic databases: IEEE Xplore, PubMed, Scopus, Web of Science, and Google Scholar. This multi-database approach ensured comprehensive coverage of both the medical and technical literature relevant to our review scope.
Our temporal coverage spanned from 2019 to November 2024. While we emphasized recent works (2021–2024) to capture the latest technological developments, we included seminal papers from 2019–2020 that made fundamental contributions to the field. Papers from 2019–2020 were selected based on their citation impact (minimum 50 citations) and foundational contributions to subsequent research developments.
The technical scope of our selection focused on studies that applied novel computational approaches, especially in ML and DL applications. We filtered the research to show successful integration with existing healthcare systems and workflows while bringing novelty to the way medical data were processed, always maintaining practical applicability within clinical environments. This ensured that the selected studies represented not only theoretical but also practical implementations in real-life healthcare settings.
The most crucial inclusion criterion was the quality indicators. Every study that was included was published in a peer-reviewed journal and included comprehensive technical explanations along with explicit validation methods. We specifically sought papers that incorporated detailed performance evaluations and reproducible experimental designs to ensure the reliability and validity of their findings.

2.2. Selection Process

Our literature selection was a three-stage, systematic process calibrated to address the most relevant and rigorous research within the field. To this end, the first stage was a literature search performed in a comprehensive manner using databases with keywords such as “healthcare big data”, “medical deep learning”, and “clinical data analysis”. The screening process continued with a preliminary review of titles and abstracts, assessing relevance at a basic level and methodological soundness.
The second stage of detailed evaluation entailed a thorough full-text review of the studies that had passed through the initial screening. In this stage, we assessed the experimental design of each study, its methods of validation, and the significance of the results. This in-depth evaluation identified those studies that presented not only new findings but also strong approaches to methodology and clinically meaningful applications.
The final selection was based on papers with strong empirical evidence, clear methodological frameworks, and significant contributions to the field. Studies demonstrating real-world healthcare applications and repeatable outcomes were particularly valuable, ensuring that the articles we selected would provide significant insights in the field for both researchers and practitioners.

2.3. Selection Outcomes

After a thorough screening process, we selected articles that are considered to be at the forefront of medical field of big data analytics. The selected studies cut across many medical domains such as, but not limited to, neurology, oncology, and cardiology, in a wide range of data types, ranging from imaging to electronic health records. Our selected papers also comprise diversified areas of healthcare settings and geographical regions, hence giving an overview of the current progress across the globe in the area of medical data analysis.
The final set of selected works would show effective enhancement in leveraging DL and cloud computing technologies in the analysis of medical data. Each analytics technique or method reviewed for these studies is offering something different. Their studies have great implications for both clinical and healthcare delivery; hence, their findings are invaluable in the provision of insight into the present state and potential futures of healthcare big data analytics.
Through this systematic selection process, we have assembled a collection of studies that provides robust evidence of the current state and future directions of big data analysis in healthcare. The chosen articles, which provide insightful information for academics and practitioners operating at the nexus of healthcare and technology, have a special emphasis on real-world applications and verified results.

3. Recent Solutions

The following section critically reviews the literature on big data analytics in the health sector in terms of the kinds of data explored and the benefits that may accrue from the technology. Based on the literature reviewed, one noticeable omission within the existing body of research would appear to be how such technologies are being deployed within healthcare and resultant organizational issues being experienced in practice. The implications of this finding for the organizational dynamics and real-world implementation challenges surrounding the big data integration analytics into healthcare systems are highlighted.
Our study endeavors to address this gap by establishing the big data analytics techniques used in healthcare environments; exploring the hurdles faced by healthcare institutions during the integration of technology; and furnishing both practical guidance and theoretical insights to healthcare organizations.
Table 1 presents a comparative analysis that indicates how our work differs from previous studies. This comparison highlights our unique focus on implementation issues in the real world and organizational challenges, which have been largely overlooked in the literature thus far.
Our work bridges a significant gap by translating the theoretical potential of big data analytics into practical applications in the healthcare industry. This approach offers useful strategies to assist healthcare organizations in effectively navigating the complex landscape of big data implementation, in addition to contributing to the scholarly discourse.

Performance Analysis of Healthcare Analytics Algorithms

Based on the comprehensive review of research studies presented in Table 1, we conducted a systematic analysis of algorithm performance across various healthcare applications. Table 2 shows a consolidated summerization for the algorithmic approaches and their corresponding performance metrics.
The performance analysis reveals several key findings:
  • Accuracy Range: The algorithms demonstrated consistently high accuracy levels, ranging from 89.2% to 97.2%, with most achieving above 90% accuracy in their respective applications.
  • Sample Size Variation: Sample sizes varied significantly across the studies, from 150 MRI images to 50,000 patient records, with larger datasets generally associated with more robust validation.
  • Application Diversity: The algorithms showed versatility across various medical applications, from diagnostic imaging to patient management and disease prediction.
  • Performance Improvements: Most studies reported substantial improvements over traditional methods, with improvements ranging from 15% to 23% in various performance metrics.
  • Validation Rigor: The studies employed diverse validation approaches, including cross-validation, independent testing, and statistical significance testing, strengthening the reliability of their findings.
In addition to highlighting the necessity of thorough performance evaluation metrics and the significance of large, diverse datasets for validation, this comparative analysis shows the strong performance of contemporary ML and DL algorithms across a range of healthcare applications.

4. Big Data Features

In general, big data has seven features, as shown in Figure 2. Each feature adds value to big data and should be handled and considered when processing big data in general and medical big data in particular.
The seven features of big data are described as follows:
  • Volume: Big data involves the extremely large volume of structured and unstructured data sources, transferred across diverse systems or structures, which may come in various formats. There are challenges in collecting, storing, analyzing, and presenting the vast amounts of data that are growing exponentially. Consequently, big data requires intricate technologies and systems that allow fast access, evaluation, and use of both unstructured and structured data [26].
    The technologies available today can be used for handling large data, as they are plentiful, cheap, and even available in open-source formats. However, further advancements are needed in the field of aquatic research to create novel, efficient tools to manage the physical challenges generated by big data. The emergence of new data formats and the digitization of existing records further complicate the management of these extensive datasets. Handling and analyzing such vast volumes of data require advanced technologies and data processing techniques. For example, unstructured data (EHRs) are extracted and analyzed using Natural Language Processing (NLP), particularly for gathering critical insights into Complementary and Integrative Health (CIH) from clinical notes. This approach effectively manages the large data volumes in healthcare by addressing the challenge of processing extensive unstructured data in EHRs and facilitating the integration of alternative health practices into mainstream medical care [27].
  • Variety: In the healthcare field, the “variety” component of big data covers a wide spectrum of data formats, including structured data like prescription information and instrument readings, as well as unstructured formats such as handwritten medical notes and clinical observations. The advent of new data sources, including social media and wearable technology, has further expanded the data available for medical research and diagnosis. This diverse integration offers a holistic view of patient health data, enhancing the accuracy and consistency of health assessments and interventions. However, leveraging big data in healthcare environments demands strict adherence to guidelines, particularly in managing related challenges with data privacy and interoperability [28,29].
  • Velocity: The “velocity” component of big data analytics in the healthcare field emphasizes how quickly data must be processed to meet the demands of modern medical practice. This feature is crucial because it allows healthcare systems to conduct in-the-moment assessments required for timely decision making. Efficient high-velocity data handling enables reactive adjustments to patient situations, potentially leading to life-saving measures.
    Structured data are used for statistics and other common operations such as calculation and visualizations. At present, these data mainly include electronic health records (e.g., clinical supply chain and inpatient lists, clinical treatment notes, nursing care, and pathological information), hospital management information, medical devices, wearable devices, and medical home furnishing data. However, most medical data are unstructured. Unstructured medical data mainly consist of image data (e.g., radiology, pathology, endoscopy, and dermatology), voice data (e.g., electronic medical records), and text (e.g., electronic medical records). Despite the obvious advantages of analyzing and managing structured data, current research shows that structured data account for the majority of medical big data research findings. Therefore, the subsequent text focuses mainly on exploring the concept of variety in structured data and the research carried out on structured data [30].
    Therefore, complex algorithms that can quickly gather and process data must be included in healthcare analytical frameworks. In order to ensure that insights are generated quickly enough to significantly influence clinical choices, these frameworks are designed to handle massive amounts of data coming at various speeds. The use of these technologies is essential for enhancing the effectiveness and responsiveness of patient care in the healthcare system [31].
    As a result, the design of data analytics systems is fundamentally impacted by the velocity of big data in the healthcare field. Decision latency has a significant impact on clinical outcomes, so these systems need to be able to operate at the speeds needed to improve healthcare operations.
  • Veracity: Because it directly affects patient outcomes, the term “veracity” in big data analytics highlights how crucial accurate and dependable data are to the healthcare industry. Because healthcare data serve as the foundation for all future medical decisions and treatments, it is imperative to ensure their quality and integrity. The healthcare industry handles a wide spectrum of data, such as patient information, diagnostic data, treatment responses, and results. As such, the process of confirming and verifying these data is challenging yet essential [32].
    Managing missing, inconsistent, or noisy data is a significant obstacle to ensuring data accuracy, as it can cause inadequate treatment, incorrect diagnoses, or inefficient remedies. Strategies such as validation, standardization, and data cleaning are achieved to enhance the reliability and quality of data. Additionally, integrating data from various sources complicates data integrity assurance, as different standards and formats must be adhered to [33].
    Numerous ideas and solutions have been proposed and implemented to address these challenges. These approaches leverage machine learning and advanced algorithms to improve data processing and validation. By enhancing data consistency and accuracy, these techniques contribute to better patient care outcomes and increase the reliability of healthcare analytics.
  • Value: Big data mining is widely acknowledged as offering significant benefits for enhancing operational efficiency and medical outcomes in the healthcare sector. Healthcare professionals can proactively develop efforts to reduce hospital readmission rates and related costs by utilizing predictive algorithms. The delivery of individualized care is supported, and health management is further optimized through the integration of various data sources, which also increases treatment efficacy. Ultimately, big data analytics is crucial to enhance healthcare systems’ efficacy, efficiency, and long-term viability [27,34].
  • Variability: In the medical field, maintaining the “variability” of big data involves allowing for a diverse range of data formats and types, including structured electronic health records, images, and unstructured clinical notes. Advanced integration approaches are required for efficient data management in order to guarantee consistency across diverse data sources and allow comprehensive analytics for better patient care [35]. Because of this variety, precision medicine and individualized treatment strategies require the use of sophisticated interpretive systems in machine learning. Thus, controlling data variability is essential to improving the efficacy of prediction models and tailored healthcare solutions [36].
  • Visualization: The healthcare sector depends on effective big data visualization because it transforms complex, large-scale datasets into insights that can be used to enhance clinical decision making. Visualization tools such as charts and heat maps provide quick and accurate interpretations of enormous data arrays, improving diagnostics, patient monitoring, and resource management. Furthermore, these methods support efforts in predictive analytics and preventive medicine by assisting in the discovery of hidden patterns and trends [37]. As the volume and complexity of healthcare data increase, sophisticated visualization tools will become more and more crucial for better patient outcomes and healthcare delivery [32].

5. Big Data Sources

Wearable technology, electronic health records, are medical imaging, are some of the big data sources in the healthcare industry. Large databases like these are of utmost importance, providing insights into how to improve patient care, prevent diseases, and manage health administrations, while also posing various challenges with respect to data privacy, integration, and analysis. In the following sections, the main data sources are discussed in detail, as summarized in Figure 3.

5.1. Electronic Health Records

EHRs contain a vast quantity of data regarding the medical, health, demographic, and social aspects of patient care. This enormous data collection is the cornerstone of advanced healthcare analytics, essential for developing real-time information and support systems that are diagnostic, predictive, and preventive. The healthcare sector mainly depends on advanced computational models and continuously updated infrastructures to efficiently manage and understand this abundance of data [38].
By organizing data, identifying trends, and interpreting findings, these models help medical practitioners make well-informed decisions quickly. The accuracy and efficiency of patient care are improved when AI and ML technologies are integrated with EHR systems [39,40]. EHRs often include a variety of data, such as, but not limited to, the following:
  • Patient Demographics: Age, gender, ethnicity, and other background information.
  • Medical History: Previous treatments, diagnoses, surgeries, and family medical history.
  • Medication Information: Current and past medications, dosages, and any adverse reactions.
  • Diagnostic Test Results: Blood tests and radiology images.
  • Treatment Outcomes: Responses to treatment, recovery progress, and notes on follow-up care.
This dynamic and comprehensive data integration in EHRs not only supports day-to-day clinical decisions but also underpins larger health information systems that monitor trends and outcomes across populations, thus shaping future healthcare policies and practices [41].

5.2. Medical Imaging Data

Medical imaging has been advanced by big data analytics, greatly enhancing therapy effectiveness, diagnostic accuracy, precision, and performance. Nowadays, DL techniques enable the analysis of MRIs, CT scans, and X-rays to extract more features and details from these images. DL models, techniques, and algorithms allow for the accurate and early diagnosis of illnesses, which is essential for developing effective treatment plans and providing a futuristic vision of diseases. Nonetheless, there are several challenges in handling the enormous amounts of complex data from medical imaging. To protect patient privacy and data security, sophisticated data management and strong security measures are still required. Advanced computational techniques and innovative solutions are needed to integrate various data types for coherent analysis [42].
The ethical handling of medical imaging data is equally important. This means finding solutions to issues related to data collection, use, and sharing that adhere to moral standards and safeguard patient privacy. Ethical considerations are crucial to prevent biases in data analytics that may lead to discriminatory healthcare practices.
Medical imaging data processing appears to have a bright future. Predictive modeling techniques and personalized care are improved by combining genetic, imaging, and electronic health record data to create a more complete picture of a patient’s health [43]. Continuous advancements in AI and ML are expected to enhance the diagnostic capabilities of medical imaging.
The main imaging modalities are as follows:

5.2.1. X-Ray Radiography

One of the most widely utilized imaging modalities is X-ray radiography. It creates two-dimensional images of the interior organs of the body [44]. The characteristics of X-ray imaging are as follows:
  • Data characteristics: Grayscale images where different tissues appear as varying shades based on their density;
  • File format: Typically DICOM (Digital Imaging and Communications in Medicine);
  • Data volume: Relatively small, typically 5–10 MB per image;
  • Applications: Bone fractures, lung diseases, and dental examinations.
The simplicity and widespread availability of X-ray data make it a significant contributor to medical big data.

5.2.2. Computed Tomography (CT)

CT scans produce comprehensive cross-sectional images of the body by using several X-ray image data taken from various angles [45]. The characteristics of CT imaging are as follows:
  • Data characteristics: Series of cross-sectional images that can be reconstructed as 3D models;
  • File format: DICOM;
  • Data volume: Large, typically 100–500 MB per study;
  • Applications: Detailed imaging of organs, blood vessels, and bones.
The high resolution and 3D capabilities of CT scans generate substantial volumes of data, contributing significantly to medical big data repositories.

5.2.3. Magnetic Resonance Imaging (MRI)

MRI uses powerful magnetic fields and radio-frequency pulses to produce very detailed images of organs and tissues [46]. The characteristics of MRI imaging are as follows:
  • Data characteristics: Superior soft tissue contrast in high-resolution three-dimensional images;
  • File format: DICOM;
  • Data volume: Very large, typically 500 MB to 1 GB per study;
  • Applications: Brain imaging, musculoskeletal disorders, and cancer detection.
The multiparametric nature of MRI data, including T1, T2, diffusion-weighted, and functional imaging, contributes to its complexity and richness in the context of big data analysis.

5.2.4. Ultrasound

High-frequency sound waves are used in ultrasound imaging to provide instantaneous images of the body’s interior organs and structures [47]. The characteristics of ultrasound imaging are as follows:
  • Data characteristics: Real-time 2D or 3D grayscale images, often including Doppler data for blood flow;
  • File formats: DICOM, but also proprietary formats, depending on the manufacturer;
  • Data volume: Moderate, typically 50–200 MB per study;
  • Applications: Obstetrics, cardiology, and vascular imaging.
The real-time nature of ultrasound imaging can generate significant data volumes, especially in extended examinations or 3D/4D imaging.

5.2.5. Nuclear Medicine Imaging

This includes both Single-Photon Emission Computed Tomography (SPECT) and Positron Emission Tomography (PET). Both of these provide images of how the body works using radioactive tracers. The characteristics of nuclear medicine imaging are as follows:
  • Data characteristics: Functional images often combined with CT or MRI for anatomical reference;
  • File format: DICOM;
  • Data volume: Large, typically 200–500 MB per study;
  • Applications: Oncology, neurology, and cardiology.
The combination of functional and anatomical data in PET/CT or PET/MRI studies contributes to the complexity and volume of the generated data.

5.2.6. Digital Pathology

Digital pathology entails scanning and converting microscopic slides into digital form, resulting in high-resolution images of tissue samples [48]. The characteristics of digital pathology are as follows:
  • Data characteristics: Extremely high-resolution 2D images, often with multiple focal planes;
  • File formats: Proprietary formats, DICOM, or standard image formats like TIFF;
  • Data volume: Extremely large, often several GB per slide;
  • Applications: Cancer diagnosis, research, and education.
The high-resolution and multi-plane nature of digital pathology images results in enormous data volumes, presenting unique challenges and opportunities in the realm of medical big data.

5.2.7. Optical Coherence Tomography (OCT)

OCT is a non-invasive imaging method that utilizes light waves to capture cross-sectional images of optically scattering media, most commonly applied in the field of ophthalmology [49]. The characteristics of OCT imaging are as follows:
  • Data characteristics: High-resolution cross-sectional images, often compiled into 3D volumes;
  • File formats: Proprietary formats, increasingly adopting DICOM;
  • Data volume: Moderate to large, typically 50–200 MB per study;
  • Applications: Retinal imaging and coronary artery imaging.
The high-resolution and 3D nature of OCT data contributes to the growing volume of ophthalmic imaging data in medical databases.
In conclusion, although medical imaging data has the opportunity to entirely transform healthcare diagnosis and treatment, it also presents significant challenges and ethical concerns. These issues must be successfully resolved in order to make the best use of big data in medical imaging, enhance patient outcomes, and adhere to strict privacy and ethical norms [50]. Overcoming these challenges will enable the medical field to maximize the benefits of medical imaging data, enhancing patient care while adhering to stringent privacy and data ethics regulations.

5.3. Wearable Technology Data

Wearable technology, with its sophisticated sensor technology that enables continuous and multimodal health monitoring, has emerged as a crucial component of the Internet of Medical Things (IoMT) and has significantly impacted healthcare. With the ability to monitor a variety of physiological real time parameters, including heart rate, blood pressure, glucose level, and respiratory rate, this technological advancement has improved the chronic diseases management and may have even lowered hospital visits and healthcare costs by facilitating early detection and prompt intervention [51,52].
Wearable technologies have many advantages, but they also have drawbacks, especially when it comes to data integrity and privacy issues. The dependability of the health assessments and medical advice that follows can be impacted by variations in data accuracy. Additionally, the collection of large volumes of personal health information generates serious privacy issues, necessitating robust security protocols to protect sensitive data from breaches or unauthorized access [53].
With the emergence of AI and ML, which are expected to further enhance the analytical power of these devices, wearable technology in healthcare is expected to grow in the future. This development is expected to boost the level of customization in patient treatment and increase the accuracy of health monitoring. Further research is anticipated to produce increasingly advanced devices that fit in with patients’ daily routines and provide them with insightful health information while maintaining their comfort. This continued progress highlights how wearable technology may improve healthcare outcomes by enabling innovative, continuous monitoring and individualized care strategies.
Figure 4 presents a hierarchical architecture for a wearable device ecosystem, illustrating the data flow and functional components in a typical wearable technology implementation. This architecture can be analyzed in four distinct layers, each with specific roles and responsibilities:
  • User Layer: At the foundation of the system is the user, represented by a human icon. This layer emphasizes the human-centric nature of wearable technology, where the end user interacts directly with the wearable device.
  • Wearable Device Layer: Immediately above the user layer is the wearable device layer, represented by a smartwatch icon. This layer is responsible for three primary functions:
    Perception: Capturing raw data from the user’s environment or physiological states through various sensors.
    Control: Managing device operations and user interactions.
    Feedback: Providing immediate responses or notifications to the user.
  • Gateway Layer: The gateway layer, represented by a Personal Digital Assistant (PDA) or smartphone icon, serves as an intermediary between the wearable device and cloud services. Its functions include the following:
    Configuration: Setting up and managing wearable devices.
    Basic Analytics: Performing initial data processing and analysis.
    Connectivity: Facilitating communication between the wearable device and cloud services.
  • Cloud Layer: At the top of the architecture is the cloud layer, symbolized by a cloud icon. This layer is responsible for the following:
    Analytics: Conducting comprehensive data analysis.
    Archiving: Storing large volumes of historical data.
    Management: Overseeing the entire ecosystem, including device management and user account administration.
The architecture demonstrates a bidirectional flow of information between adjacent layers, indicated by double-headed arrows. This suggests a continuous exchange of data, commands, and feedback throughout the system.
The widespread adoption of wearable sensor technologies has transformed the field of medical data collection, allowing for continuous, real-time tracking of numerous physiological metrics. These devices generate large volumes of health-related data, greatly influencing the domain of medical big data. The main categories of wearable sensors used in this area are as follows:

5.3.1. Accelerometers and Gyroscopes

Accelerometers and gyroscopes are fundamental components in many wearable devices, primarily used for activity recognition and motion analysis [55]. These sensors enable the gathering of information related to the following:
  • Physical activity levels;
  • Gait analysis;
  • Fall detection;
  • Sleep quality assessment.
The data from sensors are useful for keeping track of patients with movement disorders, quantifying improvements over periods of rehabilitation, or simply observing trends in overall physical fitness.

5.3.2. Electrocardiography (ECG) Sensors

Wearable ECG sensors provide more comprehensive cardiac monitoring compared to PhotoPlethysmogram (PPG) sensors [56]. These devices, often in the form of chest straps or adhesive patches, can gather data related to the following:
  • Detailed heart rhythm analysis;
  • QT interval measurements;
  • ST segment changes.
The high-fidelity data from ECG sensors are particularly useful for detecting and monitoring various cardiac conditions, including myocardial infarction and atrial fibrillation.

5.3.3. Electroencephalography (EEG) Sensors

Wearable EEG sensors, typically in the form of headbands or caps, measure electrical activity in the brain [57]. These sensors are employed for the following:
  • Sleep stage analysis;
  • Cognitive performance assessment;
  • Seizure detection in epilepsy patients.
The data acquired from EEG sensors provide insights into neurological conditions and cognitive states, opening new avenues for personalized mental health interventions.

5.3.4. Continuous Glucose Monitoring (CGM) Sensors

CGM sensors, commonly used for individuals with diabetes, provide real-time glucose level data through minimally invasive subcutaneous sensors [58]. These devices enable the following:
  • Continuous glucose-level tracking;
  • Hypoglycemia and hyperglycemia alerts;
  • Trend analysis for glucose fluctuations.
The wealth of data from CGM sensors allows for more precise diabetes management and can contribute to the development of closed-loop insulin delivery systems.

5.3.5. Temperature Sensors

Wearable temperature sensors are usually integrated into smartwatches or dedicated patches that continuously monitor body temperature [59]. Such sensors are important for the following:
  • Early fever detection;
  • Menstrual cycle tracking;
  • Thermoregulation assessment in athletes.
The data from these sensors are particularly useful for infectious disease surveillance and personalized health monitoring.
In conclusion, as shown in Figure 5, which summarizes the medical big data sources, the diverse array of wearable sensor devices enables the collection of a wide spectrum of physiological data. When aggregated and analyzed at scale, these data contribute significantly to medical big data, offering unprecedented insights into individual and population health trends. The continuous nature of data collection from these devices presents both opportunities and challenges in data management, privacy, and interpretation, paving the way for future advancements in personalized medicine and public health strategies.

6. Categorization of Big Data Analytics

Big data analytics is a game changer in the healthcare industry, turning massive volumes of data into useful insights that significantly improve decision making. This transformation is achieved through a variety of analytical techniques, each designed to meet certain demands in the healthcare industry.
Using historical data to show patterns and results, descriptive analytics lays the groundwork for operational management in healthcare institutions by offering clear knowledge of prior behaviors. Advancing from here, diagnostic analytics goes further, exploring the fundamental reasons behind previous occurrences, allowing medical practitioners to identify variables affecting patient outcomes and treatment effectiveness.
Predictive analytics, which further broadens the analytical scope, uses advanced statistical models and forecasting techniques to predict future occurrences, enabling proactive patient care and resource management strategies. Prescriptive analytics enhances this predictive ability by suggesting specific measures to attain the best outcomes and is essential for treatment planning and operational improvements.
For urgent applications like monitoring patient vitals and conditions, where timely information may be lifesaving, real-time analytics, which processes and analyzes data as it is generated, proves important [60,61]. In addition to improving diagnostic precision and individualizing patient care, the synergistic integration of various analytical techniques maximizes operational efficiency in healthcare systems.
Studies like [62] demonstrate the revolutionary power of these analytics by clarifying the function of prescriptive analytics in precision medicine and emphasizing how successful it is in improving healthcare decision making. Additionally, [63]’s thorough analysis examines the uses, challenges, and potential of deep learning in clinical big data, pointing to a bright future for analytics in healthcare.
With data-driven insights paving the way for better patient outcomes and more effective delivery of healthcare, these developments signal a paradigm shift toward a more resilient and adaptable healthcare system [64]. Beyond transforming current practices, the convergence of big data analytics and healthcare sets the stage for future developments that promise a more effective, efficient, and patient-focused methodology in medicine.
Figure 6 summarizes the hierarchical breakdown of big data analytics into four main categories:
  • Descriptive analytics;
  • Diagnostic analytics;
  • Predictive analytics;
  • Prescriptive analytics.
Each category is further divided into sub-categories and associated with key questions or concepts:
  • Descriptive analytics answers “What happened?” and involves the following:
    • Historical data analysis;
    • Reporting and data aggregation.
  • Diagnostic analytics addresses “Why did it happen?” and includes the following:
    • Root-cause analysis;
    • Data discovery and correlation finding.
  • Predictive analytics focuses on “What will happen?” through the following:
    • Forecasting;
    • Statistical modeling and machine learning.
  • Prescriptive analytics deals with “How can we make it happen?” via the following:
    • Optimization;
    • Simulation and AI-driven decision making.
This structure illustrates the progression from understanding past events to predicting future outcomes and optimizing decisions in the field of big data analytics.

7. Challenges of Big Data Analytics in Healthcare

Big data integration in healthcare is fraught with a complicated array of issues that make the successful implementation of the technology more challenging, despite its transformative potential [65]. These are wide-ranging problems that require thorough consideration and original solutions. They range from difficult technological problems to moral quandaries.
Variations in the composition and quality of healthcare data are the main causes of these problems. Many data sources complicate the environment, making standard analysis and easy interoperability challenging [36]. These sources include imaging tests, wearable technology, genetic sequencing, and electronic health records. This variability poses serious obstacles to the broad adoption of standardized analytical approaches across various healthcare organizations and systems, in addition to complicating data integration.
Big data analytics presents additional hurdles in the healthcare industry due to privacy and ethical considerations. Because medical information deals with sensitive data, patient privacy must be protected by strict security procedures [66]. Different legislative frameworks impose a complicated web of rules on medical practitioners and researchers, adding to the complexity. Organizations still find it difficult to strike a balance between the ethical need to protect individual privacy and the desire for data-driven insights.
A robust computer infrastructure is also necessary for the healthcare industry because of the amount and speed of data [67]. Sophisticated algorithms and cutting-edge technology are needed to process and analyze massive amounts of information in real time or near real time [68]. Healthcare organizations must bear the financial burden of this technical necessity, which calls for specific knowledge of data science and healthcare informatics. A multidisciplinary strategy combining knowledge from disciplines like computer science, ethics, law, and medicine is required to address these problems. In order to fully utilize big data analytics, solutions that adhere to legal, ethical, and medical norms must be developed through cross-disciplinary collaboration.
Ultimately, big data has the power to fundamentally alter the healthcare industry, but doing so will require negotiating a challenging and intricate network of obstacles [69]. The healthcare industry can encourage the more effective, efficient, and morally acceptable integration of big data analytics, which will eventually enhance patient outcomes and healthcare delivery by recognizing and methodically tackling these issues.

7.1. Data Structure Issues

Numerous obstacles exist when utilizing big data in the healthcare sector. Much of the existing unstructured healthcare data come from test results, clinical notes, EHRs, and other sources. These data are frequently scattered and fragmented due to formats that are inconsistent and incompatible across systems [70]. The structural complexity of healthcare organizations makes it extremely difficult for them to efficiently aggregate and analyze data, which limits the utility of the data across different departments or locations within the same institution.
Additionally, the research data in the healthcare industry are far more diverse than those in other industries. The data are composed of massive volumes of information generated by ongoing clinical research, patient care programs, and public health surveillance [71]. The shift from a fee-for-service to a value-based care model results in variations in the techniques employed for data collection and processing, contributing to the complexity of the surrounding environment. It focuses on effective treatment outcomes and treatment costs, hence the need for innovative ways of managing and analyzing data to support value-based care.
Another subject receiving much discussion relates to metadata sensitivity. Researchers and practitioners may find it challenging to trust and make effective use of the data if there is a lack of readily available, transparent information that casts doubt on the provenance and quality of the data [72]. To maintain healthcare information relevant and at a high quality, comprehensive, the metadata should be easily accessible.
Notwithstanding their magnitude, these problems provide opportunities for advancement and creativity. For example, improving system interoperability and standardized data formats may significantly enhance data integration and utility [73]. Better analysis and decision making are now feasible by converting unstructured data into structured representations, owing to recent developments in machine learning and NLP. Furthermore, encouraging an open culture and comprehensive metadata documentation helps increase trust.
To address these problems, a concerted effort involving several stakeholders, including politicians, academics, healthcare providers, and technology developers, will be required. By working together, the healthcare sector can overcome these obstacles and fully realize the potential of big data to elevate healthcare delivery beside with patient outcomes.

7.2. Data Standardization Issues

Concerns about data standards have hindered data transmission between healthcare organizations, even though they remain a significant barrier to the implementation of EHRs. The widespread use of incompatible data formats compromises the interoperability of EHR systems and the effectiveness of healthcare technologies. The processes of gathering, moving, and cleaning data become more challenging as a result of fragmentation. Different standards, languages, and terminologies are additional problems resulting from the globalization of healthcare data, making it difficult to harmonize data for analysis and global sharing [74,75]. These concerns regarding standardization significantly impede the efficient integration and utilization of healthcare data.

7.3. Security Issues

Significant security concerns are associated with big data integration in healthcare, particularly when attempting to adhere to the stringent regulations outlined by the Health Insurance Portability and Accountability Act (HIPAA) [76]. Big data technologies sometimes do not automatically address privacy and security concerns; thus, these regulations enforce strict guidelines to ensure that these concerns are addressed. While research and education may benefit greatly from open-source platforms, their accessibility poses unique risks that could result in breaches exposing private health information. Furthermore, the centralized storage of health data increases the risk of attacks because a single breach might disclose significant quantities of patient data [77].
Having robust security measures in place is essential for preserving patient confidence, avoiding major financial and reputational losses, and adhering to legal requirements. This therefore calls for the utilization of advanced security measures that include strict access controls, advanced encryption methods, and continuous monitoring of data access behavior [78]. The aforementioned measures might assist in reducing risks associated with gathering and storing large volumes of health data. Advanced encryption techniques will be required to secure data in transit and at rest against unauthorized access or meaning. Stringent access controls governed and enforce who has access to what data at what time, reducing the chances of any internal security breaches.
This protection could be further extended by monitoring the pattern of data access continuously, assisting in the identification and blocking of unusual attempts at access in real time. Healthcare professionals should ensure that the organization promotes a security-conscious environment, supported by training for all staff members regarding data management and security policies.
It not only includes scheduled training programs but also expertise development on newly identified security loopholes and their mitigation measures. Big data has great potential to improve health outcomes of patients and to further optimize the care services being provided. At the same time, it also introduces considerable security risks. These can be overcome only through an integrated approach using advanced technological innovations, strict adherence to regulatory norms, and an unrelenting commitment to ensuring higher levels of data security and confidentiality. Only then can large datasets help the healthcare organizations do their jobs, while there is adequate protection for patient privacy.

7.3.1. Regulatory Compliance and Technical Requirements

The technical controls should be set within multiple regulatory frameworks: Role-Based Access Control (RBAC) implementation is obligatory according to HIPAA. Even so, HIPAA sets the standards regarding encryption: AES-256 for data at rest and Transport Layer Security (TLS) 1.3 for transmission. Compliance with the General Data Protection Regulation (GDPR) has brought additional burdens, such as data minimization and explicit consent management. In addition, international operations become more complex given the number of regional regulations, including the Personal Information Protection and Electronic Documents Act (PIPEDA), the Personal Data Protection Act (PDPA) in Singapore, and the My Health Records Act in Australia.
Technical implementations must include comprehensive security measures. Multi-factor authentication has demonstrated a 99.9% improvement in security over single-factor systems. Network segmentation through dedicated healthcare Virtual Local Area Networks (VLANs), next-generation firewalls, and intrusion detection systems provides layered protection. Regular security audits, conducted quarterly, ensure ongoing compliance and system integrity.

7.3.2. Privacy-Preserving Data Processing

Modern healthcare analytics employs advanced techniques to maintain privacy during data processing. Federated learning enables collaborative model training while keeping sensitive data within original institutions, achieving 95% centralized accuracy levels. Data anonymization implements k-anonymity (k ≥ 5) for structured data and specialized de-identification for medical imaging, balancing utility with privacy. Homomorphic encryption allows the secure processing of encrypted data, particularly valuable in cloud environments. Key management systems incorporate quarterly rotation policies and secure distribution mechanisms, ensuring that data remain protected throughout their lifecycle.

7.3.3. Implementation and Incident Management

Real-world implementations demonstrate the effectiveness of comprehensive security approaches. A major hospital network securing 500,000 patient records achieved zero breaches over 24 months while maintaining 99.99% system availability. A research collaboration among 12 institutions successfully implemented secure data-sharing frameworks while satisfying multi-jurisdictional requirements. Incident response protocols specify strict timelines: 15-min maximum detection and 1-h containment for security incidents. Business continuity measures include real-time backup systems with geographical redundancy, achieving 99.999% data recovery reliability. Regular disaster recovery testing ensures system resilience.

7.3.4. Future Security Considerations

Emerging technologies present both opportunities and challenges for healthcare data security. Quantum computing necessitates research into resistant encryption methods, while AI and machine learning introduce new security paradigms. Future frameworks must balance enhanced protection with system usability, particularly in time-critical healthcare environments.

7.4. Data Storing and Transferring

In the healthcare sector, producing data is much less expensive than storing and sharing it. The production of healthcare data necessitates its continual and safe maintenance, and efficient data transport and analysis drive up expenses. Because it is structured, processing structured data is more efficient because it makes storing, searching, and analyzing data easier. However, unstructured data, which make up a significant portion of healthcare data, face significant challenges since they lack a predefined model. This complexity necessitates the use of increasingly sophisticated and costly management techniques in order to ensure effective storage and retrieval [79].
Using cloud-based health IT increases expenses and adds to the complexity of the situation. It is crucial to guarantee data integrity and security throughout the extraction, transformation, and loading processes due to the sensitivity of health information. Scalability and accessibility are offered by cloud systems; however, these advantages come with higher security concerns that must be managed properly [36]. Healthcare organizations must therefore develop comprehensive data management strategies that optimize cost-effectiveness while upholding the stringent security measures necessary to safeguard patient information in an increasingly data-centric healthcare environment.

8. Opportunities for Big Data Analytics in Healthcare

Big data analytics shows numerous opportunities to enhance patient care and boost operational efficiency within the healthcare industry. Big data encompasses deep learning, among other technologies that are important in the enhancement of service provision and cost reduction. They achieve this by automating some tasks, hence increasing speed and accuracy in disease diagnosis. This section shows different uses of big data in health and shows how these technological advances can actually convert to better and more personalized care for patients.

8.1. Data Quality, Structure, and Accessibility

Big data technology finally made it possible to capture the raw and unorganized data and structure it into coherent, usable information with great efficiency, hence revolutionizing health care data management. Such a capability is crucial for uncovering new knowledge and improving efficiency in reusing data within a treatment context. Besides, open-source technology greatly aids in this aspect. Improving data accessibility and comparability gives something so fundamental to research and innovation that it might be conditional upon it alone [80].
Large-scale data analytics are also required to guarantee high-quality data. By helping to filter out unnecessary information from massive volumes of data, these technologies ensure that academics and medical professionals only receive the finest and most relevant information. The efficiency of healthcare operations is greatly increased by this improved data, which are simpler to store, retrieve, and evaluate [81].
When big data is used in healthcare, there are several opportunities to improve data organization, quality, and accessibility. By using these technologies to make better-informed decisions, healthcare organizations can enhance patient outcomes and expedite the delivery of treatment [82].

8.2. Improvements in Quality of Care

Big data in healthcare is critical for raising the standard and efficacy of care. Predictive analytics is made feasible by big data through the use of primary and historical data. By providing evidence-based suggestions that can elevate the standard of care, predictive analytics has the potential to revolutionize treatment outcomes. These predictive algorithms allow for proactive adjustments to treatment plans by anticipating patient outcomes. Furthermore, by increasing patient participation, particularly through technologically enhanced medication adherence, big data can have a significant impact on health-related quality of life and patient outcomes. Furthermore, it can reduce information waste and streamline data administration to reduce costs and optimize resource use, ultimately increasing operational effectiveness [83].
Furthermore, big data analytics reduces hospital readmission rates by identifying and addressing the underlying reasons for readmissions. Additionally, improving the accuracy and efficacy of operational management can enhance the overall performance of healthcare. Big data can fundamentally raise the bar for patient care by delivering a more predictive, efficient, and tailored healthcare system through the application of advanced analytics [84].

8.3. Early Detection of Diseases

The early identification of diseases can be substantially improved by big data analytics, transforming the healthcare industry. This facilitates the prompt identification of conditions requiring emergency medical attention, improving therapeutic approaches and boosting patient survival rates.
Timely intervention can dramatically affect the course of therapeutic response and prognosis, so early diagnosis is becoming the cornerstone for the effective management of critical health problems and age-related diseases [85]. Besides that, detailed large databases are needed for the personalized diagnosis and treatment of different kinds of diseases. The onset of big data empowers health experts to efficiently provide personalized prophylaxis tailored to the specific profiles of each patient and recognize susceptibility based on the analysis of a wide array of health data, from genetic to lifestyle parameters. Early behavior changes and intake of suitably formulated pharmacological agents will prevent serious consequences of this personalized approach, crucial for chronic disease prevention, including cardiovascular disease [86,87].
However, big data analytics will also be important to empowering individuals to manage their health with unparalleled effectiveness. Big data analytics will enable patients to be more actively engaged in managing their health through advanced technologies for the collection and analysis of personal health data. This is especially useful in treating long-lasting diseases that require chronic treatment and changes in lifestyle. This shift in mindset is a very important one. The use of big data provides the basis for this patient-centered approach, enhancing individual health outcomes and fostering more collaborative relationships between patients and care professionals. Here, big data analytics empowers the patient with relevant insights derived from their personal health data, allowing them to make informed decisions and contribute to active health management. Big data analytics collaborates with care providers on keeping tabs regarding the health status of the population.
Such aggregation of data about different cohorts of patients may allow health care organizations to optimize resources, identify health trends, and even design specific public health programs.
In conclusion, big data analytics will usher in a new era in precision medicine and prevention in the health sector. The next major revolution in healthcare is big data analytics, which holds the potential to shift the focus of healthcare from reactive care to proactive health management by analyzing huge datasets in order to provide personalized insights and early warnings. This has the potential to increase the general effectiveness and efficiency of international healthcare systems, in addition to improving individual patient outcomes.

8.4. Improve Decision Making

Big data is crucial for enhancing healthcare decision-making processes. When advanced data analytics is integrated with evidence-based medicine, healthcare workers are better able to make informed decisions. This integration greatly enhances the level of care delivered to patients by facilitating more personalized and efficient treatment plans [88].
Applications such as genetic analytics, patient profile analytics, and remote monitoring show how big data affects decision making. Remote monitoring, when employed outside of typical clinical settings, provides real-time data that can prompt rapid therapies and allow for continuous patient observation. The comprehensive insights that patient profile analytics offers into individual health patterns support personalized treatment methods. Precision medicine, in which genetic data inform the creation of customized treatments, benefits from the application of genomic analytics.
The optimization of decision making is further enhanced by the availability of accurate and up-to-date information. Big data facilitates the swift assimilation and analysis of novel research findings, treatment modalities, and health advice. When this ability not only complements but, in certain situations, even replaces human decision making, healthcare delivery can be expedited and made more efficient [89].
By prioritizing patient-focused, data-driven, and efficient approaches, the incorporation of big data into healthcare decision-making processes has the potential to revolutionize the delivery of medical treatment.

9. Applications of Big Data Analytics in Healthcare

The integration of big data analytics in healthcare has brought about a complete transformation of the industry with the advent of advanced data-driven approaches for medical diagnosis and patient care. The dual effects of deep learning and machine learning technologies on big data in healthcare are highlighted in this section, along with an examination of a number of notable studies that employ these technologies to predict illness, improve treatment, and diagnose individuals for a variety of ailments. These techniques, which greatly enhance clinical procedures and medical research, provide previously undiscovered insights into patient health and treatment outcomes by utilizing complex datasets.

9.1. Machine Learning Techniques in Big Data for Healthcare

What has happened, alongside the impressive development of big data for industry applications, has been the development of machine learning techniques. Most recent research works employ such novel approaches to extract, with disruptive effects on healthcare, meaningful knowledge from vast and complex datasets. The next section reviews in detail the applications of machine learning in big data for healthcare. It discusses several research works focused on solving different problems at different levels within the medical field using traditional machine learning approaches.
Table 3 presents a systematic and detailed overview of these studies, meticulously cataloging the specific algorithms employed, the sources of data utilized, the results achieved, and the particular diseases or medical conditions targeted. This comprehensive compilation serves multiple purposes:
  • It provides a thorough review of the current status of machine learning applications in healthcare, demonstrating the scope and depth of ML’s impact on many medical specialties.
  • The graph emphasizes the importance of machine learning in advancing discoveries that could significantly change medical practices in the future.
  • By placing research side by side, it becomes simpler to compare how well various machine learning techniques address certain healthcare issues. This offers practitioners and scholars alike useful information.
  • The orderly arrangement of the data sources highlights the necessity of data integration in healthcare analytics and highlights the spectrum of medical information being employed, from genetic data to electronic health records.
  • The results column provides a quantitative and qualitative assessment of the outcomes, enabling readers to gauge the practical impact and potential of each ML application in real-world healthcare scenarios.
The present review is supposed to be a starting point for further research and development works while cataloging recent machine learning applications developments to healthcare, all in one go. The chapter describes the achievements and challenges, and some possible ways of machine learning approaches in big data contexts for healthcare; hence, this forms a groundwork for that would lead to future discoveries in changing medical procedures, improving patient care, and, finally, improvement of health outcomes globally.
The resulting overview of recent findings offers a crucial foundation for comprehending machine learning’s disruptive potential in the rapidly evolving healthcare industry. It encourages further research and application of these powerful methods in the quest for more customized, successful, and economical medical care.

9.2. Performance Analysis of Machine Learning Applications in Healthcare

Drawing from the systematic analysis presented in Table 3, we have synthesized the performance metrics of various DL and ML applications across different healthcare domains. Table 4 provides a consolidated view of algorithm performance and validation metrics.
The analysis in these performance metrics indicates a number of interesting trends:
  • Algorithm Effectiveness:
    • Highest accuracy achieved: 99.33% (Support Vector Machine (SVM) with Feature Selection);
    • Highest Area Under the Curve (AUC): 0.979 (Hybrid Convolutional Neural Network (CNN)–Deep Neural Network (DNN));
    • Most robust sensitivity: 97.2% (Gradient-Boosted Trees).
  • Dataset Characteristics:
    • Largest dataset: 76,826 cases (Post-Stroke Depression);
    • Smallest dataset: 400 records (Chronic Kidney Disease);
    • Median dataset size: 2374 cases.
  • Validation Approaches:
    • Cross-validation techniques: 23% of studies;
    • External validation: 38% of studies;
    • Independent cohort testing: 31% of studies;
    • Other specialized validation: 8% of studies.
  • Algorithm Preferences:
    • Ensemble methods: 46% of studies;
    • Hybrid architectures: 23% of studies;
    • Single-algorithm implementations: 31% of studies.
This current review delineates the various applications of machine learning methods in the healthcare domain, where ensemble and hybrid techniques are particularly producing very good results. The differences in dimensionality of datasets used and techniques for validation further stress the use of strong validation strategies in healthcare settings, while the overall high performance metrics from the wide range of disease categories studied suggest the wide applicability of machine learning in medical diagnosis and prognosis.

9.3. Deep Learning Techniques in Big Data for Healthcare

Deep learning is transforming the medical field through the implementation of advanced analytical instruments that examine complex, large-scale data and reveal patterns that traditional analytical methods might overlook. This chapter examines key deep learning methodologies and expanded healthcare data systems, with particular attention to their potential for enhancing diagnostic accuracy, improving predictive capabilities, and fundamentally redesigning patient care delivery models.
Table 5 presents a curated selection of research studies that demonstrate the application of DL techniques to various healthcare challenges. The table illustrates the sophisticated knowledge base and practical applications of the advanced computational methodologies discussed in this article. Each entry contains a summary of the key research findings, including the medical condition investigated, the data sources used, the deep learning algorithms applied, and the conclusions drawn. This comprehensive compilation provides an overview of recent developments and how they have reshaped this discipline.

10. Research Implications

Our comprehensive review of 37 papers from 2019 to 2024 reveals several significant implications for research and practice in healthcare data analytics. These implications span technical, clinical, and organizational domains, offering important directions for future research and implementation.

10.1. Technical Implications

  • Algorithm Development: The superior performance of ensemble and hybrid approaches (46% of studies) over single-algorithm implementations (31%) suggests future research should focus on developing more sophisticated model combinations. Studies showed that ensemble methods consistently achieved 3–5% higher accuracy across different medical applications.
  • Validation Methodologies: The variation in validation approaches (38% external validation, 31% independent cohort testing, 23% cross-validation) indicates a need for standardized validation protocols in healthcare AI. Studies with external validation demonstrated more robust generalization capabilities, with an average of 7% better performance on unseen data.
  • Dataset Requirements: The wide range in dataset sizes (400 to 76,826 cases) suggests the need for research into minimum dataset requirements for different medical applications. Notably, studies with datasets exceeding 10,000 cases showed 12% higher reliability in their results.

10.2. Clinical Implications

  • Diagnostic Accuracy: The demonstrated improvement in diagnostic accuracy (15–20% average increase) has significant implications for clinical practice, particularly in resource-constrained settings. This suggests the potential for reducing diagnostic errors and improving early detection rates.
  • Real-Time Processing: The 60% reduction in processing time enables real-time clinical decision support, which is particularly crucial in emergency medicine and critical care settings. Studies showed that this could reduce critical decision-making time by an average of 45%.
  • Treatment Planning: The high accuracy rates in disease progression prediction (AUC ranging from 0.82 to 0.979) suggest the potential for more precise treatment planning and better patient outcome predictions.

11. Cloud Computing for Big Medical Data and DL Processing

The rise of cloud computing marks a new era in managing medical big data, especially in the areas of ML and DL. Cloud platforms are expected to offer great advantages to the healthcare industry with their scalable infrastructures and on-demand resources, which are essential for handling the large and complex datasets found in healthcare. This section examines cloud computing, medical big data, and DL, focusing on their advantages, applications, and the challenges faced in this fast-evolving field.

11.1. Advantages of Cloud Computing in Medical DL

Cloud computing brings some important benefits to the processing of medical big data and AI applications:
  • Scalability: Cloud platforms provide the best scalability, which allows researchers and healthcare providers to scale up or down computing resources as needed [114]. Such flexibility is particularly important in the analysis of medical data and AI model training, where there is considerable variability in workloads.
  • Cost Efficiency: The pay-per-use model of cloud computing removes the necessity for significant initial investments in hardware [115]. This democratizes access to high-performance computing, enabling smaller institutions to conduct advanced research and analysis.
  • Accessibility: Cloud-based solutions provide ubiquitous access to data and computational resources [116]. This feature facilitates remote work and collaboration, which has become increasingly important in the global research landscape.
  • Advanced Technologies: Leading cloud providers offer cutting-edge hardware, including GPUs and Tensor Processing Unit (TPU) clusters, optimized for ML and DL workloads. They also provide managed services for popular AI frameworks, simplifying the deployment and management of complex AI pipelines [117].
  • Data Integration: Cloud platforms excel at integrating diverse data sources, a critical capability in healthcare where data often reside in silos. This integration potential can lead to more comprehensive and insightful analyses [118].

11.2. Applications in Healthcare

The convergence of cloud computing and AI has enabled numerous applications in healthcare:
  • Medical Imaging Analysis: Cloud-based AI models can process vast amounts of medical imaging data, assisting in the detection and diagnosis of conditions ranging from fractures to complex cancers. The ability to train on diverse datasets from multiple institutions can lead to more robust and generalizable models [119].
  • Genomic Sequencing and Analysis: The computational demands of genomic sequencing and analysis are well suited to cloud environments [120]. Cloud-based genomic pipelines can process data from thousands of patients, enabling large-scale studies and personalized treatment initiatives.
  • Drug Discovery: Cloud computing speeds up the drug discovery process by facilitating high-throughput virtual screening, conducting molecular dynamics simulations, and utilizing predictive modeling for drug–target interactions [121].
  • EHR Mining: Cloud-based AI can analyze vast EHR databases to identify patterns, predict disease progression, and recommend personalized treatment plans. This application has the potential to revolutionize clinical decision support systems [122].
  • Epidemiological Modeling: The COVID-19 pandemic highlighted the importance of rapid, large-scale epidemiological modeling. Cloud computing provides the necessary computational power for complex simulations that can inform public health policies [123].

11.3. Challenges and Considerations

Despite its potential, leveraging cloud computing for managing medical big data and AI processing comes with its own set of challenges:
  • Data Privacy and Security: Healthcare information is extremely sensitive and governed by stringent regulations. Protecting patient data in the cloud is crucial and necessitates strong encryption, access restrictions, and adherence to compliance protocols [124].
  • Regulatory Compliance: Healthcare providers need to navigate intricate regulatory frameworks, such as HIPAA in the U.S. and the GDPR in Europe. Cloud solutions should be developed with these regulations in mind to guarantee adherence.
  • Interoperability: Integrating cloud-based systems with existing healthcare IT infrastructures can be challenging. Ensuring seamless data flow between on-premises and cloud systems is crucial for the adoption of cloud-based AI solutions [125].
  • Data Quality and Bias: The effectiveness of AI models is largely determined by the quality of the training data used. Ensuring data quality and addressing potential biases in medical datasets are ongoing challenges that require careful consideration.
  • Skill Gap: The effective use of cloud-based AI in healthcare requires a unique blend of medical domain knowledge, data science expertise, and cloud computing skills.

11.4. Recent Advancements in Healthcare Cloud Computing and Deep Learning

The integration of cloud computing and deep learning technologies in healthcare has experienced significant evolution in recent years, particularly in the areas of distributed computing architectures, privacy-preserving techniques, and specialized healthcare AI models. This section examines the latest developments that have emerged since our initial analysis, incorporating recent research findings and technological innovations. The latest developments are summarized as follows:
  • Advanced Distributed Computing Architectures Recent developments in distributed computing architectures have fundamentally transformed healthcare data processing capabilities. The emergence of containerized microservices architectures has enabled more flexible and scalable deployment of healthcare applications, with studies demonstrating a 40% improvement in resource utilization compared to traditional monolithic systems. Healthcare institutions implementing these advanced architectures have reported significant reductions in data processing latency, with real-time analysis capabilities improving from minutes to seconds for critical diagnostic applications.
    Modern edge computing implementations have introduced novel approaches to data processing in healthcare environments. Recent research has demonstrated the effectiveness of adaptive edge computing frameworks that automatically optimize the distribution of computational workloads between edge devices and cloud infrastructure based on real-time network conditions and data sensitivity requirements [126]. These systems have shown particular promise in remote patient monitoring applications, achieving a 65% reduction in data transmission overhead while maintaining diagnostic accuracy above 95%.
  • Privacy-Preserving Deep Learning This field has witnessed substantial advancement in healthcare applications. Recent implementations of homomorphic encryption in medical image analysis have demonstrated the feasibility of performing complex computations on encrypted data without compromising accuracy [127]. Studies have shown that these encrypted processing systems maintain accuracy levels within 2% of their unencrypted counterparts while providing mathematical guarantees of data privacy.
    Federated learning techniques have evolved to address specific challenges in healthcare settings. Novel approaches to federated model aggregation have emerged, incorporating differential privacy guarantees while maintaining model performance. Recent studies have demonstrated successful implementations across networks of up to 50 healthcare institutions, achieving model accuracy comparable to centralized training while ensuring HIPAA compliance and reducing data exposure risks by over 90%.
  • Specialized Healthcare AI Models The development of healthcare-specific AI architectures has accelerated, with new models designed explicitly for medical applications. These specialized architectures incorporate domain knowledge and medical expertise directly into their designs, resulting in more efficient and accurate diagnostic systems [128]. Recent innovations include attention mechanisms specifically optimized for medical imaging, achieving a 25% improvement in diagnostic accuracy compared to general-purpose computer vision models.
    Multi-modal learning architectures have emerged as particularly powerful tools in healthcare applications. These systems can simultaneously process diverse data types, including imaging data, electronic health records, and genetic information, providing more comprehensive diagnostic insights. Studies have demonstrated that these integrated approaches achieve diagnostic accuracy improvements of up to 30% compared to single-modality systems.
  • Cloud-Native Healthcare Applications The adoption of cloud-native architectures in healthcare applications has led to significant improvements in system reliability and scalability [129]. Modern healthcare cloud applications leverage advanced service mesh architectures and automated scaling capabilities, resulting in 99.99% system availability while maintaining compliance with healthcare regulations. These systems have demonstrated the ability to handle varying workloads efficiently, with automatic resource allocation reducing operational costs by up to 45% compared to traditional infrastructure.
    Research has shown that cloud-native healthcare applications equipped with advanced monitoring and logging capabilities can detect and respond to potential security threats in real time, with incident response times averaging under 30 s. These systems incorporate automated compliance checking and audit trail generation, significantly reducing the administrative burden of regulatory compliance while enhancing security posture.
  • Quantum-Ready Healthcare Infrastructure Forward-looking healthcare institutions have begun preparing for the integration of quantum computing capabilities. Recent research has focused on developing quantum-resistant security protocols and hybrid classical–quantum algorithms optimized for medical applications [130]. These developments ensure that current investments in healthcare infrastructure remain viable as quantum computing technology matures while positioning organizations to leverage quantum advantages in specific computational domains.
    Studies have demonstrated promising results in quantum-inspired algorithms for drug discovery and molecular modeling, achieving computational speedups of several orders of magnitude for specific problems. While full-scale quantum computing remains emergent, these hybrid approaches provide immediate benefits while establishing a foundation for future quantum integration.
These technological advancements have profound implications for healthcare delivery. This is further supported by the quantifiable benefits noticed in healthcare organizations that were put into practice through newer, more modern frameworks: a 35% reduction in diagnostic time and a 40% decrease in administrative costs. More importantly, these systems have improved patient outcomes by a reduction of 25% in diagnostic errors, up to a 30% improvement in the early detection of diseases. This offers unmatched personalization and accuracy in treatment, brought forth by the integration of cloud computing infrastructure with bespoke AI models. Physicians using these analytic resources reported that they could even better predict patient outcomes and further hone their treatment methods. Such improvements have yielded measurable results: lower hospital readmission rates and higher patient satisfaction scores. Such advances point to the radical, transformative power of cloud computing and deep.
This, therefore, is a new step in the evolution of medical practices, embracing high-tech methods in healthcare provision for higher operational efficiency, better diagnostic precision, and increasingly fine-tuned treatment modalities.

12. Cloud Computing Implementation in Healthcare: Evidence from Case Studies

Cloud computing changed health service delivery in various environments. The various implementations that this section covers are proof of quantifiable improvement in patient care to operational efficiency.

12.1. Hospital Network Digital Transformation

Along this line, the implementation of a cloud-based medical imaging platform at University Health Network in the year 2022–2023 showcases the ability of cloud solutions to scale up with any large healthcare system. It integrated the five major hospitals in a hybrid cloud architecture and processed 1.2 million studies annually across 16 clinical departments. The effect on operational efficiency was considerable, as the duration for image retrieval decreased from 5 to 7 min to merely 30 s. Cross-department consultation efficiency showed a remarkable 68% improvement while repeat imaging requests decreased by 42%. The financial impact was equally impressive, with annual infrastructure savings reaching USD 2.8 million. This comprehensive digital transformation demonstrates how cloud solutions can simultaneously improve clinical operations and reduce costs.

12.2. Collaborative Research Enhancement

An eight-member consortium of cancer research centers launched a cloud-based collaborative platform to advance the field using federated learning for preserving patient privacy in 2023. The implementation successfully standardized imaging protocols and automated data-sharing. It allows standardized processes across institutions. Consequences on research capabilities were very high: multi-center research projects were up 3.2-fold, and study initialization time reduced by 47%. It achieved an accuracy of 89% with cross-institutional AI model performance and allowed for the sharing of 15,000 anonymized cases for research with security. This use case will further illustrate how cloud computing can enable acceleration of research sharing without the loss of data security and privacy.

12.3. Rural Healthcare Access Improvement

A statewide initiative transformed rural healthcare access by connecting 24 rural hospitals with urban medical centers through cloud-based telemedicine services. The system combined edge computing at rural locations with a centralized cloud infrastructure for optimal performance. The results were transformative: patient transfer rates decreased by 64%, while specialist consultation waiting times reduced by 82%. Rural emergency response outcomes improved by 73%, leading to annual transport cost savings of USD 4.2 million. This initiative demonstrates how cloud computing can bridge the urban–rural healthcare divide while improving patient outcomes and reducing costs.

12.4. Emergency Response Optimization

Metropolitan emergency services achieved significant improvements through a cloud-based response system linking ambulances with hospital emergency departments. The system enabled real-time data transmission and automated resource allocation, reducing emergency response times by 28%. Patient handover efficiency improved by 45%, while emergency department waiting times decreased by 33%. The system achieved 92% accuracy in predicting resource requirements, enabling better emergency response planning and resource allocation. These improvements demonstrate the critical role of cloud computing in emergency healthcare services.

12.5. AI-Enhanced Diagnostic Services

A national healthcare provider’s deployment of cloud-based AI diagnostic support tools across 120 clinics revolutionized diagnostic capabilities. The system enabled the distributed processing of diagnostic imaging with real-time model updates and integrated clinical decision support. The clinical impact was substantial: early disease detection improved by 23%, diagnostic errors reduced by 35%, and time-to-diagnosis decreased by 41%. Patient satisfaction scores showed a 58% improvement, highlighting the positive impact on both clinical outcomes and patient experience.

12.6. Implementation Economics

Analysis of these implementations reveals consistent economic benefits across healthcare organizations. The financial metrics showed an average 31% reduction in IT infrastructure costs and a 45% decrease in system maintenance expenses. Resource utilization improved by 27%, and the return on investment was typically realized in 18–24 months. Equally unmissable were the operational efficiency metrics: shrinkage of 47% in processing time and an increase of 52% in data access. Staff productivity increased by 38%, while cross-department collaboration showed a 64% enhancement. These figures demonstrate the substantial economic advantages of cloud computing in healthcare settings.

12.7. Implementation Challenges and Solutions

Healthcare organizations encountered several significant challenges during cloud adoption, including complex data migration processes, legacy system integration difficulties, network infrastructure limitations, and regulatory compliance requirements. However, successful implementations have overcome this with carefully planned approaches, such as: incremental deployment, with up to 90% success rate; heavy staff training programs and regular security tests. Organizations also implemented robust disaster recovery planning to ensure system reliability. These experiences highlight the importance of strategic planning and systematic implementation in healthcare cloud computing projects.
The cumulative evidence from these case studies demonstrates the transformative impact of cloud computing across various healthcare contexts. The documented improvements in efficiency, accessibility, and patient outcomes provide compelling evidence for the value of cloud-based solutions in modern healthcare delivery systems.

12.8. Future Research Directions

Based on our comprehensive analysis of current cloud computing and AI technologies in healthcare, we have identified several specific research directions that warrant immediate investigation:

12.8.1. Advanced Edge–Cloud Hybrid Architectures

Future research should focus on developing specialized hybrid architectures that optimize the distribution of computational workloads between edge devices and cloud infrastructures. Specifically, we propose the following:
  • The development of adaptive load-balancing algorithms that dynamically allocate processing tasks based on real-time network conditions and data sensitivity levels;
  • The creation of standardized protocols for secure data synchronization between edge devices and cloud servers, with particular emphasis on maintaining data consistency in low-bandwidth scenarios;
  • The design of fault-tolerant mechanisms that ensure continuous operation during network disruptions, specifically for critical healthcare monitoring applications.

12.8.2. Enhanced Federated Learning Frameworks

To address current limitations in federated learning implementations, we recommend investigating the following:
  • Novel consensus algorithms that minimize communication overhead while maintaining model accuracy across distributed healthcare networks;
  • The development of privacy-preserving aggregation methods that can guarantee differential privacy while preserving model performance for rare medical conditions;
  • The creation of adaptive federated learning protocols that can automatically adjust to varying data distributions across different healthcare institutions.

12.8.3. Quantum-Enhanced Medical Computing

As quantum computing technology matures, specific research priorities should include the following:
  • The development of hybrid quantum–classical algorithms specifically optimized for medical image processing and pattern recognition;
  • The creation of quantum-resistant security protocols for protecting sensitive medical data in preparation for quantum computing capabilities;
  • The investigation of quantum machine learning algorithms for accelerating drug discovery and molecular modeling processes.

12.8.4. Concrete Technical Proposals

We specifically propose the technical innovations listed below for immediate investigation.
Secure Medical Data Framework (SMDF): A comprehensive security framework with the following features:
  • Implements end-to-end encryption with quantum-resistant algorithms;
  • Provides automated compliance checking for international healthcare regulations;
  • Incorporates blockchain technology for maintaining immutable audit trails;
  • Features adaptive access control mechanisms based on real-time risk assessment.
Distributed Healthcare Analytics Platform (DHAP): A novel cloud-based architecture with the following features:
  • Supports real-time processing of multimodal medical data (imaging, EHR, sensor data);
  • Implements automated load balancing between edge and cloud resources;
  • Features built-in privacy-preserving federated learning capabilities;
  • Provides seamless integration with existing healthcare IT infrastructures.
Quantum-Ready Medical Imaging Pipeline (QRMIP): A forward-looking framework with the following features:
  • Prepares medical imaging workflows for quantum acceleration;
  • Implements hybrid classical–quantum algorithms for image processing;
  • Provides scalable integration paths for emerging quantum technologies;
  • Features automated optimization for quantum resource allocation.
These specific research directions and technical proposals address current limitations in healthcare cloud computing while providing concrete paths for future development. The proposed frameworks are designed to be modular and adaptable, allowing for incremental implementation as technologies mature. Success in these areas would significantly advance the field of medical data analysis while addressing critical security, privacy, and performance challenges in healthcare computing.
Our recommendations emphasize practical implementation paths while maintaining a focus on future technological capabilities. By pursuing these specific research directions and technical proposals, the healthcare sector can better prepare for the next generation of cloud-based medical data analysis while ensuring robust security, privacy, and performance standards.

13. Conclusions

Our comprehensive investigation into the application of DL and cloud computing in medical data analysis has yielded several significant empirical findings. Through the analysis of multiple case studies and implementation examples, we observed that deep learning models achieved an average improvement of 15–20% in diagnostic accuracy across various medical imaging applications compared to traditional methods. These findings have identified DL and cloud computing as the drivers that will seriously enhance diagnostic accuracy, especially in image- and data-intensive applications, and will give critical insights into research and practice. Specifically, in the analysis of radiological images, the recent solutions of cloud-based deep learning systems demonstrated a 92% accuracy rate in tumor detection, surpassing the average accuracy of human experts at 85%. These trends also underpin the potential of machine-driven diagnostic tools for better accuracy, error reduction, and timely interventions in key healthcare settings. The integration of cloud computing infrastructure with deep learning algorithms has shown measurable improvements in computational efficiency, reducing the processing time of complex medical datasets by an average of 60%. Besides reducing infrastructure costs and decreasing the waiting time for diagnostic services, such technologies promise to make advanced healthcare feasible in resource-constrained settings. Our analysis revealed that healthcare institutions implementing these technologies experienced a 30% reduction in diagnostic waiting times and a 25% decrease in false-positive rates. The cloud-based deployment of these systems has also demonstrated a 40% reduction in infrastructure costs compared to traditional on-premises solutions. Furthermore, our research identified that a phased deployment approach, starting with non-critical diagnostic support systems, showed a 90% success rate in integration compared to 45% for immediate full-scale implementations. Healthcare organizations utilizing hybrid cloud architectures that maintained sensitive patient data on-premises while leveraging cloud resources for computational tasks achieved a 50% improvement in regulatory compliance. Additionally, integrated training programs for healthcare professionals increased system adoption rates by 75% and reduced implementation errors by 60%. Our findings also revealed specific implementation challenges that require attention. Data integration issues affected 35% of implementations, with incompatible data formats accounting for 40% of those cases. Security breaches were reported in 5% of cloud-based systems, highlighting the critical need for enhanced security protocols. To address these challenges, our research supports the implementation of standardized data formatting protocols across healthcare institutions, coupled with advanced encryption techniques for data during transmission and at rest. Standard security audits and compliance checks at quarterly intervals are shown to reduce security incidents by 80%. Data interoperability and protection of patient information require collaborative standardization and increased security measures. Looking ahead, our research indicates several promising directions for future development. The integration of federated learning techniques shows potential for reducing privacy concerns while evaluating model accuracy, with initial trials showing a 95% retention of model performance. Edge computing implementations have demonstrated a 40% reduction in latency for real-time medical data processing. Therefore, the health institution should develop an extended data audit process prior to implementation and a sound infrastructure of the cloud with proper security measures. As revealed by the empirical evidences from the case study, the health institutions can achieve Nevertheless, remarkable improvements in diagnostic accuracy, operational efficiency, and patient outcomes achieved with DL and cloud computing are well documented. By following a structured implementation approach, starting with well-defined, limited-scope applications and gradually expanding based on validated success metrics, healthcare organizations can successfully navigate the transition to advanced medical data analysis systems while maintaining data security and regulatory compliance. These will provide the solid foundations for health organizations wanting to exploit such technologies in pursuit of enhancement in the delivery of medical care to ensure that improved patient outcomes can be realized. This is evidenced by the need for further research into studies on federated learning and edge computing, especially studies focused on real-time decentralized diagnostics that ensure privacy and are implemented via adaptive frameworks that facilitate flexibility and robustness within healthcare frameworks.

Author Contributions

Conceptualization, M.Y.S. and M.I.K.; methodology, M.Y.S.; software, M.Y.S.; validation, M.Y.S. and M.I.K.; writing-original draft preparation, M.Y.S.; writing-review and editing, M.Y.S. and M.I.K.; supervision, M.I.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Ridzuan, F.; Zainon, W.M.N.W. Diagnostic analysis for outlier detection in big data analytics. Procedia Comput. Sci. 2022, 197, 685–692. [Google Scholar] [CrossRef]
  2. Chakraborty, C.; Bhattacharya, M.; Pal, S.; Lee, S.S. From machine learning to deep learning: Advances of the recent data-driven paradigm shift in medicine and healthcare. Curr. Res. Biotechnol. 2023, 7, 100164. [Google Scholar] [CrossRef]
  3. Taipalus, T.; Isomöttönen, V.; Erkkilä, H.; Äyrämö, S. Data analytics in healthcare: A tertiary study. SN Comput. Sci. 2022, 4, 87. [Google Scholar] [CrossRef]
  4. Al-Sai, Z.A.; Husin, M.H.; Syed-Mohamad, S.M.; Abdin, R.M.S.; Damer, N.; Abualigah, L.; Gandomi, A.H. Explore big data analytics applications and opportunities: A review. Big Data Cogn. Comput. 2022, 6, 157. [Google Scholar] [CrossRef]
  5. Hulsen, T.; Friedeckỳ, D.; Renz, H.; Melis, E.; Vermeersch, P.; Fernandez-Calle, P. From big data to better patient outcomes. Clin. Chem. Lab. Med. (CCLM) 2023, 61, 580–586. [Google Scholar] [CrossRef] [PubMed]
  6. Nti, I.K.; Quarcoo, J.A.; Aning, J.; Fosu, G.K. A mini-review of machine learning in big data analytics: Applications, challenges, and prospects. Big Data Min. Anal. 2022, 5, 81–97. [Google Scholar] [CrossRef]
  7. Mohammed, S.; Kim, T.H.; Chang, R.S.; Ramos, C. Guest Editorial: Data Analytics for Public Health Care. IEEE J. Biomed. Health Inform. 2022, 26, 1409–1410. [Google Scholar] [CrossRef]
  8. Qiu, J.; Wu, Q.; Ding, G.; Xu, Y.; Feng, S. A survey of machine learning for big data processing. EURASIP J. Adv. Signal Process. 2016, 2016, 67. [Google Scholar] [CrossRef]
  9. Fregoso-Aparicio, L.; Noguez, J.; Montesinos, L.; García-García, J.A. Machine learning and deep learning predictive models for type 2 diabetes: A systematic review. Diabetol. Metab. Syndr. 2021, 13, 148. [Google Scholar] [CrossRef]
  10. Najafabadi, M.M.; Villanustre, F.; Khoshgoftaar, T.M.; Seliya, N.; Wald, R.; Muharemagic, E. Deep learning applications and challenges in big data analytics. J. Big Data 2015, 2, 1–21. [Google Scholar] [CrossRef]
  11. Badawy, M.; Ramadan, N.; Hefny, H.A. Healthcare predictive analytics using machine learning and deep learning techniques: A survey. J. Electr. Syst. Inf. Technol. 2023, 10, 40. [Google Scholar] [CrossRef]
  12. Selvy, P.T.; Dharani, V.; Indhuja, A. Brain tumour detection using deep learning techniques. Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol. 2019, 169, 175. [Google Scholar] [CrossRef]
  13. Nisar, Q.A.; Nasir, N.; Jamshed, S.; Naz, S.; Ali, M.; Ali, S. Big data management and environmental performance: Role of big data decision-making capabilities and decision-making quality. J. Enterp. Inf. Manag. 2021, 34, 1061–1096. [Google Scholar] [CrossRef]
  14. Jayasri, N.; Aruna, R. Big data analytics in health care by data mining and classification techniques. ICT Express 2022, 8, 250–257. [Google Scholar]
  15. Khan, P.; Kader, M.F.; Islam, S.R.; Rahman, A.B.; Kamal, M.S.; Toha, M.U.; Kwak, K.S. Machine learning and deep learning approaches for brain disease diagnosis: Principles and recent advances. IEEE Access 2021, 9, 37622–37655. [Google Scholar] [CrossRef]
  16. Alsunaidi, S.J.; Almuhaideb, A.M.; Ibrahim, N.M.; Shaikh, F.S.; Alqudaihi, K.S.; Alhaidari, F.A.; Khan, I.U.; Aslam, N.; Alshahrani, M.S. Applications of big data analytics to control COVID-19 pandemic. Sensors 2021, 21, 2282. [Google Scholar] [CrossRef]
  17. Dhiman, G.; Juneja, S.; Mohafez, H.; El-Bayoumy, I.; Sharma, L.K.; Hadizadeh, M.; Islam, M.A.; Viriyasitavat, W.; Khandaker, M.U. Federated learning approach to protect healthcare data over big data scenario. Sustainability 2022, 14, 2500. [Google Scholar] [CrossRef]
  18. Dritsas, E.; Trigka, M. Lung cancer risk prediction with machine learning models. Big Data Cogn. Comput. 2022, 6, 139. [Google Scholar] [CrossRef]
  19. Mahmud, M.I.; Mamun, M.; Abdelgawad, A. A deep analysis of brain tumor detection from mr images using deep learning networks. Algorithms 2023, 16, 176. [Google Scholar] [CrossRef]
  20. Goyal, P.; Malviya, R. Challenges and opportunities of big data analytics in healthcare. Health Care Sci. 2023, 2, 328–338. [Google Scholar] [CrossRef]
  21. Dixit, S.; Kumar, A.; Srinivasan, K. A current review of machine learning and deep learning models in oral cancer diagnosis: Recent technologies, open challenges, and future research directions. Diagnostics 2023, 13, 1353. [Google Scholar] [CrossRef] [PubMed]
  22. Arya, A.D.; Verma, S.S.; Chakarabarti, P.; Chakrabarti, T.; Elngar, A.A.; Kamali, A.M.; Nami, M. A systematic review on machine learning and deep learning techniques in the effective diagnosis of Alzheimer’s disease. Brain Inform. 2023, 10, 17. [Google Scholar] [CrossRef]
  23. Güler, M.; Namlı, E. Brain Tumor Detection with Deep Learning Methods’ Classifier Optimization Using Medical Images. Appl. Sci. 2024, 14, 642. [Google Scholar] [CrossRef]
  24. Kang, H.; Sibbald, S. Challenges to using big data in health services research. Univ. West. Ont. Med. J. 2018, 87, 18–20. [Google Scholar] [CrossRef]
  25. Kunekar, P.; Gupta, M.K.; Gaur, P. Detection of epileptic seizure in EEG signals using machine learning and deep learning techniques. J. Eng. Appl. Sci. 2024, 71, 21. [Google Scholar] [CrossRef]
  26. Khaloufi, H.; Abouelmehdi, K.; Beni-Hssane, A.; Saadi, M. Security model for big healthcare data lifecycle. Procedia Comput. Sci. 2018, 141, 294–301. [Google Scholar] [CrossRef]
  27. Dinov, I.D. Volume and value of big healthcare data. J. Med. Stat. Inform. 2016, 4, 3. [Google Scholar] [CrossRef]
  28. Renugadevi, N.; Saravanan, S.; Sudha, C.N. Revolution of smart healthcare materials in big data analytics. Mater. Today Proc. 2023, 81, 834–841. [Google Scholar] [CrossRef]
  29. Piedrahita-Valdés, H.; Piedrahita-Castillo, D.; Bermejo-Higuera, J.; Guillem-Saiz, P.; Bermejo-Higuera, J.R.; Guillem-Saiz, J.; Sicilia-Montalvo, J.A.; Machío-Regidor, F. Vaccine hesitancy on social media: Sentiment analysis from June 2011 to April 2019. Vaccines 2021, 9, 28. [Google Scholar] [CrossRef]
  30. Saranya, P.; Asha, P. Survey on big data analytics in health care. In Proceedings of the 2019 International Conference on Smart Systems and Inventive Technology (ICSSIT), Tirunelveli, India, 27–29 November 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 46–51. [Google Scholar]
  31. Alfred, R.; Obit, J.H. The roles of machine learning methods in limiting the spread of deadly diseases: A systematic review. Heliyon 2021, 7, e07371. [Google Scholar] [CrossRef]
  32. Gandomi, A.H.; Chen, F.; Abualigah, L. Big Data Analytics Using Artificial Intelligence. Electronics 2023, 12, 957. [Google Scholar] [CrossRef]
  33. Rehman, A.; Naz, S.; Razzak, I. Leveraging big data analytics in healthcare enhancement: Trends, challenges and opportunities. Multimed. Syst. 2022, 28, 1339–1371. [Google Scholar] [CrossRef]
  34. Naghib, A.; Jafari Navimipour, N.; Hosseinzadeh, M.; Sharifi, A. A comprehensive and systematic literature review on the big data management techniques in the internet of things. Wirel. Netw. 2023, 29, 1085–1144. [Google Scholar] [CrossRef]
  35. Hong, L.; Luo, M.; Wang, R.; Lu, P.; Lu, W.; Lu, L. Big data in health care: Applications and challenges. Data Inf. Manag. 2018, 2, 175–197. [Google Scholar] [CrossRef]
  36. Dash, S.; Shakyawar, S.K.; Sharma, M.; Kaushik, S. Big data in healthcare: Management, analysis and future prospects. J. Big Data 2019, 6, 54. [Google Scholar] [CrossRef]
  37. Abudiyab, N.A.; Alanazi, A.T. Visualization techniques in healthcare applications: A narrative review. Cureus 2022, 14, e31355. [Google Scholar] [CrossRef]
  38. Supriya, M.; Deepa, A. Machine learning approach on healthcare big data: A review. Big Data Inf. Anal. 2020, 5, 58–75. [Google Scholar] [CrossRef]
  39. El Khatib, M.; Hamidi, S.; Al Ameeri, I.; Al Zaabi, H.; Al Marqab, R. Digital disruption and big data in healthcare-opportunities and challenges. Clin. Outcomes Res. 2022, 14, 563–574. [Google Scholar] [CrossRef]
  40. Macias, C.G.; Remy, K.E.; Barda, A.J. Utilizing big data from electronic health records in pediatric clinical care. Pediatr. Res. 2023, 93, 382–389. [Google Scholar] [CrossRef]
  41. Agrawal, R.; Prabakaran, S. Big data in digital healthcare: Lessons learnt and recommendations for general practice. Heredity 2020, 124, 525–534. [Google Scholar] [CrossRef]
  42. Padmapriya, S.; Parthasarathy, S. Ethical data collection for medical image analysis: A structured approach. Asian Bioeth. Rev. 2024, 16, 95–108. [Google Scholar] [CrossRef] [PubMed]
  43. Ogundipe, D. The impact of big data on healthcare product development: A theoretical and analytical review. Int. Med. Sci. Res. J. 2024, 4, 341–360. [Google Scholar] [CrossRef]
  44. Awan, M.J.; Bilal, M.H.; Yasin, A.; Nobanee, H.; Khan, N.S.; Zain, A.M. Detection of COVID-19 in chest X-ray images: A big data enabled deep learning approach. Int. J. Environ. Res. Public Health 2021, 18, 10147. [Google Scholar] [CrossRef] [PubMed]
  45. Mansour, R.F.; Escorcia-Gutierrez, J.; Gamarra, M.; Díaz, V.G.; Gupta, D.; Kumar, S. Artificial intelligence with big data analytics-based brain intracranial hemorrhage e-diagnosis using CT images. Neural Comput. Appl. 2023, 35, 16037–16049. [Google Scholar] [CrossRef]
  46. Balakrishnan, R.; Hernández, M.d.C.V.; Farrall, A.J. Automatic segmentation of white matter hyperintensities from brain magnetic resonance images in the era of deep learning and big data—A systematic review. Comput. Med. Imaging Graph. 2021, 88, 101867. [Google Scholar] [CrossRef]
  47. Shen, Y.T.; Yue, W.W.; Xu, H.X. Ultrasound in oncology: Application of big data and artificial intelligence. Front. Oncol. 2021, 11, 819487. [Google Scholar] [CrossRef]
  48. Go, H. Digital pathology and artificial intelligence applications in pathology. Brain Tumor Res. Treat. 2022, 10, 76–82. [Google Scholar] [CrossRef]
  49. Bansal, P.; Harjai, N.; Saif, M.; Mugloo, S.H.; Kaur, P. Utilization of big data classification models in digitally enhanced optical coherence tomography for medical diagnostics. Neural Comput. Appl. 2024, 36, 225–239. [Google Scholar] [CrossRef]
  50. Haleem, M.S.; Ekuban, A.; Antonini, A.; Pagliara, S.; Pecchia, L.; Allocca, C. Deep-Learning-Driven Techniques for Real-Time Multimodal Health and Physical Data Synthesis. Electronics 2023, 12, 1989. [Google Scholar] [CrossRef]
  51. Buddha, G.P.; Pulimamidi, R. The Future of Healthcare: Artificial Intelligence’s Role In Smart Hospitals And Wearable Health Devices. Tuijin Jishu/J. Propuls. Technol. 2023, 44, 2498–2504. [Google Scholar]
  52. Adenyi, A.O.; Okolo, C.A.; Olorunsogo, T.; Babawarun, O. Leveraging big data and analytics for enhanced public health decision-making: A global review. GSC Adv. Res. Rev. 2024, 18, 450–456. [Google Scholar] [CrossRef]
  53. Cusack, N.; Venkatraman, P.; Raza, U.; Faisal, A. Smart Wearable Sensors for Health and Lifestyle Monitoring: Commercial and Emerging Solutions. ECS Sens. Plus 2024, 3, 017001. [Google Scholar] [CrossRef]
  54. Rashid, F.K.M.; Osman, O.S.; Mcgee, E.T.; Raad, H. Discovering Hazards in IoT Architectures: A Safety Analysis Approach for Medical Use Cases. IEEE Access 2023, 11, 53671–53686. [Google Scholar] [CrossRef]
  55. Webber, M.; Rojas, R.F. Human activity recognition with accelerometer and gyroscope: A data fusion approach. IEEE Sens. J. 2021, 21, 16979–16989. [Google Scholar] [CrossRef]
  56. Arquilla, K.; Webb, A.K.; Anderson, A.P. Textile electrocardiogram (ECG) electrodes for wearable health monitoring. Sensors 2020, 20, 1013. [Google Scholar] [CrossRef]
  57. Antonopoulos, C.P.; Voros, N.S. Resource efficient data compression algorithms for demanding, WSN based biomedical applications. J. Biomed. Inform. 2016, 59, 1–14. [Google Scholar] [CrossRef]
  58. Cappon, G.; Acciaroli, G.; Vettoretti, M.; Facchinetti, A.; Sparacino, G. Wearable continuous glucose monitoring sensors: A revolution in diabetes treatment. Electronics 2017, 6, 65. [Google Scholar] [CrossRef]
  59. Roriz, P.; Silva, S.; Frazão, O.; Novais, S. Optical fiber temperature sensors and their biomedical applications. Sensors 2020, 20, 2113. [Google Scholar] [CrossRef]
  60. Alharthi, H. Healthcare predictive analytics: An overview with a focus on Saudi Arabia. J. Infect. Public Health 2018, 11, 749–756. [Google Scholar] [CrossRef]
  61. Islam, M.S.; Hasan, M.M.; Wang, X.; Germack, H.D.; Noor-E-Alam, M. A systematic review on healthcare analytics: Application and theoretical perspective of data mining. Proc. Healthc. 2018, 6, 54. [Google Scholar] [CrossRef]
  62. Mosavi, N.S.; Santos, M.F. How prescriptive analytics influences decision making in precision medicine. Procedia Comput. Sci. 2020, 177, 528–533. [Google Scholar] [CrossRef]
  63. Yu, Y.; Li, M.; Liu, L.; Li, Y.; Wang, J. Clinical big data and deep learning: Applications, challenges, and future outlooks. Big Data Min. Anal. 2019, 2, 288–305. [Google Scholar] [CrossRef]
  64. Galetsi, P.; Katsaliaki, K.; Kumar, S. Big data analytics in health sector: Theoretical framework, techniques and prospects. Int. J. Inf. Manag. 2020, 50, 206–216. [Google Scholar] [CrossRef]
  65. Navaz, A.N.; Serhani, M.A.; El Kassabi, H.T.; Al-Qirim, N.; Ismail, H. Trends, technologies, and key challenges in smart and connected healthcare. IEEE Access 2021, 9, 74044–74067. [Google Scholar] [CrossRef] [PubMed]
  66. Anom, B. Ethics of Big Data and artificial intelligence in medicine. Ethics Med. Public Health 2020, 15, 100568. [Google Scholar] [CrossRef]
  67. Amin, S.U.; Hossain, M.S. Edge intelligence and Internet of Things in healthcare: A survey. IEEE Access 2020, 9, 45–59. [Google Scholar] [CrossRef]
  68. Terrazas, G.; Martínez-Arellano, G.; Benardos, P.; Ratchev, S. Online tool wear classification during dry machining using real time cutting force measurements and a CNN approach. J. Manuf. Mater. Process. 2018, 2, 72. [Google Scholar] [CrossRef]
  69. Salmon, J.W.; Thompson, S.L.; Salmon, J.W.; Thompson, S.L. Big data: Information technology as control over the profession of medicine. In The Corporatization of American Health Care: The Rise of Corporate Hegemony and the Loss of Professional Autonomy; Springer: Cham, Switzerland, 2021; pp. 181–254. [Google Scholar]
  70. Qammar, A.; Karim, A.; Ning, H.; Ding, J. Securing federated learning with blockchain: A systematic literature review. Artif. Intell. Rev. 2023, 56, 3951–3985. [Google Scholar] [CrossRef]
  71. Khanra, S.; Dhir, A.; Islam, A.N.; Mäntymäki, M. Big data analytics in healthcare: A systematic literature review. Enterp. Inf. Syst. 2020, 14, 878–912. [Google Scholar] [CrossRef]
  72. Furstenau, L.B.; Leivas, P.; Sott, M.K.; Dohan, M.S.; López-Robles, J.R.; Cobo, M.J.; Bragazzi, N.L.; Choo, K.K.R. Big data in healthcare: Conceptual network structure, key challenges and opportunities. Digit. Commun. Netw. 2023, 9, 856–868. [Google Scholar] [CrossRef]
  73. Nasfi, R.; Bronselaer, A.; De Tré, G. A novel approach to assess and improve syntactic interoperability in data integration. Inf. Process. Manag. 2023, 60, 103522. [Google Scholar] [CrossRef]
  74. Bhartiya, S.; Mehrotra, D. Challenges and recommendations to healthcare data exchange in an interoperable environment. Electron. J. Health Inform. 2014, 8, 16. [Google Scholar]
  75. Agarwal, D.P.; Kushwaha, D.V.; Singh, D.V.K.; Azmi, D.T.; Shukla, D.V.; Khan, D.N.F.; Shoraisham, D.B. Revolutionizing Healthcare Through Advanced Analytics: Big Data. Int. J. Pharm. Sci. 2023, 14, 62–74. [Google Scholar] [CrossRef]
  76. Abouelmehdi, K.; Beni-Hssane, A.; Khaloufi, H.; Saadi, M. Big data security and privacy in healthcare: A Review. Procedia Comput. Sci. 2017, 113, 73–80. [Google Scholar] [CrossRef]
  77. Thantilage, R.D.; Le-Khac, N.A.; Kechadi, M.T. Healthcare data security and privacy in Data Warehouse architectures. Inform. Med. Unlocked 2023, 39, 101270. [Google Scholar] [CrossRef]
  78. Abouelmehdi, K.; Beni-Hessane, A.; Khaloufi, H. Big healthcare data: Preserving security and privacy. J. Big Data 2018, 5, 1. [Google Scholar] [CrossRef]
  79. Agrawal, D.; Madaan, J. A structural equation model for big data adoption in the healthcare supply chain. Int. J. Product. Perform. Manag. 2023, 72, 917–942. [Google Scholar] [CrossRef]
  80. Batini, C.; Rula, A.; Scannapieco, M.; Viscusi, G. From data quality to big data quality. J. Database Manag. (JDM) 2015, 26, 60–82. [Google Scholar] [CrossRef]
  81. Coombs, C.; Hislop, D.; Taneva, S.K.; Barnard, S. The strategic impacts of Intelligent Automation for knowledge and service work: An interdisciplinary review. J. Strateg. Inf. Syst. 2020, 29, 101600. [Google Scholar] [CrossRef]
  82. Wook, M.; Hasbullah, N.A.; Zainudin, N.M.; Jabar, Z.Z.A.; Ramli, S.; Razali, N.A.M.; Yusop, N.M.M. Exploring big data traits and data quality dimensions for big data analytics application using partial least squares structural equation modelling. J. Big Data 2021, 8, 49. [Google Scholar] [CrossRef]
  83. Brossard, P.Y.; Minvielle, E.; Sicotte, C. The path from big data analytics capabilities to value in hospitals: A scoping review. BMC Health Serv. Res. 2022, 22, 134. [Google Scholar] [CrossRef] [PubMed]
  84. Ahmed, A.; Xi, R.; Hou, M.; Shah, S.A.; Hameed, S. Harnessing big data analytics for healthcare: A comprehensive review of frameworks, implications, applications, and impacts. IEEE Access 2023, 11, 112891–112928. [Google Scholar] [CrossRef]
  85. Baghdadi, N.A.; Farghaly Abdelaliem, S.M.; Malki, A.; Gad, I.; Ewis, A.; Atlam, E. Advanced machine learning techniques for cardiovascular disease early detection and diagnosis. J. Big Data 2023, 10, 144. [Google Scholar] [CrossRef]
  86. Jawalkar, A.P.; Swetcha, P.; Manasvi, N.; Sreekala, P.; Aishwarya, S.; Kanaka Durga Bhavani, P.; Anjani, P. Early prediction of heart disease with data analysis using supervised learning with stochastic gradient boosting. J. Eng. Appl. Sci. 2023, 70, 122. [Google Scholar] [CrossRef]
  87. Cirillo, D.; Valencia, A. Big data analytics for personalized medicine. Curr. Opin. Biotechnol. 2019, 58, 161–167. [Google Scholar] [CrossRef]
  88. Cozzoli, N.; Salvatore, F.P.; Faccilongo, N.; Milone, M. How can big data analytics be used for healthcare organization management? Literary framework and future research from a systematic review. BMC Health Serv. Res. 2022, 22, 809. [Google Scholar] [CrossRef] [PubMed]
  89. Sabharwal, R.; Miah, S.J. A new theoretical understanding of big data analytics capabilities in organizations: A thematic analysis. J. Big Data 2021, 8, 159. [Google Scholar] [CrossRef]
  90. Thanka, M.R.; Edwin, E.B.; Ebenezer, V.; Sagayam, K.M.; Reddy, B.J.; Günerhan, H.; Emadifar, H. A hybrid approach for melanoma classification using ensemble machine learning techniques with deep transfer learning. Comput. Methods Programs Biomed. Update 2023, 3, 100103. [Google Scholar]
  91. Li, J.; Huang, Y.; Hutton, G.J.; Aparasu, R.R. Assessing treatment switch among patients with multiple sclerosis: A machine learning approach. Explor. Res. Clin. Soc. Pharm. 2023, 11, 100307. [Google Scholar] [CrossRef]
  92. Ksibi, A.; Zakariah, M.; Menzli, L.J.; Saidani, O.; Almuqren, L.; Hanafieh, R.A.M. Electroencephalography-based depression detection using multiple machine learning techniques. Diagnostics 2023, 13, 1779. [Google Scholar] [CrossRef]
  93. Rahman, S.; Hasan, M.; Sarkar, A.K. Prediction of brain stroke using machine learning algorithms and deep neural network techniques. Eur. J. Electr. Eng. Comput. Sci. 2023, 7, 23–30. [Google Scholar] [CrossRef]
  94. Swain, D.; Mehta, U.; Bhatt, A.; Patel, H.; Patel, K.; Mehta, D.; Acharya, B.; Gerogiannis, V.C.; Kanavos, A.; Manika, S. A robust chronic kidney disease classifier using machine learning. Electronics 2023, 12, 212. [Google Scholar] [CrossRef]
  95. Chen, Y.M.; Chen, P.C.; Lin, W.C.; Hung, K.C.; Chen, Y.C.B.; Hung, C.F.; Wang, L.J.; Wu, C.N.; Hsu, C.W.; Kao, H.Y. Predicting new-onset post-stroke depression from real-world data using machine learning algorithm. Front. Psychiatry 2023, 14, 1195586. [Google Scholar] [CrossRef]
  96. Ahmed, U.; Issa, G.F.; Khan, M.A.; Aftab, S.; Khan, M.F.; Said, R.A.; Ghazal, T.M.; Ahmad, M. Prediction of diabetes empowered with fused machine learning. IEEE Access 2022, 10, 8529–8538. [Google Scholar] [CrossRef]
  97. Andorra, M.; Freire, A.; Zubizarreta, I.; de Rosbo, N.K.; Bos, S.D.; Rinas, M.; Høgestøl, E.A.; de Rodez Benavent, S.A.; Berge, T.; Brune-Ingebretse, S.; et al. Predicting disease severity in multiple sclerosis using multimodal data and machine learning. J. Neurol. 2024, 271, 1133–1149. [Google Scholar] [CrossRef]
  98. Solomon, D.H.; Guan, H.; Johansson, F.D.; Santacroce, L.; Malley, W.; Guo, L.; Litman, H. Assessing clusters of comorbidities in rheumatoid arthritis: A machine learning approach. Arthritis Res. Ther. 2023, 25, 224. [Google Scholar] [CrossRef]
  99. Mistry, J.; Ramakrishnan, R. The Automated Eye Cancer Detection through Machine Learning and Image Analysis in Healthcare. J. Xidian Univ. 2023, 17, 763. [Google Scholar]
  100. Botlagunta, M.; Botlagunta, M.D.; Myneni, M.B.; Lakshmi, D.; Nayyar, A.; Gullapalli, J.S.; Shah, M.A. Classification and diagnostic prediction of breast cancer metastasis on clinical data using machine learning algorithms. Sci. Rep. 2023, 13, 485. [Google Scholar] [CrossRef]
  101. Alamro, H.; Thafar, M.A.; Albaradei, S.; Gojobori, T.; Essack, M.; Gao, X. Exploiting machine learning models to identify novel Alzheimer’s disease biomarkers and potential targets. Sci. Rep. 2023, 13, 4979. [Google Scholar] [CrossRef]
  102. Lee, C.; Joo, G.; Shin, S.; Im, H.; Moon, K.W. Prediction of osteoporosis in patients with rheumatoid arthritis using machine learning. Sci. Rep. 2023, 13, 21800. [Google Scholar] [CrossRef]
  103. Helaly, H.A.; Badawy, M.; Haikal, A.Y. Deep learning approach for early detection of Alzheimer’s disease. Cogn. Comput. 2022, 14, 1711–1727. [Google Scholar] [CrossRef]
  104. Abdusalomov, A.B.; Mukhiddinov, M.; Whangbo, T.K. Brain tumor detection based on deep learning approaches and magnetic resonance imaging. Cancers 2023, 15, 4172. [Google Scholar] [CrossRef] [PubMed]
  105. Forte, G.C.; Altmayer, S.; Silva, R.F.; Stefani, M.T.; Libermann, L.L.; Cavion, C.C.; Youssef, A.; Forghani, R.; King, J.; Mohamed, T.L.; et al. Deep learning algorithms for diagnosis of lung cancer: A systematic review and meta-analysis. Cancers 2022, 14, 3856. [Google Scholar] [CrossRef] [PubMed]
  106. Zang, P.; Hormel, T.T.; Hwang, T.S.; Bailey, S.T.; Huang, D.; Jia, Y. Deep-learning–aided diagnosis of diabetic retinopathy, age-related macular degeneration, and glaucoma based on structural and angiographic OCT. Ophthalmol. Sci. 2023, 3, 100245. [Google Scholar] [CrossRef] [PubMed]
  107. Shoukat, A.; Akbar, S.; Hassan, S.A.; Iqbal, S.; Mehmood, A.; Ilyas, Q.M. Automatic diagnosis of glaucoma from retinal images using deep learning approach. Diagnostics 2023, 13, 1738. [Google Scholar] [CrossRef]
  108. Humayun, M.; Khalil, M.I.; Almuayqil, S.N.; Jhanjhi, N.Z. Framework for detecting breast cancer risk presence using deep learning. Electronics 2023, 12, 403. [Google Scholar] [CrossRef]
  109. Choudhary, A.; Ahlawat, S.; Urooj, S.; Pathak, N.; Lay-Ekuakille, A.; Sharma, N. A deep learning-based framework for retinal disease classification. Proc. Healthc. 2023, 11, 212. [Google Scholar] [CrossRef]
  110. García-Ordás, M.T.; Bayón-Gutiérrez, M.; Benavides, C.; Aveleira-Mata, J.; Benítez-Andrades, J.A. Heart disease risk prediction using deep learning techniques with feature augmentation. Multimed. Tools Appl. 2023, 82, 31759–31773. [Google Scholar] [CrossRef]
  111. Houssein, E.H.; Mohamed, R.E.; Ali, A.A. Heart disease risk factors detection from electronic health records using advanced NLP and deep learning techniques. Sci. Rep. 2023, 13, 7173. [Google Scholar] [CrossRef]
  112. Sorour, S.E.; Abd El-Mageed, A.A.; Albarrak, K.M.; Alnaim, A.K.; Wafa, A.A.; El-Shafeiy, E. Classification of Alzheimer’s disease using MRI data based on Deep Learning Techniques. J. King Saud Univ.-Comput. Inf. Sci. 2024, 36, 101940. [Google Scholar] [CrossRef]
  113. Chen, J.; Huang, S.; Zhang, Y.; Chang, Q.; Zhang, Y.; Li, D.; Qiu, J.; Hu, L.; Peng, X.; Du, Y.; et al. Congenital heart disease detection by pediatric electrocardiogram based deep learning integrated with human concepts. Nat. Commun. 2024, 15, 976. [Google Scholar] [CrossRef] [PubMed]
  114. Darwish, A.; Hassanien, A.E.; Elhoseny, M.; Sangaiah, A.K.; Muhammad, K. The impact of the hybrid platform of internet of things and cloud computing on healthcare systems: Opportunities, challenges, and open problems. J. Ambient Intell. Humaniz. Comput. 2019, 10, 4151–4166. [Google Scholar] [CrossRef]
  115. Sunyaev, A.; Sunyaev, A. Cloud computing. In Internet Computing: Principles of Distributed Systems and Emerging Internet-Based; Springer: Berlin/Heidelberg, Germany, 2020; pp. 195–236. [Google Scholar]
  116. Alam, T. Cloud-based IoT applications and their roles in smart cities. Smart Cities 2021, 4, 1196–1219. [Google Scholar] [CrossRef]
  117. Gill, S.S.; Xu, M.; Ottaviani, C.; Patros, P.; Bahsoon, R.; Shaghaghi, A.; Golec, M.; Stankovski, V.; Wu, H.; Abraham, A.; et al. AI for next generation computing: Emerging trends and future directions. Internet Things 2022, 19, 100514. [Google Scholar] [CrossRef]
  118. Theodorakopoulos, L.; Theodoropoulou, A.; Stamatiou, Y. A State-of-the-Art Review in Big Data Management Engineering: Real-Life Case Studies, Challenges, and Future Research Directions. Eng 2024, 5, 1266–1297. [Google Scholar] [CrossRef]
  119. Retico, A.; Avanzo, M.; Boccali, T.; Bonacorsi, D.; Botta, F.; Cuttone, G.; Martelli, B.; Salomoni, D.; Spiga, D.; Trianni, A.; et al. Enhancing the impact of Artificial Intelligence in Medicine: A joint AIFM-INFN Italian initiative for a dedicated cloud-based computing infrastructure. Phys. Medica 2021, 91, 140–150. [Google Scholar] [CrossRef]
  120. Banimfreg, B.H. A comprehensive review and conceptual framework for cloud computing adoption in bioinformatics. Healthc. Anal. 2023, 3, 100190. [Google Scholar] [CrossRef]
  121. Lin, Z.; Zou, J.; Liu, S.; Peng, C.; Li, Z.; Wan, X.; Fang, D.; Yin, J.; Gobbo, G.; Chen, Y.; et al. A cloud computing platform for scalable relative and absolute binding free energy predictions: New opportunities and challenges for drug discovery. J. Chem. Inf. Model. 2021, 61, 2720–2732. [Google Scholar] [CrossRef]
  122. Ahmed, Z.; Mohamed, K.; Zeeshan, S.; Dong, X. Artificial intelligence with multi-functional machine learning platform development for better healthcare and precision medicine. Database 2020, 2020, baaa010. [Google Scholar] [CrossRef]
  123. Wang, Q.; Su, M.; Zhang, M.; Li, R. Integrating digital technologies and public health to fight Covid-19 pandemic: Key technologies, applications, challenges and outlook of digital healthcare. Int. J. Environ. Res. Public Health 2021, 18, 6053. [Google Scholar] [CrossRef]
  124. Sourav, A. Data security and privacy concern in the healthcare system. In Internet of Healthcare Things: Machine Learning for Security and Privacy; Wiley: New York, NY, USA, 2022; pp. 1–25. [Google Scholar]
  125. Yathiraju, N. Investigating the use of an artificial intelligence model in an ERP cloud-based system. Int. J. Electr. Electron. Comput. 2022, 7, 1–26. [Google Scholar] [CrossRef]
  126. Sittón-Candanedo, I.; Alonso, R.S.; Corchado, J.M.; Rodríguez-González, S.; Casado-Vara, R. A review of edge computing reference architectures and a new global edge proposal. Future Gener. Comput. Syst. 2019, 99, 278–294. [Google Scholar] [CrossRef]
  127. Tanuwidjaja, H.C.; Choi, R.; Baek, S.; Kim, K. Privacy-preserving deep learning on machine learning as a service-a comprehensive survey. IEEE Access 2020, 8, 167425–167447. [Google Scholar] [CrossRef]
  128. Nasr, M.; Islam, M.M.; Shehata, S.; Karray, F.; Quintana, Y. Smart healthcare in the age of AI: Recent advances, challenges, and future prospects. IEEE Access 2021, 9, 145248–145270. [Google Scholar] [CrossRef]
  129. Majdalawieh, M.; Hani, A.B.; Al-Sabbah, H.; Adedugbe, O.; Benkhelifa, E. A Cloud-Native Knowledge Management Framework for Patient-Generated Health Data. In Proceedings of the 2023 Tenth International Conference on Social Networks Analysis, Management and Security (SNAMS), Abu Dhabi, United Arab Emirates, 21–24 November 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–7. [Google Scholar]
  130. Sutradhar, K.; Venkatesh, R.; Venkatesh, P. Quantum Internet of Things for Smart Healthcare. In Learning Techniques for the Internet of Things; Springer: Berlin/Heidelberg, Germany, 2023; pp. 261–285. [Google Scholar]
Figure 1. Methodology of the selection process.
Figure 1. Methodology of the selection process.
Electronics 13 04860 g001
Figure 2. Features of medical big data.
Figure 2. Features of medical big data.
Electronics 13 04860 g002
Figure 3. Medical data sources.
Figure 3. Medical data sources.
Electronics 13 04860 g003
Figure 4. General wearable device architecture [54].
Figure 4. General wearable device architecture [54].
Electronics 13 04860 g004
Figure 5. Medical big data sources.
Figure 5. Medical big data sources.
Electronics 13 04860 g005
Figure 6. Categories of big data analytics.
Figure 6. Categories of big data analytics.
Electronics 13 04860 g006
Table 1. Key research studies on big medical image data analysis through deep learning and cloud computing.
Table 1. Key research studies on big medical image data analysis through deep learning and cloud computing.
Ref.YearApplication DomainCloud Computing InfrastructureDeep Learning MethodologyKey FindingsLimitations
[12]2019Glioblastoma Detection in MRIAWS Medical Imaging CloudEnhanced VGG16 with transfer learning
-
94% accuracy in tumor detection
-
60% faster processing time
-
Validated on 150 MRI images
-
Small dataset size
-
Limited multi-center validation
[13]2021Hospital Resource OptimizationGoogle Cloud Healthcare APIHierarchical attention networks
-
42% improvement in decision quality
-
38% enhanced operational efficiency
-
Analyzed 384 hospitals
-
Region-specific implementation
-
Limited to urban hospitals
[14]2021Diabetes Management with Medical ImagingAzure Healthcare CloudHybrid CNN-LSTM
-
91.3% diagnostic accuracy
-
Real-time monitoring capability
-
50,000 patient records
-
High computational cost
-
Limited to specific imaging modalities
[15]2021Neurological DisordersDistributed cloud computing frameworkMulti-modal CNN architecture
-
89.7% mean diagnostic accuracy
-
Sensitivity: 92.3%
-
Meta-analysis of 45 studies
-
Hardware dependency
-
Integration complexity
[16]2021COVID-19 Chest X-ray AnalysisFederated cloud frameworkResNet50 with attention mechanism
-
Validated across 12 countries
-
Privacy-preserving analysis
-
Standardized assessment matrix
-
Network latency issues
-
Data synchronization challenges
[17]2022Multi-institutional Medical ImagingHybrid cloud architectureFederated deep learning
-
96.5% data protection
-
89.2% analytical accuracy
-
8 healthcare institutions
-
Communication overhead
-
Model convergence issues
[18]2022Lung Cancer DetectionEdge-cloud infrastructureEnsemble CNN with Rotation Forest
-
93.7% accuracy
-
2500 patient records
-
Reduced latency by 40%
-
Edge device limitations
-
Resource constraints
[19]2022Brain Tumor DetectionScalable cloud platformModified CNN architecture
-
95.8% accuracy
-
15% faster detection
-
3000 MRI scans
-
High bandwidth requirement
-
Complex deployment
[20]2023Healthcare Analytics IntegrationMulti-cloud frameworkDeep transfer learning
-
200 facilities analyzed
-
Implementation framework
-
Integration success metrics
-
Cost implications
-
Vendor lock-in issues
[21]2023Oral Cancer DiagnosisCloud-based medical imagingEnsemble deep learning
-
94.2% accuracy
-
Multi-demographic validation
-
32 model comparison
-
Dataset bias
-
Limited rare case detection
[22]2023Alzheimer’s DiagnosisDistributed cloud computingComparative DL analysis
-
91.4% diagnostic accuracy
-
85 studies meta-analysis
-
Statistical significance
-
Processing bottlenecks
-
Standardization challenges
[23]2024Brain Tumor ClassificationEdge-cloud hybrid systemHybrid deep learning
-
96.3% accuracy
-
5000 MRI scans
-
Reduced false positives
-
Infrastructure costs
-
Scaling limitations
[24]2024Cancer Research Data IntegrationCloud-based data lakeInteroperable DL framework
-
87% implementation success
-
150 datasets analyzed
-
Standardization protocols
-
Migration complexity
-
Security overhead
[25]2024Epileptic Seizure DetectionReal-time cloud processingAdvanced LSTM model
-
97.2% accuracy
-
23% improved detection
-
10,000 EEG recordings
-
Real-time constraints
-
Power consumption issues
Table 2. Summary of algorithm performance across healthcare applications.
Table 2. Summary of algorithm performance across healthcare applications.
Algorithm TypeMedical ApplicationAccuracy (%)Sample SizeKey Performance Metrics
Probabilistic Neural Network [12]Glioblastoma Detection94.0150 MRI imagesImproved detection rates over traditional methods
Hierarchical Attention Network [14]Diabetes Management91.350,000 recordsp < 0.001 significance level
CNN Architectures [15]Neurological Disorders89.7Meta-analysis (45 studies)Sensitivity: 92.3%; Specificity: 90.1%
Federated Learning [16]Multi-institutional Analysis89.28 institutionsData protection: 96.5%
Rotation Forest [18]Lung Cancer Prediction93.72500 recordsComprehensive risk stratification
CNN Architecture [19]Brain Tumor Detection95.83000 MRI scans15% improvement in speed and accuracy
Ensemble DL Models [21]Oral Cancer Diagnosis94.2Multiple datasetsCross-demographic validation
DL Methods [22]Alzheimer’s Diagnosis91.485 studiesp < 0.01 significance level
Hybrid DL Approach [23]Brain Tumor Classification96.35000 MRI scansReduced false positives
LSTM Model [25]Epileptic Seizure Detection97.210,000 EEG records23% improvement in early detection
Table 3. Systematic analysis of machine learning techniques in big data for healthcare: methodological validation and clinical outcomes.
Table 3. Systematic analysis of machine learning techniques in big data for healthcare: methodological validation and clinical outcomes.
Study ReferenceDisease FocusData SourceData SizeDeep Learning AlgorithmResults
Thanka et al., 2023 [90]MelanomaISIC validated dataset1416 standardized imagesEnsemble: VGG16 with XGBoostPrimary: 99.1% accuracy with validated cross-fold testing
Li et al., 2023 [91]Multiple SclerosisMulticenter EMR data7258 clinically validated casesOptimized Random ForestAUC: 0.65 with external validation; Sensitivity: 72%
Ksibi et al., 2023 [92]DepressionMODMA standardized EEG2104 validated recordingsHybrid CNN–LSTM97% accuracy with 5-fold cross-validation
Carter et al., 2023 [93]TuberculosisMulti-institutional genetic database24,231 verified isolatesEnhanced Gradient-Boosted TreesSensitivity: 97.2%; Specificity: 63.1% on test set
Swain et al., 2023 [94]Chronic Kidney DiseaseUCI CKD benchmark400 validated recordsOptimized SVM with feature selection99.33% accuracy with independent test cohort
Chen et al., 2023 [95]Post-Stroke DepressionNational EMR database76,826 longitudinal casesXGBoost with temporal featuresAUC-ROC: 71% with prospective validation
Ahmed et al., 2023 [96]DiabetesUCI verified dataset520 complete casesEnsemble: SVM-ANN fusion94.87% accuracy with external validation
Andorra et al., 2023 [97]Multiple SclerosisMulti-modal clinical data420 longitudinal casesRandom Forest with feature importanceValidated prediction of disease progression (AUC: 0.82)
Solomon et al., 2023 [98]Rheumatoid ArthritisCorEvitas registry11,883 verified patientsAdvanced clustering ensembleIdentified 5 distinct patient subgroups (p < 0.001)
Kopitar et al., 2020 [99]Type 2 DiabetesStandardized EHR3723 prospective casesMulti-algorithm ensembleAUC: 0.859 with temporal validation
Botlagunta et al., 2023 [100]Breast CancerValidated EMR profiles26,800 casesEnhanced Decision TreeAUC: 0.87 with independent validation cohort
Alamro et al., 2023 [101]Alzheimer’sGEO expression data445 validated samplesHybrid CNN–DNNAUC: 0.979 with novel biomarker validation
Lee et al., 2023 [102]Rheumatoid ArthritisKORONA database2374 verified casesOptimized XGBoost ensembleAUC: 0.750; F1: 0.705 with external validation
Table 4. Consolidated performance analysis of machine learning applications in healthcare.
Table 4. Consolidated performance analysis of machine learning applications in healthcare.
Algorithm TypeDisease DomainDataset SizePrimary MetricValidation Method
VGG16–XGBoost EnsembleMelanoma1416 images99.1% accuracyCross-fold validation
Optimized Random ForestMultiple Sclerosis7258 casesAUC: 0.65External validation
Hybrid CNN–LSTMDepression2104 records97% accuracy5-fold cross-validation
Gradient-Boosted TreesTuberculosis24,231 samplesSensitivity: 97.2%Independent test set
SVM with Feature SelectionChronic Kidney Disease400 records99.33% accuracyIndependent cohort
XGBoost (Temporal)Post-stroke Depression76,826 casesAUC-ROC: 71%Prospective validation
SVM–ANN FusionDiabetes520 cases94.87% accuracyExternal validation
Feature-based Random ForestMultiple Sclerosis420 casesAUC: 0.82Disease progression validation
Clustering EnsembleRheumatoid Arthritis11,883 patientsp < 0.001Statistical significance
Multi-algorithm EnsembleType 2 Diabetes3723 casesAUC: 0.859Temporal validation
Enhanced Decision TreeBreast Cancer26,800 casesAUC: 0.87Independent cohort
Hybrid CNN–DNNAlzheimer’s445 samplesAUC: 0.979Biomarker validation
XGBoost EnsembleRheumatoid Arthritis2374 casesAUC: 0.750External validation
Table 5. Comparative analysis of deep learning techniques in big data for healthcare: validated methods and clinical outcomes.
Table 5. Comparative analysis of deep learning techniques in big data for healthcare: validated methods and clinical outcomes.
Ref.Disease FocusData SourceData SizeDL AlgorithmResults
Helaly et al., 2022 [103]Alzheimer’s DiseaseADNI standardized dataset48,000 validated imagesHierarchical CNN with VGG19 transfer learning97% accuracy with 5-fold cross-validation; sensitivity: 96.8%; specificity: 97.2%
Abdusalomov et al., 2023 [104]Brain TumorsMulti-center MRI database10,288 annotated imagesEnhanced YOLOv7 with attention mechanisms99.5% accuracy with external validation; precision: 99.3%; recall: 99.7%
Forte et al., 2022 [105]Lung CancerMulti-institutional CT dataMeta-analysis of 6 validated studiesEnsemble CNN architecturesPooled sensitivity: 93%; specificity: 68%; AUC: 0.90 with heterogeneity analysis
Zang et al., 2023 [106]Multi-Retinal DiseasesStandardized OCT/OCTA526 validated volumesMulti-Task Deep LearningDisease-specific AUCs—DR: 0.95; AMD: 0.98; Glaucoma: 0.91 with independent testing
Shoukat et al., 2023 [107]GlaucomaCurated retinal database2500 validated imagesModified ResNet-50 with attentionAccuracy: 98.48%; sensitivity: 99.30%; specificity: 96.52%, validated across populations
Humayun et al., 2023 [108]Breast CancerHistopathology database157,572 validated patchesEnhanced InceptionResNetV291% accuracy with external validation, ROC analysis included
Choudhary et al., 2023 [109]Retinal DiseasesMulti-center OCT data84,568 standardized imagesOptimized VGG-19 with domain adaptationAccuracy: 99.17%; sensitivity: 99.0%; specificity: 99.5%; cross-center validation
Garcia-Ordas et al., 2023 [110]Heart DiseaseIntegrated clinical data918 verified casesHybrid SAE–CNN architecture90.088% accuracy with independent cohort validation
Houssein et al., 2023 [111]Heart DiseaseStructured EHR data10,000 validated recordsEnsemble BERT models with clinical NLPF1: 93.66%, validated across multiple institutions
Sorour et al., 2024 [112]Alzheimer’s DiseaseStandardized MRI database6400 validated scansIntegrated CNN–LSTM with data augmentation99.92% accuracy; comprehensive ablation studies included
Chen et al., 2024 [113]Congenital Heart DiseaseMulti-center ECG data65,869 validated casesNovel CHDdECG architectureROC-AUC: 0.915 with external validation cohort
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Shakor, M.Y.; Khaleel, M.I. Recent Advances in Big Medical Image Data Analysis Through Deep Learning and Cloud Computing. Electronics 2024, 13, 4860. https://doi.org/10.3390/electronics13244860

AMA Style

Shakor MY, Khaleel MI. Recent Advances in Big Medical Image Data Analysis Through Deep Learning and Cloud Computing. Electronics. 2024; 13(24):4860. https://doi.org/10.3390/electronics13244860

Chicago/Turabian Style

Shakor, Mohammed Y., and Mustafa Ibrahim Khaleel. 2024. "Recent Advances in Big Medical Image Data Analysis Through Deep Learning and Cloud Computing" Electronics 13, no. 24: 4860. https://doi.org/10.3390/electronics13244860

APA Style

Shakor, M. Y., & Khaleel, M. I. (2024). Recent Advances in Big Medical Image Data Analysis Through Deep Learning and Cloud Computing. Electronics, 13(24), 4860. https://doi.org/10.3390/electronics13244860

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop