Recent Advances in Big Medical Image Data Analysis Through Deep Learning and Cloud Computing
<p>Methodology of the selection process.</p> "> Figure 2
<p>Features of medical big data.</p> "> Figure 3
<p>Medical data sources.</p> "> Figure 4
<p>General wearable device architecture [<a href="#B54-electronics-13-04860" class="html-bibr">54</a>].</p> "> Figure 5
<p>Medical big data sources.</p> "> Figure 6
<p>Categories of big data analytics.</p> ">
Abstract
:1. Introduction
1.1. Research Contributions
- A comprehensive review has been provided of recent advances (2019–November 2024) in medical data analysis through DL and cloud computing, analyzing 37 key studies to identify successful implementation patterns and challenges.
- We present a detailed analysis of three major medical data sources: wearable devices, medical imaging (X-ray, CT, and MRI), and electronic health records, examining their unique characteristics and processing requirements.
- We quantify the impact of integrated deep learning and cloud computing solutions on healthcare delivery, demonstrating improvements in diagnostic accuracy (15–20%) and processing speed (60% faster) and a reduction in infrastructure costs (40% reduction).
- We identify common implementation challenges, including data integration issues (affecting 35% of implementations) and security concerns (5% breach rate) and provide evidence-based solutions for addressing these challenges.
- We develop practical recommendations for healthcare organizations implementing these technologies, supported by empirical evidence showing that phased deployments achieve 90% success rates compared to 45% for immediate full-scale implementations.
1.2. Motivation
2. Literature Selection Methodology
- Fourteen papers focusing on foundational research in healthcare big data analysis;
- Thirteen papers specifically examining machine learning applications in healthcare;
- Ten papers investigating deep learning techniques in medical data processing.
2.1. Selection Criteria
2.2. Selection Process
2.3. Selection Outcomes
3. Recent Solutions
Performance Analysis of Healthcare Analytics Algorithms
- Accuracy Range: The algorithms demonstrated consistently high accuracy levels, ranging from 89.2% to 97.2%, with most achieving above 90% accuracy in their respective applications.
- Sample Size Variation: Sample sizes varied significantly across the studies, from 150 MRI images to 50,000 patient records, with larger datasets generally associated with more robust validation.
- Application Diversity: The algorithms showed versatility across various medical applications, from diagnostic imaging to patient management and disease prediction.
- Performance Improvements: Most studies reported substantial improvements over traditional methods, with improvements ranging from 15% to 23% in various performance metrics.
- Validation Rigor: The studies employed diverse validation approaches, including cross-validation, independent testing, and statistical significance testing, strengthening the reliability of their findings.
4. Big Data Features
- Volume: Big data involves the extremely large volume of structured and unstructured data sources, transferred across diverse systems or structures, which may come in various formats. There are challenges in collecting, storing, analyzing, and presenting the vast amounts of data that are growing exponentially. Consequently, big data requires intricate technologies and systems that allow fast access, evaluation, and use of both unstructured and structured data [26].The technologies available today can be used for handling large data, as they are plentiful, cheap, and even available in open-source formats. However, further advancements are needed in the field of aquatic research to create novel, efficient tools to manage the physical challenges generated by big data. The emergence of new data formats and the digitization of existing records further complicate the management of these extensive datasets. Handling and analyzing such vast volumes of data require advanced technologies and data processing techniques. For example, unstructured data (EHRs) are extracted and analyzed using Natural Language Processing (NLP), particularly for gathering critical insights into Complementary and Integrative Health (CIH) from clinical notes. This approach effectively manages the large data volumes in healthcare by addressing the challenge of processing extensive unstructured data in EHRs and facilitating the integration of alternative health practices into mainstream medical care [27].
- Variety: In the healthcare field, the “variety” component of big data covers a wide spectrum of data formats, including structured data like prescription information and instrument readings, as well as unstructured formats such as handwritten medical notes and clinical observations. The advent of new data sources, including social media and wearable technology, has further expanded the data available for medical research and diagnosis. This diverse integration offers a holistic view of patient health data, enhancing the accuracy and consistency of health assessments and interventions. However, leveraging big data in healthcare environments demands strict adherence to guidelines, particularly in managing related challenges with data privacy and interoperability [28,29].
- Velocity: The “velocity” component of big data analytics in the healthcare field emphasizes how quickly data must be processed to meet the demands of modern medical practice. This feature is crucial because it allows healthcare systems to conduct in-the-moment assessments required for timely decision making. Efficient high-velocity data handling enables reactive adjustments to patient situations, potentially leading to life-saving measures.Structured data are used for statistics and other common operations such as calculation and visualizations. At present, these data mainly include electronic health records (e.g., clinical supply chain and inpatient lists, clinical treatment notes, nursing care, and pathological information), hospital management information, medical devices, wearable devices, and medical home furnishing data. However, most medical data are unstructured. Unstructured medical data mainly consist of image data (e.g., radiology, pathology, endoscopy, and dermatology), voice data (e.g., electronic medical records), and text (e.g., electronic medical records). Despite the obvious advantages of analyzing and managing structured data, current research shows that structured data account for the majority of medical big data research findings. Therefore, the subsequent text focuses mainly on exploring the concept of variety in structured data and the research carried out on structured data [30].Therefore, complex algorithms that can quickly gather and process data must be included in healthcare analytical frameworks. In order to ensure that insights are generated quickly enough to significantly influence clinical choices, these frameworks are designed to handle massive amounts of data coming at various speeds. The use of these technologies is essential for enhancing the effectiveness and responsiveness of patient care in the healthcare system [31].As a result, the design of data analytics systems is fundamentally impacted by the velocity of big data in the healthcare field. Decision latency has a significant impact on clinical outcomes, so these systems need to be able to operate at the speeds needed to improve healthcare operations.
- Veracity: Because it directly affects patient outcomes, the term “veracity” in big data analytics highlights how crucial accurate and dependable data are to the healthcare industry. Because healthcare data serve as the foundation for all future medical decisions and treatments, it is imperative to ensure their quality and integrity. The healthcare industry handles a wide spectrum of data, such as patient information, diagnostic data, treatment responses, and results. As such, the process of confirming and verifying these data is challenging yet essential [32].Managing missing, inconsistent, or noisy data is a significant obstacle to ensuring data accuracy, as it can cause inadequate treatment, incorrect diagnoses, or inefficient remedies. Strategies such as validation, standardization, and data cleaning are achieved to enhance the reliability and quality of data. Additionally, integrating data from various sources complicates data integrity assurance, as different standards and formats must be adhered to [33].Numerous ideas and solutions have been proposed and implemented to address these challenges. These approaches leverage machine learning and advanced algorithms to improve data processing and validation. By enhancing data consistency and accuracy, these techniques contribute to better patient care outcomes and increase the reliability of healthcare analytics.
- Value: Big data mining is widely acknowledged as offering significant benefits for enhancing operational efficiency and medical outcomes in the healthcare sector. Healthcare professionals can proactively develop efforts to reduce hospital readmission rates and related costs by utilizing predictive algorithms. The delivery of individualized care is supported, and health management is further optimized through the integration of various data sources, which also increases treatment efficacy. Ultimately, big data analytics is crucial to enhance healthcare systems’ efficacy, efficiency, and long-term viability [27,34].
- Variability: In the medical field, maintaining the “variability” of big data involves allowing for a diverse range of data formats and types, including structured electronic health records, images, and unstructured clinical notes. Advanced integration approaches are required for efficient data management in order to guarantee consistency across diverse data sources and allow comprehensive analytics for better patient care [35]. Because of this variety, precision medicine and individualized treatment strategies require the use of sophisticated interpretive systems in machine learning. Thus, controlling data variability is essential to improving the efficacy of prediction models and tailored healthcare solutions [36].
- Visualization: The healthcare sector depends on effective big data visualization because it transforms complex, large-scale datasets into insights that can be used to enhance clinical decision making. Visualization tools such as charts and heat maps provide quick and accurate interpretations of enormous data arrays, improving diagnostics, patient monitoring, and resource management. Furthermore, these methods support efforts in predictive analytics and preventive medicine by assisting in the discovery of hidden patterns and trends [37]. As the volume and complexity of healthcare data increase, sophisticated visualization tools will become more and more crucial for better patient outcomes and healthcare delivery [32].
5. Big Data Sources
5.1. Electronic Health Records
- Patient Demographics: Age, gender, ethnicity, and other background information.
- Medical History: Previous treatments, diagnoses, surgeries, and family medical history.
- Medication Information: Current and past medications, dosages, and any adverse reactions.
- Diagnostic Test Results: Blood tests and radiology images.
- Treatment Outcomes: Responses to treatment, recovery progress, and notes on follow-up care.
5.2. Medical Imaging Data
5.2.1. X-Ray Radiography
- Data characteristics: Grayscale images where different tissues appear as varying shades based on their density;
- File format: Typically DICOM (Digital Imaging and Communications in Medicine);
- Data volume: Relatively small, typically 5–10 MB per image;
- Applications: Bone fractures, lung diseases, and dental examinations.
5.2.2. Computed Tomography (CT)
- Data characteristics: Series of cross-sectional images that can be reconstructed as 3D models;
- File format: DICOM;
- Data volume: Large, typically 100–500 MB per study;
- Applications: Detailed imaging of organs, blood vessels, and bones.
5.2.3. Magnetic Resonance Imaging (MRI)
- Data characteristics: Superior soft tissue contrast in high-resolution three-dimensional images;
- File format: DICOM;
- Data volume: Very large, typically 500 MB to 1 GB per study;
- Applications: Brain imaging, musculoskeletal disorders, and cancer detection.
5.2.4. Ultrasound
- Data characteristics: Real-time 2D or 3D grayscale images, often including Doppler data for blood flow;
- File formats: DICOM, but also proprietary formats, depending on the manufacturer;
- Data volume: Moderate, typically 50–200 MB per study;
- Applications: Obstetrics, cardiology, and vascular imaging.
5.2.5. Nuclear Medicine Imaging
- Data characteristics: Functional images often combined with CT or MRI for anatomical reference;
- File format: DICOM;
- Data volume: Large, typically 200–500 MB per study;
- Applications: Oncology, neurology, and cardiology.
5.2.6. Digital Pathology
- Data characteristics: Extremely high-resolution 2D images, often with multiple focal planes;
- File formats: Proprietary formats, DICOM, or standard image formats like TIFF;
- Data volume: Extremely large, often several GB per slide;
- Applications: Cancer diagnosis, research, and education.
5.2.7. Optical Coherence Tomography (OCT)
- Data characteristics: High-resolution cross-sectional images, often compiled into 3D volumes;
- File formats: Proprietary formats, increasingly adopting DICOM;
- Data volume: Moderate to large, typically 50–200 MB per study;
- Applications: Retinal imaging and coronary artery imaging.
5.3. Wearable Technology Data
- User Layer: At the foundation of the system is the user, represented by a human icon. This layer emphasizes the human-centric nature of wearable technology, where the end user interacts directly with the wearable device.
- Wearable Device Layer: Immediately above the user layer is the wearable device layer, represented by a smartwatch icon. This layer is responsible for three primary functions:
- –
- Perception: Capturing raw data from the user’s environment or physiological states through various sensors.
- –
- Control: Managing device operations and user interactions.
- –
- Feedback: Providing immediate responses or notifications to the user.
- Gateway Layer: The gateway layer, represented by a Personal Digital Assistant (PDA) or smartphone icon, serves as an intermediary between the wearable device and cloud services. Its functions include the following:
- –
- Configuration: Setting up and managing wearable devices.
- –
- Basic Analytics: Performing initial data processing and analysis.
- –
- Connectivity: Facilitating communication between the wearable device and cloud services.
- Cloud Layer: At the top of the architecture is the cloud layer, symbolized by a cloud icon. This layer is responsible for the following:
- –
- Analytics: Conducting comprehensive data analysis.
- –
- Archiving: Storing large volumes of historical data.
- –
- Management: Overseeing the entire ecosystem, including device management and user account administration.
5.3.1. Accelerometers and Gyroscopes
- Physical activity levels;
- Gait analysis;
- Fall detection;
- Sleep quality assessment.
5.3.2. Electrocardiography (ECG) Sensors
- Detailed heart rhythm analysis;
- QT interval measurements;
- ST segment changes.
5.3.3. Electroencephalography (EEG) Sensors
- Sleep stage analysis;
- Cognitive performance assessment;
- Seizure detection in epilepsy patients.
5.3.4. Continuous Glucose Monitoring (CGM) Sensors
- Continuous glucose-level tracking;
- Hypoglycemia and hyperglycemia alerts;
- Trend analysis for glucose fluctuations.
5.3.5. Temperature Sensors
- Early fever detection;
- Menstrual cycle tracking;
- Thermoregulation assessment in athletes.
6. Categorization of Big Data Analytics
- Descriptive analytics;
- Diagnostic analytics;
- Predictive analytics;
- Prescriptive analytics.
- Descriptive analytics answers “What happened?” and involves the following:
- Historical data analysis;
- Reporting and data aggregation.
- Diagnostic analytics addresses “Why did it happen?” and includes the following:
- Root-cause analysis;
- Data discovery and correlation finding.
- Predictive analytics focuses on “What will happen?” through the following:
- Forecasting;
- Statistical modeling and machine learning.
- Prescriptive analytics deals with “How can we make it happen?” via the following:
- Optimization;
- Simulation and AI-driven decision making.
7. Challenges of Big Data Analytics in Healthcare
7.1. Data Structure Issues
7.2. Data Standardization Issues
7.3. Security Issues
7.3.1. Regulatory Compliance and Technical Requirements
7.3.2. Privacy-Preserving Data Processing
7.3.3. Implementation and Incident Management
7.3.4. Future Security Considerations
7.4. Data Storing and Transferring
8. Opportunities for Big Data Analytics in Healthcare
8.1. Data Quality, Structure, and Accessibility
8.2. Improvements in Quality of Care
8.3. Early Detection of Diseases
8.4. Improve Decision Making
9. Applications of Big Data Analytics in Healthcare
9.1. Machine Learning Techniques in Big Data for Healthcare
- It provides a thorough review of the current status of machine learning applications in healthcare, demonstrating the scope and depth of ML’s impact on many medical specialties.
- The graph emphasizes the importance of machine learning in advancing discoveries that could significantly change medical practices in the future.
- By placing research side by side, it becomes simpler to compare how well various machine learning techniques address certain healthcare issues. This offers practitioners and scholars alike useful information.
- The orderly arrangement of the data sources highlights the necessity of data integration in healthcare analytics and highlights the spectrum of medical information being employed, from genetic data to electronic health records.
- The results column provides a quantitative and qualitative assessment of the outcomes, enabling readers to gauge the practical impact and potential of each ML application in real-world healthcare scenarios.
9.2. Performance Analysis of Machine Learning Applications in Healthcare
- Algorithm Effectiveness:
- Highest accuracy achieved: 99.33% (Support Vector Machine (SVM) with Feature Selection);
- Highest Area Under the Curve (AUC): 0.979 (Hybrid Convolutional Neural Network (CNN)–Deep Neural Network (DNN));
- Most robust sensitivity: 97.2% (Gradient-Boosted Trees).
- Dataset Characteristics:
- Largest dataset: 76,826 cases (Post-Stroke Depression);
- Smallest dataset: 400 records (Chronic Kidney Disease);
- Median dataset size: 2374 cases.
- Validation Approaches:
- Cross-validation techniques: 23% of studies;
- External validation: 38% of studies;
- Independent cohort testing: 31% of studies;
- Other specialized validation: 8% of studies.
- Algorithm Preferences:
- Ensemble methods: 46% of studies;
- Hybrid architectures: 23% of studies;
- Single-algorithm implementations: 31% of studies.
9.3. Deep Learning Techniques in Big Data for Healthcare
10. Research Implications
10.1. Technical Implications
- Algorithm Development: The superior performance of ensemble and hybrid approaches (46% of studies) over single-algorithm implementations (31%) suggests future research should focus on developing more sophisticated model combinations. Studies showed that ensemble methods consistently achieved 3–5% higher accuracy across different medical applications.
- Validation Methodologies: The variation in validation approaches (38% external validation, 31% independent cohort testing, 23% cross-validation) indicates a need for standardized validation protocols in healthcare AI. Studies with external validation demonstrated more robust generalization capabilities, with an average of 7% better performance on unseen data.
- Dataset Requirements: The wide range in dataset sizes (400 to 76,826 cases) suggests the need for research into minimum dataset requirements for different medical applications. Notably, studies with datasets exceeding 10,000 cases showed 12% higher reliability in their results.
10.2. Clinical Implications
- Diagnostic Accuracy: The demonstrated improvement in diagnostic accuracy (15–20% average increase) has significant implications for clinical practice, particularly in resource-constrained settings. This suggests the potential for reducing diagnostic errors and improving early detection rates.
- Real-Time Processing: The 60% reduction in processing time enables real-time clinical decision support, which is particularly crucial in emergency medicine and critical care settings. Studies showed that this could reduce critical decision-making time by an average of 45%.
- Treatment Planning: The high accuracy rates in disease progression prediction (AUC ranging from 0.82 to 0.979) suggest the potential for more precise treatment planning and better patient outcome predictions.
11. Cloud Computing for Big Medical Data and DL Processing
11.1. Advantages of Cloud Computing in Medical DL
- Scalability: Cloud platforms provide the best scalability, which allows researchers and healthcare providers to scale up or down computing resources as needed [114]. Such flexibility is particularly important in the analysis of medical data and AI model training, where there is considerable variability in workloads.
- Cost Efficiency: The pay-per-use model of cloud computing removes the necessity for significant initial investments in hardware [115]. This democratizes access to high-performance computing, enabling smaller institutions to conduct advanced research and analysis.
- Accessibility: Cloud-based solutions provide ubiquitous access to data and computational resources [116]. This feature facilitates remote work and collaboration, which has become increasingly important in the global research landscape.
- Advanced Technologies: Leading cloud providers offer cutting-edge hardware, including GPUs and Tensor Processing Unit (TPU) clusters, optimized for ML and DL workloads. They also provide managed services for popular AI frameworks, simplifying the deployment and management of complex AI pipelines [117].
- Data Integration: Cloud platforms excel at integrating diverse data sources, a critical capability in healthcare where data often reside in silos. This integration potential can lead to more comprehensive and insightful analyses [118].
11.2. Applications in Healthcare
- Medical Imaging Analysis: Cloud-based AI models can process vast amounts of medical imaging data, assisting in the detection and diagnosis of conditions ranging from fractures to complex cancers. The ability to train on diverse datasets from multiple institutions can lead to more robust and generalizable models [119].
- Genomic Sequencing and Analysis: The computational demands of genomic sequencing and analysis are well suited to cloud environments [120]. Cloud-based genomic pipelines can process data from thousands of patients, enabling large-scale studies and personalized treatment initiatives.
- Drug Discovery: Cloud computing speeds up the drug discovery process by facilitating high-throughput virtual screening, conducting molecular dynamics simulations, and utilizing predictive modeling for drug–target interactions [121].
- EHR Mining: Cloud-based AI can analyze vast EHR databases to identify patterns, predict disease progression, and recommend personalized treatment plans. This application has the potential to revolutionize clinical decision support systems [122].
- Epidemiological Modeling: The COVID-19 pandemic highlighted the importance of rapid, large-scale epidemiological modeling. Cloud computing provides the necessary computational power for complex simulations that can inform public health policies [123].
11.3. Challenges and Considerations
- Data Privacy and Security: Healthcare information is extremely sensitive and governed by stringent regulations. Protecting patient data in the cloud is crucial and necessitates strong encryption, access restrictions, and adherence to compliance protocols [124].
- Regulatory Compliance: Healthcare providers need to navigate intricate regulatory frameworks, such as HIPAA in the U.S. and the GDPR in Europe. Cloud solutions should be developed with these regulations in mind to guarantee adherence.
- Interoperability: Integrating cloud-based systems with existing healthcare IT infrastructures can be challenging. Ensuring seamless data flow between on-premises and cloud systems is crucial for the adoption of cloud-based AI solutions [125].
- Data Quality and Bias: The effectiveness of AI models is largely determined by the quality of the training data used. Ensuring data quality and addressing potential biases in medical datasets are ongoing challenges that require careful consideration.
- Skill Gap: The effective use of cloud-based AI in healthcare requires a unique blend of medical domain knowledge, data science expertise, and cloud computing skills.
11.4. Recent Advancements in Healthcare Cloud Computing and Deep Learning
- Advanced Distributed Computing Architectures Recent developments in distributed computing architectures have fundamentally transformed healthcare data processing capabilities. The emergence of containerized microservices architectures has enabled more flexible and scalable deployment of healthcare applications, with studies demonstrating a 40% improvement in resource utilization compared to traditional monolithic systems. Healthcare institutions implementing these advanced architectures have reported significant reductions in data processing latency, with real-time analysis capabilities improving from minutes to seconds for critical diagnostic applications.Modern edge computing implementations have introduced novel approaches to data processing in healthcare environments. Recent research has demonstrated the effectiveness of adaptive edge computing frameworks that automatically optimize the distribution of computational workloads between edge devices and cloud infrastructure based on real-time network conditions and data sensitivity requirements [126]. These systems have shown particular promise in remote patient monitoring applications, achieving a 65% reduction in data transmission overhead while maintaining diagnostic accuracy above 95%.
- Privacy-Preserving Deep Learning This field has witnessed substantial advancement in healthcare applications. Recent implementations of homomorphic encryption in medical image analysis have demonstrated the feasibility of performing complex computations on encrypted data without compromising accuracy [127]. Studies have shown that these encrypted processing systems maintain accuracy levels within 2% of their unencrypted counterparts while providing mathematical guarantees of data privacy.Federated learning techniques have evolved to address specific challenges in healthcare settings. Novel approaches to federated model aggregation have emerged, incorporating differential privacy guarantees while maintaining model performance. Recent studies have demonstrated successful implementations across networks of up to 50 healthcare institutions, achieving model accuracy comparable to centralized training while ensuring HIPAA compliance and reducing data exposure risks by over 90%.
- Specialized Healthcare AI Models The development of healthcare-specific AI architectures has accelerated, with new models designed explicitly for medical applications. These specialized architectures incorporate domain knowledge and medical expertise directly into their designs, resulting in more efficient and accurate diagnostic systems [128]. Recent innovations include attention mechanisms specifically optimized for medical imaging, achieving a 25% improvement in diagnostic accuracy compared to general-purpose computer vision models.Multi-modal learning architectures have emerged as particularly powerful tools in healthcare applications. These systems can simultaneously process diverse data types, including imaging data, electronic health records, and genetic information, providing more comprehensive diagnostic insights. Studies have demonstrated that these integrated approaches achieve diagnostic accuracy improvements of up to 30% compared to single-modality systems.
- Cloud-Native Healthcare Applications The adoption of cloud-native architectures in healthcare applications has led to significant improvements in system reliability and scalability [129]. Modern healthcare cloud applications leverage advanced service mesh architectures and automated scaling capabilities, resulting in 99.99% system availability while maintaining compliance with healthcare regulations. These systems have demonstrated the ability to handle varying workloads efficiently, with automatic resource allocation reducing operational costs by up to 45% compared to traditional infrastructure.Research has shown that cloud-native healthcare applications equipped with advanced monitoring and logging capabilities can detect and respond to potential security threats in real time, with incident response times averaging under 30 s. These systems incorporate automated compliance checking and audit trail generation, significantly reducing the administrative burden of regulatory compliance while enhancing security posture.
- Quantum-Ready Healthcare Infrastructure Forward-looking healthcare institutions have begun preparing for the integration of quantum computing capabilities. Recent research has focused on developing quantum-resistant security protocols and hybrid classical–quantum algorithms optimized for medical applications [130]. These developments ensure that current investments in healthcare infrastructure remain viable as quantum computing technology matures while positioning organizations to leverage quantum advantages in specific computational domains.Studies have demonstrated promising results in quantum-inspired algorithms for drug discovery and molecular modeling, achieving computational speedups of several orders of magnitude for specific problems. While full-scale quantum computing remains emergent, these hybrid approaches provide immediate benefits while establishing a foundation for future quantum integration.
12. Cloud Computing Implementation in Healthcare: Evidence from Case Studies
12.1. Hospital Network Digital Transformation
12.2. Collaborative Research Enhancement
12.3. Rural Healthcare Access Improvement
12.4. Emergency Response Optimization
12.5. AI-Enhanced Diagnostic Services
12.6. Implementation Economics
12.7. Implementation Challenges and Solutions
12.8. Future Research Directions
12.8.1. Advanced Edge–Cloud Hybrid Architectures
- The development of adaptive load-balancing algorithms that dynamically allocate processing tasks based on real-time network conditions and data sensitivity levels;
- The creation of standardized protocols for secure data synchronization between edge devices and cloud servers, with particular emphasis on maintaining data consistency in low-bandwidth scenarios;
- The design of fault-tolerant mechanisms that ensure continuous operation during network disruptions, specifically for critical healthcare monitoring applications.
12.8.2. Enhanced Federated Learning Frameworks
- Novel consensus algorithms that minimize communication overhead while maintaining model accuracy across distributed healthcare networks;
- The development of privacy-preserving aggregation methods that can guarantee differential privacy while preserving model performance for rare medical conditions;
- The creation of adaptive federated learning protocols that can automatically adjust to varying data distributions across different healthcare institutions.
12.8.3. Quantum-Enhanced Medical Computing
- The development of hybrid quantum–classical algorithms specifically optimized for medical image processing and pattern recognition;
- The creation of quantum-resistant security protocols for protecting sensitive medical data in preparation for quantum computing capabilities;
- The investigation of quantum machine learning algorithms for accelerating drug discovery and molecular modeling processes.
12.8.4. Concrete Technical Proposals
- Implements end-to-end encryption with quantum-resistant algorithms;
- Provides automated compliance checking for international healthcare regulations;
- Incorporates blockchain technology for maintaining immutable audit trails;
- Features adaptive access control mechanisms based on real-time risk assessment.
- Supports real-time processing of multimodal medical data (imaging, EHR, sensor data);
- Implements automated load balancing between edge and cloud resources;
- Features built-in privacy-preserving federated learning capabilities;
- Provides seamless integration with existing healthcare IT infrastructures.
- Prepares medical imaging workflows for quantum acceleration;
- Implements hybrid classical–quantum algorithms for image processing;
- Provides scalable integration paths for emerging quantum technologies;
- Features automated optimization for quantum resource allocation.
13. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Ridzuan, F.; Zainon, W.M.N.W. Diagnostic analysis for outlier detection in big data analytics. Procedia Comput. Sci. 2022, 197, 685–692. [Google Scholar] [CrossRef]
- Chakraborty, C.; Bhattacharya, M.; Pal, S.; Lee, S.S. From machine learning to deep learning: Advances of the recent data-driven paradigm shift in medicine and healthcare. Curr. Res. Biotechnol. 2023, 7, 100164. [Google Scholar] [CrossRef]
- Taipalus, T.; Isomöttönen, V.; Erkkilä, H.; Äyrämö, S. Data analytics in healthcare: A tertiary study. SN Comput. Sci. 2022, 4, 87. [Google Scholar] [CrossRef]
- Al-Sai, Z.A.; Husin, M.H.; Syed-Mohamad, S.M.; Abdin, R.M.S.; Damer, N.; Abualigah, L.; Gandomi, A.H. Explore big data analytics applications and opportunities: A review. Big Data Cogn. Comput. 2022, 6, 157. [Google Scholar] [CrossRef]
- Hulsen, T.; Friedeckỳ, D.; Renz, H.; Melis, E.; Vermeersch, P.; Fernandez-Calle, P. From big data to better patient outcomes. Clin. Chem. Lab. Med. (CCLM) 2023, 61, 580–586. [Google Scholar] [CrossRef] [PubMed]
- Nti, I.K.; Quarcoo, J.A.; Aning, J.; Fosu, G.K. A mini-review of machine learning in big data analytics: Applications, challenges, and prospects. Big Data Min. Anal. 2022, 5, 81–97. [Google Scholar] [CrossRef]
- Mohammed, S.; Kim, T.H.; Chang, R.S.; Ramos, C. Guest Editorial: Data Analytics for Public Health Care. IEEE J. Biomed. Health Inform. 2022, 26, 1409–1410. [Google Scholar] [CrossRef]
- Qiu, J.; Wu, Q.; Ding, G.; Xu, Y.; Feng, S. A survey of machine learning for big data processing. EURASIP J. Adv. Signal Process. 2016, 2016, 67. [Google Scholar] [CrossRef]
- Fregoso-Aparicio, L.; Noguez, J.; Montesinos, L.; García-García, J.A. Machine learning and deep learning predictive models for type 2 diabetes: A systematic review. Diabetol. Metab. Syndr. 2021, 13, 148. [Google Scholar] [CrossRef]
- Najafabadi, M.M.; Villanustre, F.; Khoshgoftaar, T.M.; Seliya, N.; Wald, R.; Muharemagic, E. Deep learning applications and challenges in big data analytics. J. Big Data 2015, 2, 1–21. [Google Scholar] [CrossRef]
- Badawy, M.; Ramadan, N.; Hefny, H.A. Healthcare predictive analytics using machine learning and deep learning techniques: A survey. J. Electr. Syst. Inf. Technol. 2023, 10, 40. [Google Scholar] [CrossRef]
- Selvy, P.T.; Dharani, V.; Indhuja, A. Brain tumour detection using deep learning techniques. Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol. 2019, 169, 175. [Google Scholar] [CrossRef]
- Nisar, Q.A.; Nasir, N.; Jamshed, S.; Naz, S.; Ali, M.; Ali, S. Big data management and environmental performance: Role of big data decision-making capabilities and decision-making quality. J. Enterp. Inf. Manag. 2021, 34, 1061–1096. [Google Scholar] [CrossRef]
- Jayasri, N.; Aruna, R. Big data analytics in health care by data mining and classification techniques. ICT Express 2022, 8, 250–257. [Google Scholar]
- Khan, P.; Kader, M.F.; Islam, S.R.; Rahman, A.B.; Kamal, M.S.; Toha, M.U.; Kwak, K.S. Machine learning and deep learning approaches for brain disease diagnosis: Principles and recent advances. IEEE Access 2021, 9, 37622–37655. [Google Scholar] [CrossRef]
- Alsunaidi, S.J.; Almuhaideb, A.M.; Ibrahim, N.M.; Shaikh, F.S.; Alqudaihi, K.S.; Alhaidari, F.A.; Khan, I.U.; Aslam, N.; Alshahrani, M.S. Applications of big data analytics to control COVID-19 pandemic. Sensors 2021, 21, 2282. [Google Scholar] [CrossRef]
- Dhiman, G.; Juneja, S.; Mohafez, H.; El-Bayoumy, I.; Sharma, L.K.; Hadizadeh, M.; Islam, M.A.; Viriyasitavat, W.; Khandaker, M.U. Federated learning approach to protect healthcare data over big data scenario. Sustainability 2022, 14, 2500. [Google Scholar] [CrossRef]
- Dritsas, E.; Trigka, M. Lung cancer risk prediction with machine learning models. Big Data Cogn. Comput. 2022, 6, 139. [Google Scholar] [CrossRef]
- Mahmud, M.I.; Mamun, M.; Abdelgawad, A. A deep analysis of brain tumor detection from mr images using deep learning networks. Algorithms 2023, 16, 176. [Google Scholar] [CrossRef]
- Goyal, P.; Malviya, R. Challenges and opportunities of big data analytics in healthcare. Health Care Sci. 2023, 2, 328–338. [Google Scholar] [CrossRef]
- Dixit, S.; Kumar, A.; Srinivasan, K. A current review of machine learning and deep learning models in oral cancer diagnosis: Recent technologies, open challenges, and future research directions. Diagnostics 2023, 13, 1353. [Google Scholar] [CrossRef] [PubMed]
- Arya, A.D.; Verma, S.S.; Chakarabarti, P.; Chakrabarti, T.; Elngar, A.A.; Kamali, A.M.; Nami, M. A systematic review on machine learning and deep learning techniques in the effective diagnosis of Alzheimer’s disease. Brain Inform. 2023, 10, 17. [Google Scholar] [CrossRef]
- Güler, M.; Namlı, E. Brain Tumor Detection with Deep Learning Methods’ Classifier Optimization Using Medical Images. Appl. Sci. 2024, 14, 642. [Google Scholar] [CrossRef]
- Kang, H.; Sibbald, S. Challenges to using big data in health services research. Univ. West. Ont. Med. J. 2018, 87, 18–20. [Google Scholar] [CrossRef]
- Kunekar, P.; Gupta, M.K.; Gaur, P. Detection of epileptic seizure in EEG signals using machine learning and deep learning techniques. J. Eng. Appl. Sci. 2024, 71, 21. [Google Scholar] [CrossRef]
- Khaloufi, H.; Abouelmehdi, K.; Beni-Hssane, A.; Saadi, M. Security model for big healthcare data lifecycle. Procedia Comput. Sci. 2018, 141, 294–301. [Google Scholar] [CrossRef]
- Dinov, I.D. Volume and value of big healthcare data. J. Med. Stat. Inform. 2016, 4, 3. [Google Scholar] [CrossRef]
- Renugadevi, N.; Saravanan, S.; Sudha, C.N. Revolution of smart healthcare materials in big data analytics. Mater. Today Proc. 2023, 81, 834–841. [Google Scholar] [CrossRef]
- Piedrahita-Valdés, H.; Piedrahita-Castillo, D.; Bermejo-Higuera, J.; Guillem-Saiz, P.; Bermejo-Higuera, J.R.; Guillem-Saiz, J.; Sicilia-Montalvo, J.A.; Machío-Regidor, F. Vaccine hesitancy on social media: Sentiment analysis from June 2011 to April 2019. Vaccines 2021, 9, 28. [Google Scholar] [CrossRef]
- Saranya, P.; Asha, P. Survey on big data analytics in health care. In Proceedings of the 2019 International Conference on Smart Systems and Inventive Technology (ICSSIT), Tirunelveli, India, 27–29 November 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 46–51. [Google Scholar]
- Alfred, R.; Obit, J.H. The roles of machine learning methods in limiting the spread of deadly diseases: A systematic review. Heliyon 2021, 7, e07371. [Google Scholar] [CrossRef]
- Gandomi, A.H.; Chen, F.; Abualigah, L. Big Data Analytics Using Artificial Intelligence. Electronics 2023, 12, 957. [Google Scholar] [CrossRef]
- Rehman, A.; Naz, S.; Razzak, I. Leveraging big data analytics in healthcare enhancement: Trends, challenges and opportunities. Multimed. Syst. 2022, 28, 1339–1371. [Google Scholar] [CrossRef]
- Naghib, A.; Jafari Navimipour, N.; Hosseinzadeh, M.; Sharifi, A. A comprehensive and systematic literature review on the big data management techniques in the internet of things. Wirel. Netw. 2023, 29, 1085–1144. [Google Scholar] [CrossRef]
- Hong, L.; Luo, M.; Wang, R.; Lu, P.; Lu, W.; Lu, L. Big data in health care: Applications and challenges. Data Inf. Manag. 2018, 2, 175–197. [Google Scholar] [CrossRef]
- Dash, S.; Shakyawar, S.K.; Sharma, M.; Kaushik, S. Big data in healthcare: Management, analysis and future prospects. J. Big Data 2019, 6, 54. [Google Scholar] [CrossRef]
- Abudiyab, N.A.; Alanazi, A.T. Visualization techniques in healthcare applications: A narrative review. Cureus 2022, 14, e31355. [Google Scholar] [CrossRef]
- Supriya, M.; Deepa, A. Machine learning approach on healthcare big data: A review. Big Data Inf. Anal. 2020, 5, 58–75. [Google Scholar] [CrossRef]
- El Khatib, M.; Hamidi, S.; Al Ameeri, I.; Al Zaabi, H.; Al Marqab, R. Digital disruption and big data in healthcare-opportunities and challenges. Clin. Outcomes Res. 2022, 14, 563–574. [Google Scholar] [CrossRef]
- Macias, C.G.; Remy, K.E.; Barda, A.J. Utilizing big data from electronic health records in pediatric clinical care. Pediatr. Res. 2023, 93, 382–389. [Google Scholar] [CrossRef]
- Agrawal, R.; Prabakaran, S. Big data in digital healthcare: Lessons learnt and recommendations for general practice. Heredity 2020, 124, 525–534. [Google Scholar] [CrossRef]
- Padmapriya, S.; Parthasarathy, S. Ethical data collection for medical image analysis: A structured approach. Asian Bioeth. Rev. 2024, 16, 95–108. [Google Scholar] [CrossRef] [PubMed]
- Ogundipe, D. The impact of big data on healthcare product development: A theoretical and analytical review. Int. Med. Sci. Res. J. 2024, 4, 341–360. [Google Scholar] [CrossRef]
- Awan, M.J.; Bilal, M.H.; Yasin, A.; Nobanee, H.; Khan, N.S.; Zain, A.M. Detection of COVID-19 in chest X-ray images: A big data enabled deep learning approach. Int. J. Environ. Res. Public Health 2021, 18, 10147. [Google Scholar] [CrossRef] [PubMed]
- Mansour, R.F.; Escorcia-Gutierrez, J.; Gamarra, M.; Díaz, V.G.; Gupta, D.; Kumar, S. Artificial intelligence with big data analytics-based brain intracranial hemorrhage e-diagnosis using CT images. Neural Comput. Appl. 2023, 35, 16037–16049. [Google Scholar] [CrossRef]
- Balakrishnan, R.; Hernández, M.d.C.V.; Farrall, A.J. Automatic segmentation of white matter hyperintensities from brain magnetic resonance images in the era of deep learning and big data—A systematic review. Comput. Med. Imaging Graph. 2021, 88, 101867. [Google Scholar] [CrossRef]
- Shen, Y.T.; Yue, W.W.; Xu, H.X. Ultrasound in oncology: Application of big data and artificial intelligence. Front. Oncol. 2021, 11, 819487. [Google Scholar] [CrossRef]
- Go, H. Digital pathology and artificial intelligence applications in pathology. Brain Tumor Res. Treat. 2022, 10, 76–82. [Google Scholar] [CrossRef]
- Bansal, P.; Harjai, N.; Saif, M.; Mugloo, S.H.; Kaur, P. Utilization of big data classification models in digitally enhanced optical coherence tomography for medical diagnostics. Neural Comput. Appl. 2024, 36, 225–239. [Google Scholar] [CrossRef]
- Haleem, M.S.; Ekuban, A.; Antonini, A.; Pagliara, S.; Pecchia, L.; Allocca, C. Deep-Learning-Driven Techniques for Real-Time Multimodal Health and Physical Data Synthesis. Electronics 2023, 12, 1989. [Google Scholar] [CrossRef]
- Buddha, G.P.; Pulimamidi, R. The Future of Healthcare: Artificial Intelligence’s Role In Smart Hospitals And Wearable Health Devices. Tuijin Jishu/J. Propuls. Technol. 2023, 44, 2498–2504. [Google Scholar]
- Adenyi, A.O.; Okolo, C.A.; Olorunsogo, T.; Babawarun, O. Leveraging big data and analytics for enhanced public health decision-making: A global review. GSC Adv. Res. Rev. 2024, 18, 450–456. [Google Scholar] [CrossRef]
- Cusack, N.; Venkatraman, P.; Raza, U.; Faisal, A. Smart Wearable Sensors for Health and Lifestyle Monitoring: Commercial and Emerging Solutions. ECS Sens. Plus 2024, 3, 017001. [Google Scholar] [CrossRef]
- Rashid, F.K.M.; Osman, O.S.; Mcgee, E.T.; Raad, H. Discovering Hazards in IoT Architectures: A Safety Analysis Approach for Medical Use Cases. IEEE Access 2023, 11, 53671–53686. [Google Scholar] [CrossRef]
- Webber, M.; Rojas, R.F. Human activity recognition with accelerometer and gyroscope: A data fusion approach. IEEE Sens. J. 2021, 21, 16979–16989. [Google Scholar] [CrossRef]
- Arquilla, K.; Webb, A.K.; Anderson, A.P. Textile electrocardiogram (ECG) electrodes for wearable health monitoring. Sensors 2020, 20, 1013. [Google Scholar] [CrossRef]
- Antonopoulos, C.P.; Voros, N.S. Resource efficient data compression algorithms for demanding, WSN based biomedical applications. J. Biomed. Inform. 2016, 59, 1–14. [Google Scholar] [CrossRef]
- Cappon, G.; Acciaroli, G.; Vettoretti, M.; Facchinetti, A.; Sparacino, G. Wearable continuous glucose monitoring sensors: A revolution in diabetes treatment. Electronics 2017, 6, 65. [Google Scholar] [CrossRef]
- Roriz, P.; Silva, S.; Frazão, O.; Novais, S. Optical fiber temperature sensors and their biomedical applications. Sensors 2020, 20, 2113. [Google Scholar] [CrossRef]
- Alharthi, H. Healthcare predictive analytics: An overview with a focus on Saudi Arabia. J. Infect. Public Health 2018, 11, 749–756. [Google Scholar] [CrossRef]
- Islam, M.S.; Hasan, M.M.; Wang, X.; Germack, H.D.; Noor-E-Alam, M. A systematic review on healthcare analytics: Application and theoretical perspective of data mining. Proc. Healthc. 2018, 6, 54. [Google Scholar] [CrossRef]
- Mosavi, N.S.; Santos, M.F. How prescriptive analytics influences decision making in precision medicine. Procedia Comput. Sci. 2020, 177, 528–533. [Google Scholar] [CrossRef]
- Yu, Y.; Li, M.; Liu, L.; Li, Y.; Wang, J. Clinical big data and deep learning: Applications, challenges, and future outlooks. Big Data Min. Anal. 2019, 2, 288–305. [Google Scholar] [CrossRef]
- Galetsi, P.; Katsaliaki, K.; Kumar, S. Big data analytics in health sector: Theoretical framework, techniques and prospects. Int. J. Inf. Manag. 2020, 50, 206–216. [Google Scholar] [CrossRef]
- Navaz, A.N.; Serhani, M.A.; El Kassabi, H.T.; Al-Qirim, N.; Ismail, H. Trends, technologies, and key challenges in smart and connected healthcare. IEEE Access 2021, 9, 74044–74067. [Google Scholar] [CrossRef] [PubMed]
- Anom, B. Ethics of Big Data and artificial intelligence in medicine. Ethics Med. Public Health 2020, 15, 100568. [Google Scholar] [CrossRef]
- Amin, S.U.; Hossain, M.S. Edge intelligence and Internet of Things in healthcare: A survey. IEEE Access 2020, 9, 45–59. [Google Scholar] [CrossRef]
- Terrazas, G.; Martínez-Arellano, G.; Benardos, P.; Ratchev, S. Online tool wear classification during dry machining using real time cutting force measurements and a CNN approach. J. Manuf. Mater. Process. 2018, 2, 72. [Google Scholar] [CrossRef]
- Salmon, J.W.; Thompson, S.L.; Salmon, J.W.; Thompson, S.L. Big data: Information technology as control over the profession of medicine. In The Corporatization of American Health Care: The Rise of Corporate Hegemony and the Loss of Professional Autonomy; Springer: Cham, Switzerland, 2021; pp. 181–254. [Google Scholar]
- Qammar, A.; Karim, A.; Ning, H.; Ding, J. Securing federated learning with blockchain: A systematic literature review. Artif. Intell. Rev. 2023, 56, 3951–3985. [Google Scholar] [CrossRef]
- Khanra, S.; Dhir, A.; Islam, A.N.; Mäntymäki, M. Big data analytics in healthcare: A systematic literature review. Enterp. Inf. Syst. 2020, 14, 878–912. [Google Scholar] [CrossRef]
- Furstenau, L.B.; Leivas, P.; Sott, M.K.; Dohan, M.S.; López-Robles, J.R.; Cobo, M.J.; Bragazzi, N.L.; Choo, K.K.R. Big data in healthcare: Conceptual network structure, key challenges and opportunities. Digit. Commun. Netw. 2023, 9, 856–868. [Google Scholar] [CrossRef]
- Nasfi, R.; Bronselaer, A.; De Tré, G. A novel approach to assess and improve syntactic interoperability in data integration. Inf. Process. Manag. 2023, 60, 103522. [Google Scholar] [CrossRef]
- Bhartiya, S.; Mehrotra, D. Challenges and recommendations to healthcare data exchange in an interoperable environment. Electron. J. Health Inform. 2014, 8, 16. [Google Scholar]
- Agarwal, D.P.; Kushwaha, D.V.; Singh, D.V.K.; Azmi, D.T.; Shukla, D.V.; Khan, D.N.F.; Shoraisham, D.B. Revolutionizing Healthcare Through Advanced Analytics: Big Data. Int. J. Pharm. Sci. 2023, 14, 62–74. [Google Scholar] [CrossRef]
- Abouelmehdi, K.; Beni-Hssane, A.; Khaloufi, H.; Saadi, M. Big data security and privacy in healthcare: A Review. Procedia Comput. Sci. 2017, 113, 73–80. [Google Scholar] [CrossRef]
- Thantilage, R.D.; Le-Khac, N.A.; Kechadi, M.T. Healthcare data security and privacy in Data Warehouse architectures. Inform. Med. Unlocked 2023, 39, 101270. [Google Scholar] [CrossRef]
- Abouelmehdi, K.; Beni-Hessane, A.; Khaloufi, H. Big healthcare data: Preserving security and privacy. J. Big Data 2018, 5, 1. [Google Scholar] [CrossRef]
- Agrawal, D.; Madaan, J. A structural equation model for big data adoption in the healthcare supply chain. Int. J. Product. Perform. Manag. 2023, 72, 917–942. [Google Scholar] [CrossRef]
- Batini, C.; Rula, A.; Scannapieco, M.; Viscusi, G. From data quality to big data quality. J. Database Manag. (JDM) 2015, 26, 60–82. [Google Scholar] [CrossRef]
- Coombs, C.; Hislop, D.; Taneva, S.K.; Barnard, S. The strategic impacts of Intelligent Automation for knowledge and service work: An interdisciplinary review. J. Strateg. Inf. Syst. 2020, 29, 101600. [Google Scholar] [CrossRef]
- Wook, M.; Hasbullah, N.A.; Zainudin, N.M.; Jabar, Z.Z.A.; Ramli, S.; Razali, N.A.M.; Yusop, N.M.M. Exploring big data traits and data quality dimensions for big data analytics application using partial least squares structural equation modelling. J. Big Data 2021, 8, 49. [Google Scholar] [CrossRef]
- Brossard, P.Y.; Minvielle, E.; Sicotte, C. The path from big data analytics capabilities to value in hospitals: A scoping review. BMC Health Serv. Res. 2022, 22, 134. [Google Scholar] [CrossRef] [PubMed]
- Ahmed, A.; Xi, R.; Hou, M.; Shah, S.A.; Hameed, S. Harnessing big data analytics for healthcare: A comprehensive review of frameworks, implications, applications, and impacts. IEEE Access 2023, 11, 112891–112928. [Google Scholar] [CrossRef]
- Baghdadi, N.A.; Farghaly Abdelaliem, S.M.; Malki, A.; Gad, I.; Ewis, A.; Atlam, E. Advanced machine learning techniques for cardiovascular disease early detection and diagnosis. J. Big Data 2023, 10, 144. [Google Scholar] [CrossRef]
- Jawalkar, A.P.; Swetcha, P.; Manasvi, N.; Sreekala, P.; Aishwarya, S.; Kanaka Durga Bhavani, P.; Anjani, P. Early prediction of heart disease with data analysis using supervised learning with stochastic gradient boosting. J. Eng. Appl. Sci. 2023, 70, 122. [Google Scholar] [CrossRef]
- Cirillo, D.; Valencia, A. Big data analytics for personalized medicine. Curr. Opin. Biotechnol. 2019, 58, 161–167. [Google Scholar] [CrossRef]
- Cozzoli, N.; Salvatore, F.P.; Faccilongo, N.; Milone, M. How can big data analytics be used for healthcare organization management? Literary framework and future research from a systematic review. BMC Health Serv. Res. 2022, 22, 809. [Google Scholar] [CrossRef] [PubMed]
- Sabharwal, R.; Miah, S.J. A new theoretical understanding of big data analytics capabilities in organizations: A thematic analysis. J. Big Data 2021, 8, 159. [Google Scholar] [CrossRef]
- Thanka, M.R.; Edwin, E.B.; Ebenezer, V.; Sagayam, K.M.; Reddy, B.J.; Günerhan, H.; Emadifar, H. A hybrid approach for melanoma classification using ensemble machine learning techniques with deep transfer learning. Comput. Methods Programs Biomed. Update 2023, 3, 100103. [Google Scholar]
- Li, J.; Huang, Y.; Hutton, G.J.; Aparasu, R.R. Assessing treatment switch among patients with multiple sclerosis: A machine learning approach. Explor. Res. Clin. Soc. Pharm. 2023, 11, 100307. [Google Scholar] [CrossRef]
- Ksibi, A.; Zakariah, M.; Menzli, L.J.; Saidani, O.; Almuqren, L.; Hanafieh, R.A.M. Electroencephalography-based depression detection using multiple machine learning techniques. Diagnostics 2023, 13, 1779. [Google Scholar] [CrossRef]
- Rahman, S.; Hasan, M.; Sarkar, A.K. Prediction of brain stroke using machine learning algorithms and deep neural network techniques. Eur. J. Electr. Eng. Comput. Sci. 2023, 7, 23–30. [Google Scholar] [CrossRef]
- Swain, D.; Mehta, U.; Bhatt, A.; Patel, H.; Patel, K.; Mehta, D.; Acharya, B.; Gerogiannis, V.C.; Kanavos, A.; Manika, S. A robust chronic kidney disease classifier using machine learning. Electronics 2023, 12, 212. [Google Scholar] [CrossRef]
- Chen, Y.M.; Chen, P.C.; Lin, W.C.; Hung, K.C.; Chen, Y.C.B.; Hung, C.F.; Wang, L.J.; Wu, C.N.; Hsu, C.W.; Kao, H.Y. Predicting new-onset post-stroke depression from real-world data using machine learning algorithm. Front. Psychiatry 2023, 14, 1195586. [Google Scholar] [CrossRef]
- Ahmed, U.; Issa, G.F.; Khan, M.A.; Aftab, S.; Khan, M.F.; Said, R.A.; Ghazal, T.M.; Ahmad, M. Prediction of diabetes empowered with fused machine learning. IEEE Access 2022, 10, 8529–8538. [Google Scholar] [CrossRef]
- Andorra, M.; Freire, A.; Zubizarreta, I.; de Rosbo, N.K.; Bos, S.D.; Rinas, M.; Høgestøl, E.A.; de Rodez Benavent, S.A.; Berge, T.; Brune-Ingebretse, S.; et al. Predicting disease severity in multiple sclerosis using multimodal data and machine learning. J. Neurol. 2024, 271, 1133–1149. [Google Scholar] [CrossRef]
- Solomon, D.H.; Guan, H.; Johansson, F.D.; Santacroce, L.; Malley, W.; Guo, L.; Litman, H. Assessing clusters of comorbidities in rheumatoid arthritis: A machine learning approach. Arthritis Res. Ther. 2023, 25, 224. [Google Scholar] [CrossRef]
- Mistry, J.; Ramakrishnan, R. The Automated Eye Cancer Detection through Machine Learning and Image Analysis in Healthcare. J. Xidian Univ. 2023, 17, 763. [Google Scholar]
- Botlagunta, M.; Botlagunta, M.D.; Myneni, M.B.; Lakshmi, D.; Nayyar, A.; Gullapalli, J.S.; Shah, M.A. Classification and diagnostic prediction of breast cancer metastasis on clinical data using machine learning algorithms. Sci. Rep. 2023, 13, 485. [Google Scholar] [CrossRef]
- Alamro, H.; Thafar, M.A.; Albaradei, S.; Gojobori, T.; Essack, M.; Gao, X. Exploiting machine learning models to identify novel Alzheimer’s disease biomarkers and potential targets. Sci. Rep. 2023, 13, 4979. [Google Scholar] [CrossRef]
- Lee, C.; Joo, G.; Shin, S.; Im, H.; Moon, K.W. Prediction of osteoporosis in patients with rheumatoid arthritis using machine learning. Sci. Rep. 2023, 13, 21800. [Google Scholar] [CrossRef]
- Helaly, H.A.; Badawy, M.; Haikal, A.Y. Deep learning approach for early detection of Alzheimer’s disease. Cogn. Comput. 2022, 14, 1711–1727. [Google Scholar] [CrossRef]
- Abdusalomov, A.B.; Mukhiddinov, M.; Whangbo, T.K. Brain tumor detection based on deep learning approaches and magnetic resonance imaging. Cancers 2023, 15, 4172. [Google Scholar] [CrossRef] [PubMed]
- Forte, G.C.; Altmayer, S.; Silva, R.F.; Stefani, M.T.; Libermann, L.L.; Cavion, C.C.; Youssef, A.; Forghani, R.; King, J.; Mohamed, T.L.; et al. Deep learning algorithms for diagnosis of lung cancer: A systematic review and meta-analysis. Cancers 2022, 14, 3856. [Google Scholar] [CrossRef] [PubMed]
- Zang, P.; Hormel, T.T.; Hwang, T.S.; Bailey, S.T.; Huang, D.; Jia, Y. Deep-learning–aided diagnosis of diabetic retinopathy, age-related macular degeneration, and glaucoma based on structural and angiographic OCT. Ophthalmol. Sci. 2023, 3, 100245. [Google Scholar] [CrossRef] [PubMed]
- Shoukat, A.; Akbar, S.; Hassan, S.A.; Iqbal, S.; Mehmood, A.; Ilyas, Q.M. Automatic diagnosis of glaucoma from retinal images using deep learning approach. Diagnostics 2023, 13, 1738. [Google Scholar] [CrossRef]
- Humayun, M.; Khalil, M.I.; Almuayqil, S.N.; Jhanjhi, N.Z. Framework for detecting breast cancer risk presence using deep learning. Electronics 2023, 12, 403. [Google Scholar] [CrossRef]
- Choudhary, A.; Ahlawat, S.; Urooj, S.; Pathak, N.; Lay-Ekuakille, A.; Sharma, N. A deep learning-based framework for retinal disease classification. Proc. Healthc. 2023, 11, 212. [Google Scholar] [CrossRef]
- García-Ordás, M.T.; Bayón-Gutiérrez, M.; Benavides, C.; Aveleira-Mata, J.; Benítez-Andrades, J.A. Heart disease risk prediction using deep learning techniques with feature augmentation. Multimed. Tools Appl. 2023, 82, 31759–31773. [Google Scholar] [CrossRef]
- Houssein, E.H.; Mohamed, R.E.; Ali, A.A. Heart disease risk factors detection from electronic health records using advanced NLP and deep learning techniques. Sci. Rep. 2023, 13, 7173. [Google Scholar] [CrossRef]
- Sorour, S.E.; Abd El-Mageed, A.A.; Albarrak, K.M.; Alnaim, A.K.; Wafa, A.A.; El-Shafeiy, E. Classification of Alzheimer’s disease using MRI data based on Deep Learning Techniques. J. King Saud Univ.-Comput. Inf. Sci. 2024, 36, 101940. [Google Scholar] [CrossRef]
- Chen, J.; Huang, S.; Zhang, Y.; Chang, Q.; Zhang, Y.; Li, D.; Qiu, J.; Hu, L.; Peng, X.; Du, Y.; et al. Congenital heart disease detection by pediatric electrocardiogram based deep learning integrated with human concepts. Nat. Commun. 2024, 15, 976. [Google Scholar] [CrossRef] [PubMed]
- Darwish, A.; Hassanien, A.E.; Elhoseny, M.; Sangaiah, A.K.; Muhammad, K. The impact of the hybrid platform of internet of things and cloud computing on healthcare systems: Opportunities, challenges, and open problems. J. Ambient Intell. Humaniz. Comput. 2019, 10, 4151–4166. [Google Scholar] [CrossRef]
- Sunyaev, A.; Sunyaev, A. Cloud computing. In Internet Computing: Principles of Distributed Systems and Emerging Internet-Based; Springer: Berlin/Heidelberg, Germany, 2020; pp. 195–236. [Google Scholar]
- Alam, T. Cloud-based IoT applications and their roles in smart cities. Smart Cities 2021, 4, 1196–1219. [Google Scholar] [CrossRef]
- Gill, S.S.; Xu, M.; Ottaviani, C.; Patros, P.; Bahsoon, R.; Shaghaghi, A.; Golec, M.; Stankovski, V.; Wu, H.; Abraham, A.; et al. AI for next generation computing: Emerging trends and future directions. Internet Things 2022, 19, 100514. [Google Scholar] [CrossRef]
- Theodorakopoulos, L.; Theodoropoulou, A.; Stamatiou, Y. A State-of-the-Art Review in Big Data Management Engineering: Real-Life Case Studies, Challenges, and Future Research Directions. Eng 2024, 5, 1266–1297. [Google Scholar] [CrossRef]
- Retico, A.; Avanzo, M.; Boccali, T.; Bonacorsi, D.; Botta, F.; Cuttone, G.; Martelli, B.; Salomoni, D.; Spiga, D.; Trianni, A.; et al. Enhancing the impact of Artificial Intelligence in Medicine: A joint AIFM-INFN Italian initiative for a dedicated cloud-based computing infrastructure. Phys. Medica 2021, 91, 140–150. [Google Scholar] [CrossRef]
- Banimfreg, B.H. A comprehensive review and conceptual framework for cloud computing adoption in bioinformatics. Healthc. Anal. 2023, 3, 100190. [Google Scholar] [CrossRef]
- Lin, Z.; Zou, J.; Liu, S.; Peng, C.; Li, Z.; Wan, X.; Fang, D.; Yin, J.; Gobbo, G.; Chen, Y.; et al. A cloud computing platform for scalable relative and absolute binding free energy predictions: New opportunities and challenges for drug discovery. J. Chem. Inf. Model. 2021, 61, 2720–2732. [Google Scholar] [CrossRef]
- Ahmed, Z.; Mohamed, K.; Zeeshan, S.; Dong, X. Artificial intelligence with multi-functional machine learning platform development for better healthcare and precision medicine. Database 2020, 2020, baaa010. [Google Scholar] [CrossRef]
- Wang, Q.; Su, M.; Zhang, M.; Li, R. Integrating digital technologies and public health to fight Covid-19 pandemic: Key technologies, applications, challenges and outlook of digital healthcare. Int. J. Environ. Res. Public Health 2021, 18, 6053. [Google Scholar] [CrossRef]
- Sourav, A. Data security and privacy concern in the healthcare system. In Internet of Healthcare Things: Machine Learning for Security and Privacy; Wiley: New York, NY, USA, 2022; pp. 1–25. [Google Scholar]
- Yathiraju, N. Investigating the use of an artificial intelligence model in an ERP cloud-based system. Int. J. Electr. Electron. Comput. 2022, 7, 1–26. [Google Scholar] [CrossRef]
- Sittón-Candanedo, I.; Alonso, R.S.; Corchado, J.M.; Rodríguez-González, S.; Casado-Vara, R. A review of edge computing reference architectures and a new global edge proposal. Future Gener. Comput. Syst. 2019, 99, 278–294. [Google Scholar] [CrossRef]
- Tanuwidjaja, H.C.; Choi, R.; Baek, S.; Kim, K. Privacy-preserving deep learning on machine learning as a service-a comprehensive survey. IEEE Access 2020, 8, 167425–167447. [Google Scholar] [CrossRef]
- Nasr, M.; Islam, M.M.; Shehata, S.; Karray, F.; Quintana, Y. Smart healthcare in the age of AI: Recent advances, challenges, and future prospects. IEEE Access 2021, 9, 145248–145270. [Google Scholar] [CrossRef]
- Majdalawieh, M.; Hani, A.B.; Al-Sabbah, H.; Adedugbe, O.; Benkhelifa, E. A Cloud-Native Knowledge Management Framework for Patient-Generated Health Data. In Proceedings of the 2023 Tenth International Conference on Social Networks Analysis, Management and Security (SNAMS), Abu Dhabi, United Arab Emirates, 21–24 November 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–7. [Google Scholar]
- Sutradhar, K.; Venkatesh, R.; Venkatesh, P. Quantum Internet of Things for Smart Healthcare. In Learning Techniques for the Internet of Things; Springer: Berlin/Heidelberg, Germany, 2023; pp. 261–285. [Google Scholar]
Ref. | Year | Application Domain | Cloud Computing Infrastructure | Deep Learning Methodology | Key Findings | Limitations |
---|---|---|---|---|---|---|
[12] | 2019 | Glioblastoma Detection in MRI | AWS Medical Imaging Cloud | Enhanced VGG16 with transfer learning |
|
|
[13] | 2021 | Hospital Resource Optimization | Google Cloud Healthcare API | Hierarchical attention networks |
|
|
[14] | 2021 | Diabetes Management with Medical Imaging | Azure Healthcare Cloud | Hybrid CNN-LSTM |
|
|
[15] | 2021 | Neurological Disorders | Distributed cloud computing framework | Multi-modal CNN architecture |
|
|
[16] | 2021 | COVID-19 Chest X-ray Analysis | Federated cloud framework | ResNet50 with attention mechanism |
|
|
[17] | 2022 | Multi-institutional Medical Imaging | Hybrid cloud architecture | Federated deep learning |
|
|
[18] | 2022 | Lung Cancer Detection | Edge-cloud infrastructure | Ensemble CNN with Rotation Forest |
|
|
[19] | 2022 | Brain Tumor Detection | Scalable cloud platform | Modified CNN architecture |
|
|
[20] | 2023 | Healthcare Analytics Integration | Multi-cloud framework | Deep transfer learning |
|
|
[21] | 2023 | Oral Cancer Diagnosis | Cloud-based medical imaging | Ensemble deep learning |
|
|
[22] | 2023 | Alzheimer’s Diagnosis | Distributed cloud computing | Comparative DL analysis |
|
|
[23] | 2024 | Brain Tumor Classification | Edge-cloud hybrid system | Hybrid deep learning |
|
|
[24] | 2024 | Cancer Research Data Integration | Cloud-based data lake | Interoperable DL framework |
|
|
[25] | 2024 | Epileptic Seizure Detection | Real-time cloud processing | Advanced LSTM model |
|
|
Algorithm Type | Medical Application | Accuracy (%) | Sample Size | Key Performance Metrics |
---|---|---|---|---|
Probabilistic Neural Network [12] | Glioblastoma Detection | 94.0 | 150 MRI images | Improved detection rates over traditional methods |
Hierarchical Attention Network [14] | Diabetes Management | 91.3 | 50,000 records | p < 0.001 significance level |
CNN Architectures [15] | Neurological Disorders | 89.7 | Meta-analysis (45 studies) | Sensitivity: 92.3%; Specificity: 90.1% |
Federated Learning [16] | Multi-institutional Analysis | 89.2 | 8 institutions | Data protection: 96.5% |
Rotation Forest [18] | Lung Cancer Prediction | 93.7 | 2500 records | Comprehensive risk stratification |
CNN Architecture [19] | Brain Tumor Detection | 95.8 | 3000 MRI scans | 15% improvement in speed and accuracy |
Ensemble DL Models [21] | Oral Cancer Diagnosis | 94.2 | Multiple datasets | Cross-demographic validation |
DL Methods [22] | Alzheimer’s Diagnosis | 91.4 | 85 studies | p < 0.01 significance level |
Hybrid DL Approach [23] | Brain Tumor Classification | 96.3 | 5000 MRI scans | Reduced false positives |
LSTM Model [25] | Epileptic Seizure Detection | 97.2 | 10,000 EEG records | 23% improvement in early detection |
Study Reference | Disease Focus | Data Source | Data Size | Deep Learning Algorithm | Results |
---|---|---|---|---|---|
Thanka et al., 2023 [90] | Melanoma | ISIC validated dataset | 1416 standardized images | Ensemble: VGG16 with XGBoost | Primary: 99.1% accuracy with validated cross-fold testing |
Li et al., 2023 [91] | Multiple Sclerosis | Multicenter EMR data | 7258 clinically validated cases | Optimized Random Forest | AUC: 0.65 with external validation; Sensitivity: 72% |
Ksibi et al., 2023 [92] | Depression | MODMA standardized EEG | 2104 validated recordings | Hybrid CNN–LSTM | 97% accuracy with 5-fold cross-validation |
Carter et al., 2023 [93] | Tuberculosis | Multi-institutional genetic database | 24,231 verified isolates | Enhanced Gradient-Boosted Trees | Sensitivity: 97.2%; Specificity: 63.1% on test set |
Swain et al., 2023 [94] | Chronic Kidney Disease | UCI CKD benchmark | 400 validated records | Optimized SVM with feature selection | 99.33% accuracy with independent test cohort |
Chen et al., 2023 [95] | Post-Stroke Depression | National EMR database | 76,826 longitudinal cases | XGBoost with temporal features | AUC-ROC: 71% with prospective validation |
Ahmed et al., 2023 [96] | Diabetes | UCI verified dataset | 520 complete cases | Ensemble: SVM-ANN fusion | 94.87% accuracy with external validation |
Andorra et al., 2023 [97] | Multiple Sclerosis | Multi-modal clinical data | 420 longitudinal cases | Random Forest with feature importance | Validated prediction of disease progression (AUC: 0.82) |
Solomon et al., 2023 [98] | Rheumatoid Arthritis | CorEvitas registry | 11,883 verified patients | Advanced clustering ensemble | Identified 5 distinct patient subgroups (p < 0.001) |
Kopitar et al., 2020 [99] | Type 2 Diabetes | Standardized EHR | 3723 prospective cases | Multi-algorithm ensemble | AUC: 0.859 with temporal validation |
Botlagunta et al., 2023 [100] | Breast Cancer | Validated EMR profiles | 26,800 cases | Enhanced Decision Tree | AUC: 0.87 with independent validation cohort |
Alamro et al., 2023 [101] | Alzheimer’s | GEO expression data | 445 validated samples | Hybrid CNN–DNN | AUC: 0.979 with novel biomarker validation |
Lee et al., 2023 [102] | Rheumatoid Arthritis | KORONA database | 2374 verified cases | Optimized XGBoost ensemble | AUC: 0.750; F1: 0.705 with external validation |
Algorithm Type | Disease Domain | Dataset Size | Primary Metric | Validation Method |
---|---|---|---|---|
VGG16–XGBoost Ensemble | Melanoma | 1416 images | 99.1% accuracy | Cross-fold validation |
Optimized Random Forest | Multiple Sclerosis | 7258 cases | AUC: 0.65 | External validation |
Hybrid CNN–LSTM | Depression | 2104 records | 97% accuracy | 5-fold cross-validation |
Gradient-Boosted Trees | Tuberculosis | 24,231 samples | Sensitivity: 97.2% | Independent test set |
SVM with Feature Selection | Chronic Kidney Disease | 400 records | 99.33% accuracy | Independent cohort |
XGBoost (Temporal) | Post-stroke Depression | 76,826 cases | AUC-ROC: 71% | Prospective validation |
SVM–ANN Fusion | Diabetes | 520 cases | 94.87% accuracy | External validation |
Feature-based Random Forest | Multiple Sclerosis | 420 cases | AUC: 0.82 | Disease progression validation |
Clustering Ensemble | Rheumatoid Arthritis | 11,883 patients | p < 0.001 | Statistical significance |
Multi-algorithm Ensemble | Type 2 Diabetes | 3723 cases | AUC: 0.859 | Temporal validation |
Enhanced Decision Tree | Breast Cancer | 26,800 cases | AUC: 0.87 | Independent cohort |
Hybrid CNN–DNN | Alzheimer’s | 445 samples | AUC: 0.979 | Biomarker validation |
XGBoost Ensemble | Rheumatoid Arthritis | 2374 cases | AUC: 0.750 | External validation |
Ref. | Disease Focus | Data Source | Data Size | DL Algorithm | Results |
---|---|---|---|---|---|
Helaly et al., 2022 [103] | Alzheimer’s Disease | ADNI standardized dataset | 48,000 validated images | Hierarchical CNN with VGG19 transfer learning | 97% accuracy with 5-fold cross-validation; sensitivity: 96.8%; specificity: 97.2% |
Abdusalomov et al., 2023 [104] | Brain Tumors | Multi-center MRI database | 10,288 annotated images | Enhanced YOLOv7 with attention mechanisms | 99.5% accuracy with external validation; precision: 99.3%; recall: 99.7% |
Forte et al., 2022 [105] | Lung Cancer | Multi-institutional CT data | Meta-analysis of 6 validated studies | Ensemble CNN architectures | Pooled sensitivity: 93%; specificity: 68%; AUC: 0.90 with heterogeneity analysis |
Zang et al., 2023 [106] | Multi-Retinal Diseases | Standardized OCT/OCTA | 526 validated volumes | Multi-Task Deep Learning | Disease-specific AUCs—DR: 0.95; AMD: 0.98; Glaucoma: 0.91 with independent testing |
Shoukat et al., 2023 [107] | Glaucoma | Curated retinal database | 2500 validated images | Modified ResNet-50 with attention | Accuracy: 98.48%; sensitivity: 99.30%; specificity: 96.52%, validated across populations |
Humayun et al., 2023 [108] | Breast Cancer | Histopathology database | 157,572 validated patches | Enhanced InceptionResNetV2 | 91% accuracy with external validation, ROC analysis included |
Choudhary et al., 2023 [109] | Retinal Diseases | Multi-center OCT data | 84,568 standardized images | Optimized VGG-19 with domain adaptation | Accuracy: 99.17%; sensitivity: 99.0%; specificity: 99.5%; cross-center validation |
Garcia-Ordas et al., 2023 [110] | Heart Disease | Integrated clinical data | 918 verified cases | Hybrid SAE–CNN architecture | 90.088% accuracy with independent cohort validation |
Houssein et al., 2023 [111] | Heart Disease | Structured EHR data | 10,000 validated records | Ensemble BERT models with clinical NLP | F1: 93.66%, validated across multiple institutions |
Sorour et al., 2024 [112] | Alzheimer’s Disease | Standardized MRI database | 6400 validated scans | Integrated CNN–LSTM with data augmentation | 99.92% accuracy; comprehensive ablation studies included |
Chen et al., 2024 [113] | Congenital Heart Disease | Multi-center ECG data | 65,869 validated cases | Novel CHDdECG architecture | ROC-AUC: 0.915 with external validation cohort |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Shakor, M.Y.; Khaleel, M.I. Recent Advances in Big Medical Image Data Analysis Through Deep Learning and Cloud Computing. Electronics 2024, 13, 4860. https://doi.org/10.3390/electronics13244860
Shakor MY, Khaleel MI. Recent Advances in Big Medical Image Data Analysis Through Deep Learning and Cloud Computing. Electronics. 2024; 13(24):4860. https://doi.org/10.3390/electronics13244860
Chicago/Turabian StyleShakor, Mohammed Y., and Mustafa Ibrahim Khaleel. 2024. "Recent Advances in Big Medical Image Data Analysis Through Deep Learning and Cloud Computing" Electronics 13, no. 24: 4860. https://doi.org/10.3390/electronics13244860
APA StyleShakor, M. Y., & Khaleel, M. I. (2024). Recent Advances in Big Medical Image Data Analysis Through Deep Learning and Cloud Computing. Electronics, 13(24), 4860. https://doi.org/10.3390/electronics13244860