1 Introduction
The
Internet of Things (
IoT), comprising smartphones, laptops, vehicles, and smartwatches, is ubiquitous and equipped with sensing and computing capabilities that enable accurate and effective data analysis and decision-making based on massive data and advanced models [
2].
Artificial Intelligence (
AI) disciplines, especially the field of
Machine Learning (
ML), have been rapidly growing and widely applied to enhance the performance of these devices and drive the evolution of related industries [
38,
87]. However, big-data-based applications bring significant risks and challenges, particularly in traditional centralized storage and computing approaches. The data collected by mobile devices and containing sensitive information is growing at an unprecedented rate, leading to a development bottleneck in cloud-based data processing.
Various approaches have been proposed to meet the requirements of new-generation data storage, data processing, and privacy protection. One such approach is
Federated Learning (
FL), a distributed ML approach introduced in 2016 by McMahan et al. [
80]. In the FL model, training data is kept locally on edge devices, instead of being uploaded to a central server. By only sharing the model parameters for aggregation, FL mitigates the risk of privacy leakage during raw training data transmission, relieves the burden of centralized data storage and computation, and aligns well with the IoT development trend. FL empowers devices to collaboratively learn a shared model while maintaining data locally, thereby circumventing the centralization of sensitive information and further addressing the concerns over data privacy and security in the IoT ecosystem [
13].
There is a growing focus on research in FL, recognizing the specific challenges and problems related to FL, such as heterogeneity and trust issues of the central server [
60,
152]. To address these concerns and further advance development, Blockchain technology [
86], which enables safe data storage and sharing, is introduced as an alternative to classical the central server of FL. Blockchain is a distributed and immutable ledger, consisting of blocks of data that are linked and secured using cryptography [
113]. It ensures data consistency, integrity and trustworthiness across Blockchain peers, fostering a secure environment for decentralized systems [
32]. The integration of FL and Blockchain technology can leverage their strengths and enable the training of distributed models in a secure and decentralized way. The advent of Blockchain technology as a complement to FL introduces an unprecedented level of security and trust. By decentralizing the management of model updates and data exchanges, Blockchain ensures that the learning process within FL is immutable and transparent [
97].
In this article, we explore the synergistic integration of FL and Blockchain technologies, commonly referred to as BlockFL, across various domains of the IoT. The IoT landscape is vast and diverse, encompassing a range of application areas, each with its unique challenges and requirements. To provide a structured and in-depth analysis, we focus on four specific application areas of IoT, selected for their distinct characteristics and the unique benefits they can derive from BlockFL technologies:
—
Personal Internet of Things (
PIoT): PIoT enhances the connectivity and automation of daily-use objects, using data from individual sensors and devices to drive personalization and convenience [
34]. The integration of BlockFL in PIoT is crucial for ensuring data privacy and security in personal applications [
58].
—
Industrial Internet of Things (
IIoT): IIoT is geared toward revolutionizing industrial processes through intelligent manufacturing and smart factories [
110]. In IIoT, BlockFL is instrumental in ensuring secure, efficient, and transparent industrial operations, enhancing productivity and process optimization [
53].
—
Internet of Vehicles (
IoV): IoV focuses on vehicle-related aspects of IoT, providing real-time traffic information and enhancing in-vehicle services [
21]. The role of BlockFL in IoV is vital for managing vast amounts of vehicular data securely and efficiently, improving transportation systems and vehicle-to-infrastructure communication [
108].
—
Internet of Health Things (
IoHT): IoHT connects patients and healthcare providers, utilizing biomedical sensors for improved healthcare services [
104]. The application of BlockFL in IoHT is paramount for safeguarding sensitive health data, ensuring data integrity, and facilitating secure health data exchange [
19].
By categorizing these IoT domains, we aim at highlighting the distinct challenges each faces and how the convergence of FL and Blockchain can offer tailored solutions. The criteria for selecting these domains include the sensitivity and volume of data involved, the criticality of data security and privacy, the need for efficient data processing, and the potential for enhancing overall system efficiency and user experience. This categorization allows for a focused examination of BlockFL’s role in addressing the unique needs of each domain, paving the way for innovative applications and advancements in IoT.
As shown in Figure
1, our work focuses on BlockFL tailored to various IoT applications with the collation and analysis of the latest research. In contrast, prior works like [
42,
88], and [
5] focus more on separate discussions of FL and Blockchain, while [
169] pays less attention to specific IoT scenarios, emphasizing theoretical analysis. In [
100], authors discuss Blockchain as a solution to existing FL issues, focusing more on how to optimize the performance of FL rather than discussing the development of BlockFL. Research in [
109] and [
43] are concerned with specific domains within IoT, and [
7] and [
128] only consider either Blockchain or FL aspects alone, which seem limited compared to our work. Our research highlights the role of BlockFL in security and privacy, trust and reliability, efficiency, and data diversity within four IoT domains: PIoT, IIoT, IoV, and IoHT. We analyze the distinct needs and challenges in those IoT domains, with the different development focuses of BlockFL under different application areas.
BlockFL has shown growing popularity and potential as a novel solution in recent years. Further survey work is necessary to synthesize current research and inform future developments. The four IoT domains we discussed cover a broad spectrum, addressing the primary concerns of relevant stakeholders and researchers. The challenges we pay attention to are the most mentioned in the current research, which can not be ignored in future applications and developments related to BlockFL. Issues of privacy and security are most frequently discussed in FL and Blockchain, hence critical in BlockFL. Trust and reliability are emerging as new focus areas with increasing system demands. Efficiency in learning and resource allocation is an ongoing challenge for BlockFL, especially in IoV scenarios, while addressing data diversity is crucial for practical applications in PIoT and IoHT. Our analyses provide targeted insights into the future development and optimization of BlockFL in different application scenarios, including enhancing security and privacy, building trust and reliability, improving efficiency, and addressing data diversity.
Moreover, our work discusses the potential integration of other learning frameworks, such as Split Learning, Transfer Learning, and Continuous Learning with BlockFL, which have not been explored in other articles. By leveraging the techniques of these learning frameworks, BlockFL can be further optimized in terms of efficiency and scalability, providing a more robust and feasible application across various IoT scenarios. This integration paves the way for tailored solutions that cater to specific needs within the diverse landscape of IoT applications, thereby enhancing the practical utility and implementation success of BlockFL models.
The key contributions of this article are summarized as follows:
—
We conduct a detailed analysis of BlockFL in four common scenarios, i.e., PIoT, IIoT, IoV, and IoHT, and highlight the challenges faced by BlockFL in these contexts. We also examine the advantages and disadvantages of BlockFL concerning these challenges comprehensively.
—
We present an overview of the relationship between BlockFL, FL, and Blockchain, and perform a comparative classification of existing BlockFL applications and features in various scenarios, focusing on four essential aspects: security and privacy, trust and reliability, efficiency, and data heterogeneity.
—
We analyze the common challenges and unique needs of BlockFL across different application domains and find that combining existing technologies (including cryptography, anomaly detection, compression techniques, and normalization) and enhancing the exploration of Blockchain components can drive the development of BlockFL.
Our analysis reveals that features of decentralization and transparency make BlockFL a secure and effective solution for distributed model training, while the overhead and compatibility still need further investigation for the fruition of BlockFL. Considering diverse application domains, our analysis also indicates that, besides the universal considerations of privacy protection, resource constraints and data heterogeneity, each domain presents unique challenges, e.g., the requirement of accommodating dynamic environments in IoV and the high demands of identity and permission management in IoHT. It is anticipated that this article can serve as an informative guide for future research efforts.
The rest of this article is organized as shown in Figure
2. Section
2 introduces the concepts and definitions of FL, Blockchain, and BlockFL. Section
3 describes the different application scenarios of BlockFL. Sections
4 –
7 illustrate the latest application BlockFL models focusing on these different scenarios. The most prominent features of each reference are highlighted to show their advantages and limitations. Section
8 summarizes the key lessons learned from the previous sections and puts forward future research directions. Finally, the conclusion of this article and suggestions for the follow-up works are presented in Section
9.
8 Lesson Learned and Open Challenges of BlockFL for IoT
The combination of FL and Blockchain has demonstrated significant potential in advancing next-generation digital developments. Through theoretical analysis and related experiments, existing studies confirm the application value of integrating BlockFL technologies in various fields. But the works are limited and remain largely theoretical. The throughout review of the BlockFL in IoT reveals several essential challenges and unresolved issues when considering the implementation and development of BlockFL.
This section will highlight potential future research directions for BlockFL, exploring both general open issues faced across all application domains, as well as domain-specific challenges.
8.1 General Challenges of BlockFL in Different Application Scenarios
In the 5G/6G era, BlockFL is set to thrive due to unprecedented data speeds, lower latency, and higher reliability [
106]. These advanced networks enhance real-time interactions between Blockchain nodes and FL participants, contributing to more efficient data exchange, model updating, and consensus. The increased data transmission speeds also aid in the quick synchronization of Blockchain ledgers and fast sharing of FL model updates, bolstering the scalability and efficiency of BlockFL systems. With the evolution of 5G and the advent of 6G technologies, however, BlockFL faces new challenges in maintaining efficiency and security. The ultra-low latency of these networks demands real-time data processing and decision-making in BlockFL systems [
20]. For instance, autonomous driving requires millisecond-precise decision-making utilizing the low latency of these networks. Additionally, the higher data throughput of 5G/6G networks increases resource demands, necessitating more efficient BlockFL algorithms [
152], especially in contexts like smart cities where vast sensor data needs to be managed without overwhelming network nodes.
As BlockFL navigates these technological advancements, it also encounters universal challenges in IoT scenarios, underscoring the need for tailored solutions in various applications. While each considered IoT scenario has its unique characteristics, they share foundational challenges in data security, privacy, resource limitation, scalability, and data diversity. All scenarios involve data collection, transmission, and processing, often with sensitive information, necessitating robust security and evolving privacy protection [
9]. The limited computing power and energy resources of IoT devices pose scalability challenges as the IoT ecosystem expands. Moreover, data diversity in terms of device capabilities and communication protocols requires intelligent processing techniques for efficient BlockFL deployment [
40]. These challenges vary significantly across scenarios. For example, IoHT involves health-related data requiring strict compliance with healthcare regulations [
49], while IIoT might focus more on protecting industrial processes and proprietary information [
78]. The degree of resource limitation and scalability requirements can differ. PIoT devices might be more constrained in terms of battery life and processing power [
114], whereas IIoT settings might prioritize the scalability of solutions across vast industrial networks with varying computational capacities [
142]. The type and level of heterogeneity can vary widely; IoV deals with mobility-related data and connectivity challenges unique to vehicular networks [
84], while IIoT must accommodate a wide range of industrial equipment and operational technologies.
Table
9 summarizes potential technologies for future BlockFL development. In terms of security enhancement and privacy protection, the integration of encryption and secure computing technologies has been effective [
10,
121,
150]. Combining BlockFL with various encryption algorithms and noise addition methods [
154,
160] or multi-party security technologies can further enhance Blockchain-based FL models. Additionally, data processing techniques like compression [
44,
69] and normalization [
54], along with smart contracts [
76] and sharding mechanisms [
151,
165], are suggested to address resource limitations and data heterogeneity, enhancing the capabilities of BlockFL.
Furthermore, ongoing research is needed to delve deeper into combining Blockchain and FL. Current BlockFL models mainly employ Blockchain for aggregation in the FL process, with less emphasis on enhancing Blockchain through FL. Intermediate results in BlockFL, such as the quality of local models, could be utilized in the consensus calculations of Blockchain [
70], thus reducing the costs of computational and communication resources. Exploring new consensus methods and smart contract technologies based on BlockFL presents promising avenues for further development. This research should also address issues typically associated with Blockchain, such as the collusion of the miners and the challenges of hybrid-Blockchain structures [
134,
157].
8.2 Unique Challenges of BlockFL in Different Application Scenarios
The integration of BlockFL across different IoT applications presents unique challenges due to the distinct nature of each field. Table
10 summarizes these challenges and their solutions. In PIoT, the complexity arising from personalized smart services is addressed using transfer and split learning technologies to enhance personalization and privacy. IIoT leverages transfer learning to improve intelligent collaboration, boosting efficiency and cutting costs. For IoV, high latency is tackled through online and continuous learning technologies, enhancing safety and traffic flow. In IoHT, the focus is on handling sensitive medical data securely, utilizing consortium Blockchains and split learning for secure data handling and collaborative research. This table encapsulates the diverse challenges and innovative solutions tailored to the unique needs of different IoT scenarios.
In large-scale PIoT and IIoT, collaborative intelligence and meeting personalized needs have emerged as a new research direction. As existing studies have focused on optimizing a single task, the increasing demand for multitasking collaboration and cooperation requires a complex system to analyze and coordinate the relationship and connection of multitasking and multi-objective. To enable intelligent coordination, the models should explore transfer learning technology and other related novel technologies in combination with BlockFL models. In particular, the process of industrialization requires more consideration of the costs of large-scale implementation.
In IoV, ensuring the stability of the system in high-speed movement is a crucial research direction. A large number of vehicles in the IoV application scenarios are constantly moving at high speeds and changing positions in real time, which poses a considerable challenge to the stability and reliability of the network and connection. To address this issue, researchers can increase the calculation effectiveness, reduce model delay, and consider optimizing and innovating BlockFL models by imitating online learning algorithms. Moreover, future vehicles in 6G are expected to support cross-domain communication across the ground, underwater and air [
33], so stability in combination with new devices and technologies, such as over-the-air computing, should also be taken into account [
168].
In IoHT, permission and identity management of participants are critical challenges due to the high sensitivity of medical data. Consortium Blockchains are more suitable for implementation in IoHT, with the high professional knowledge required by participants to analyze and manage medical-related data. The involvement of medical organizations can make the management of IoHT models highly controllable and convenient, and multiple participation can improve the accuracy and other performance of IoHT models. Thus, researchers should explore how to incentivize participation in BlockFL while considering the problem of membership management.
8.3 Other Learning Framework as Solution for BlockFL
In addressing future research on BlockFL, it is imperative to broaden the scope to include various frameworks of distributed learning, such as split learning, transfer learning, and continuous learning. This expansion is crucial for offering a holistic view of the distributed learning landscape, enabling the identification of synergies and potential integrations that could further enhance the capabilities and applications of BlockFL across diverse IoT scenarios.
Integrating learning frameworks like Split Learning, Transfer Learning, and Continuous Learning with BlockFL offers the potential to leverage the strengths of both FL and Blockchain technologies while addressing their respective weaknesses. By doing so, we can create a learning ecosystem that is robust against data privacy issues, adaptable to new data, and capable of continuous improvement without centralized data storage.
Split Learning is a distributed learning framework that divides a neural network model between client and server sides [
105]. Clients compute with local data and send intermediate results to a server for further processing. This method, combined with BlockFL, enhances privacy and efficiency. Clients handle initial training stages and only transfer intermediate results to a Blockchain-based server, which aggregates them securely and updates the Blockchain ledger with the enhanced model. This integration with BlockFL offers scalable, privacy-preserving solutions in IoT environments, benefiting from the security and transparency of Blockchain.
Transfer Learning applies knowledge from one domain to solve problems in another and is particularly useful in PIoT and IIoT within BlockFL contexts [
138]. By incorporating it into BlockFL, pre-trained models on Blockchain nodes can be refined by new participants using their data, thus reducing training time while maintaining privacy. This approach also facilitates cross-domain applications, allowing knowledge transfer from one IoT sector to another (e.g., from IoHT to IoV), accelerating intelligent system deployment across varied IoT ecosystems.
Continuous Learning focuses on systems that learn and evolve over time, accumulating and adapting to new data while retaining previous knowledge [
90]. Incorporating this into the BlockFL framework could enhance adaptability. The approach involves regularly updating Blockchain models with insights from client data. Clients contribute to ongoing learning, facilitating continuous model evolution, which ensures data integrity and lineage for auditing purposes. In dynamic IoT settings, this enables BlockFL systems to adapt to new patterns and changes, maintaining relevance and effectiveness in applications such as IoV and IoHT, where continuous learning and updating are essential.