FaTEMa: A Framework for Multi-Layer Fault Tolerance in IoT Systems

Fault tolerance in IoT systems is challenging to overcome due to its complexity, dynamicity, and heterogeneity. IoT systems are typically designed and constructed in layers. Every layer has its requirements and fault tolerance strategies. However, errors in one layer can propagate and cause effects...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Mário Melo, Gibeon Aquino
Formato: article
Lenguaje:EN
Publicado: MDPI AG 2021
Materias:
IoT
Acceso en línea:https://doaj.org/article/61c90f19ecba4f47989eaad8dd8f4979
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:61c90f19ecba4f47989eaad8dd8f4979
record_format dspace
spelling oai:doaj.org-article:61c90f19ecba4f47989eaad8dd8f49792021-11-11T19:10:21ZFaTEMa: A Framework for Multi-Layer Fault Tolerance in IoT Systems10.3390/s212171811424-8220https://doaj.org/article/61c90f19ecba4f47989eaad8dd8f49792021-10-01T00:00:00Zhttps://www.mdpi.com/1424-8220/21/21/7181https://doaj.org/toc/1424-8220Fault tolerance in IoT systems is challenging to overcome due to its complexity, dynamicity, and heterogeneity. IoT systems are typically designed and constructed in layers. Every layer has its requirements and fault tolerance strategies. However, errors in one layer can propagate and cause effects on others. Thus, it is impractical to consider a centralized fault tolerance approach for an entire system. Consequently, it is vital to consider multiple layers in order to enable collaboration and information exchange when addressing fault tolerance. The purpose of this study is to propose a multi-layer fault tolerance approach, granting interconnection among IoT system layers, allowing information exchange and collaboration in order to attain the property of dependability. Therefore, we define an event-driven framework called FaTEMa (Fault Tolerance Event Manager) that creates a dedicated fault-related communication channel in order to propagate events across the levels of the system. The implemented framework assist with error detection and continued service. Additionally, it offers extension points to support heterogeneous communication protocols and evolve new capabilities. Our empirical results show that introducing FaTEMa provided improvements to the error detection and error resolution time, consequently improving system availability. In addition, the use of Fatema provided a reliability improvement and a reduction in the number of failures produced.Mário MeloGibeon AquinoMDPI AGarticleIoTdependabilityfault toleranceerror detectionerror recoveryreliabilityChemical technologyTP1-1185ENSensors, Vol 21, Iss 7181, p 7181 (2021)
institution DOAJ
collection DOAJ
language EN
topic IoT
dependability
fault tolerance
error detection
error recovery
reliability
Chemical technology
TP1-1185
spellingShingle IoT
dependability
fault tolerance
error detection
error recovery
reliability
Chemical technology
TP1-1185
Mário Melo
Gibeon Aquino
FaTEMa: A Framework for Multi-Layer Fault Tolerance in IoT Systems
description Fault tolerance in IoT systems is challenging to overcome due to its complexity, dynamicity, and heterogeneity. IoT systems are typically designed and constructed in layers. Every layer has its requirements and fault tolerance strategies. However, errors in one layer can propagate and cause effects on others. Thus, it is impractical to consider a centralized fault tolerance approach for an entire system. Consequently, it is vital to consider multiple layers in order to enable collaboration and information exchange when addressing fault tolerance. The purpose of this study is to propose a multi-layer fault tolerance approach, granting interconnection among IoT system layers, allowing information exchange and collaboration in order to attain the property of dependability. Therefore, we define an event-driven framework called FaTEMa (Fault Tolerance Event Manager) that creates a dedicated fault-related communication channel in order to propagate events across the levels of the system. The implemented framework assist with error detection and continued service. Additionally, it offers extension points to support heterogeneous communication protocols and evolve new capabilities. Our empirical results show that introducing FaTEMa provided improvements to the error detection and error resolution time, consequently improving system availability. In addition, the use of Fatema provided a reliability improvement and a reduction in the number of failures produced.
format article
author Mário Melo
Gibeon Aquino
author_facet Mário Melo
Gibeon Aquino
author_sort Mário Melo
title FaTEMa: A Framework for Multi-Layer Fault Tolerance in IoT Systems
title_short FaTEMa: A Framework for Multi-Layer Fault Tolerance in IoT Systems
title_full FaTEMa: A Framework for Multi-Layer Fault Tolerance in IoT Systems
title_fullStr FaTEMa: A Framework for Multi-Layer Fault Tolerance in IoT Systems
title_full_unstemmed FaTEMa: A Framework for Multi-Layer Fault Tolerance in IoT Systems
title_sort fatema: a framework for multi-layer fault tolerance in iot systems
publisher MDPI AG
publishDate 2021
url https://doaj.org/article/61c90f19ecba4f47989eaad8dd8f4979
work_keys_str_mv AT mariomelo fatemaaframeworkformultilayerfaulttoleranceiniotsystems
AT gibeonaquino fatemaaframeworkformultilayerfaulttoleranceiniotsystems
_version_ 1718431587322822656