Global repair bandwidth cost optimization of generalized regenerating codes in clustered distributed storage systems

Abstract In clustered distributed storage systems (CDSSs), one of the main design goals is minimizing the transmission cost during the failed storage nodes repairing. Generalized regenerating codes (GRCs) are proposed to balance the intra‐cluster repair bandwidth and the inter‐cluster repair bandwid...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Shushi Gu, Fugang Wang, Qinyu Zhang, Tao Huang, Wei Xiang
Formato: article
Lenguaje:EN
Publicado: Wiley 2021
Materias:
Acceso en línea:https://doaj.org/article/dc273746940a43fc92c67d4404cc731f
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:dc273746940a43fc92c67d4404cc731f
record_format dspace
spelling oai:doaj.org-article:dc273746940a43fc92c67d4404cc731f2021-11-09T04:19:38ZGlobal repair bandwidth cost optimization of generalized regenerating codes in clustered distributed storage systems1751-86361751-862810.1049/cmu2.12289https://doaj.org/article/dc273746940a43fc92c67d4404cc731f2021-12-01T00:00:00Zhttps://doi.org/10.1049/cmu2.12289https://doaj.org/toc/1751-8628https://doaj.org/toc/1751-8636Abstract In clustered distributed storage systems (CDSSs), one of the main design goals is minimizing the transmission cost during the failed storage nodes repairing. Generalized regenerating codes (GRCs) are proposed to balance the intra‐cluster repair bandwidth and the inter‐cluster repair bandwidth for guaranteeing data availability. The trade‐off performance of GRCs illustrates that, it can reduce storage overhead and inter‐cluster repair bandwidths simultaneously. However, in practical big data storage scenarios, GRCs cannot give an effective solution to handle the heterogeneity of bandwidth costs among different clusters for node failures recovery. This paper proposes an asymmetric bandwidth allocation strategy (ABAS) of GRCs for the inter‐cluster repair in heterogeneous CDSSs. Furthermore, an upper bound of the achievable capacity of ABAS is derived based on the information flow graph (IFG), and the constraints of storage capacity and intra‐cluster repair bandwidth are also elaborated. Then, a metric termed global repair bandwidth cost (GRBC), which can be minimized regarding of the inter‐cluster repair bandwidths by solving a linear programming problem, is defined. The numerical results demonstrate that, maintaining the same data availability and storage overhead, the proposed ABAS of GRCs can effectively reduce the GRBC compared to the traditional symmetric bandwidth allocation schemes.Shushi GuFugang WangQinyu ZhangTao HuangWei XiangWileyarticleTelecommunicationTK5101-6720ENIET Communications, Vol 15, Iss 19, Pp 2469-2481 (2021)
institution DOAJ
collection DOAJ
language EN
topic Telecommunication
TK5101-6720
spellingShingle Telecommunication
TK5101-6720
Shushi Gu
Fugang Wang
Qinyu Zhang
Tao Huang
Wei Xiang
Global repair bandwidth cost optimization of generalized regenerating codes in clustered distributed storage systems
description Abstract In clustered distributed storage systems (CDSSs), one of the main design goals is minimizing the transmission cost during the failed storage nodes repairing. Generalized regenerating codes (GRCs) are proposed to balance the intra‐cluster repair bandwidth and the inter‐cluster repair bandwidth for guaranteeing data availability. The trade‐off performance of GRCs illustrates that, it can reduce storage overhead and inter‐cluster repair bandwidths simultaneously. However, in practical big data storage scenarios, GRCs cannot give an effective solution to handle the heterogeneity of bandwidth costs among different clusters for node failures recovery. This paper proposes an asymmetric bandwidth allocation strategy (ABAS) of GRCs for the inter‐cluster repair in heterogeneous CDSSs. Furthermore, an upper bound of the achievable capacity of ABAS is derived based on the information flow graph (IFG), and the constraints of storage capacity and intra‐cluster repair bandwidth are also elaborated. Then, a metric termed global repair bandwidth cost (GRBC), which can be minimized regarding of the inter‐cluster repair bandwidths by solving a linear programming problem, is defined. The numerical results demonstrate that, maintaining the same data availability and storage overhead, the proposed ABAS of GRCs can effectively reduce the GRBC compared to the traditional symmetric bandwidth allocation schemes.
format article
author Shushi Gu
Fugang Wang
Qinyu Zhang
Tao Huang
Wei Xiang
author_facet Shushi Gu
Fugang Wang
Qinyu Zhang
Tao Huang
Wei Xiang
author_sort Shushi Gu
title Global repair bandwidth cost optimization of generalized regenerating codes in clustered distributed storage systems
title_short Global repair bandwidth cost optimization of generalized regenerating codes in clustered distributed storage systems
title_full Global repair bandwidth cost optimization of generalized regenerating codes in clustered distributed storage systems
title_fullStr Global repair bandwidth cost optimization of generalized regenerating codes in clustered distributed storage systems
title_full_unstemmed Global repair bandwidth cost optimization of generalized regenerating codes in clustered distributed storage systems
title_sort global repair bandwidth cost optimization of generalized regenerating codes in clustered distributed storage systems
publisher Wiley
publishDate 2021
url https://doaj.org/article/dc273746940a43fc92c67d4404cc731f
work_keys_str_mv AT shushigu globalrepairbandwidthcostoptimizationofgeneralizedregeneratingcodesinclustereddistributedstoragesystems
AT fugangwang globalrepairbandwidthcostoptimizationofgeneralizedregeneratingcodesinclustereddistributedstoragesystems
AT qinyuzhang globalrepairbandwidthcostoptimizationofgeneralizedregeneratingcodesinclustereddistributedstoragesystems
AT taohuang globalrepairbandwidthcostoptimizationofgeneralizedregeneratingcodesinclustereddistributedstoragesystems
AT weixiang globalrepairbandwidthcostoptimizationofgeneralizedregeneratingcodesinclustereddistributedstoragesystems
_version_ 1718441339947843584