KPIs-Based Clustering and Visualization of HPC Jobs: A Feature Reduction Approach
High-Performance Computing (HPC) systems need to be constantly monitored to ensure their stability. The monitoring systems collect a tremendous amount of data about different parameters or Key Performance Indicators (KPIs), such as resource usage, IO waiting time, etc. A proper analysis of this data...
Guardado en:
Autores principales: | , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
IEEE
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/15158ac5b42c43208be23fbca09f6197 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:15158ac5b42c43208be23fbca09f6197 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:15158ac5b42c43208be23fbca09f61972021-11-19T00:06:31ZKPIs-Based Clustering and Visualization of HPC Jobs: A Feature Reduction Approach2169-353610.1109/ACCESS.2021.3057427https://doaj.org/article/15158ac5b42c43208be23fbca09f61972021-01-01T00:00:00Zhttps://ieeexplore.ieee.org/document/9348884/https://doaj.org/toc/2169-3536High-Performance Computing (HPC) systems need to be constantly monitored to ensure their stability. The monitoring systems collect a tremendous amount of data about different parameters or Key Performance Indicators (KPIs), such as resource usage, IO waiting time, etc. A proper analysis of this data, usually stored as time series, can provide insight in choosing the right management strategies as well as the early detection of issues. In this paper, we introduce a methodology to cluster HPC jobs according to their KPI indicators. Our approach reduces the inherent high dimensionality of the collected data by applying two techniques to the time series: literature-based and variance-based feature extraction. We also define a procedure to visualize the obtained clusters by combining the two previous approaches and the Principal Component Analysis (PCA). Finally, we have validated our contributions on a real data set to conclude that those KPIs related to CPU usage provide the best cohesion and separation for clustering analysis and the good results of our visualization methodology.Mohamed Soliman HalawaRebeca P. Diaz RedondoAna Fernandez VilasIEEEarticleClusteringfeature extractionhigh-performance computingtime series analysisElectrical engineering. Electronics. Nuclear engineeringTK1-9971ENIEEE Access, Vol 9, Pp 25522-25543 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Clustering feature extraction high-performance computing time series analysis Electrical engineering. Electronics. Nuclear engineering TK1-9971 |
spellingShingle |
Clustering feature extraction high-performance computing time series analysis Electrical engineering. Electronics. Nuclear engineering TK1-9971 Mohamed Soliman Halawa Rebeca P. Diaz Redondo Ana Fernandez Vilas KPIs-Based Clustering and Visualization of HPC Jobs: A Feature Reduction Approach |
description |
High-Performance Computing (HPC) systems need to be constantly monitored to ensure their stability. The monitoring systems collect a tremendous amount of data about different parameters or Key Performance Indicators (KPIs), such as resource usage, IO waiting time, etc. A proper analysis of this data, usually stored as time series, can provide insight in choosing the right management strategies as well as the early detection of issues. In this paper, we introduce a methodology to cluster HPC jobs according to their KPI indicators. Our approach reduces the inherent high dimensionality of the collected data by applying two techniques to the time series: literature-based and variance-based feature extraction. We also define a procedure to visualize the obtained clusters by combining the two previous approaches and the Principal Component Analysis (PCA). Finally, we have validated our contributions on a real data set to conclude that those KPIs related to CPU usage provide the best cohesion and separation for clustering analysis and the good results of our visualization methodology. |
format |
article |
author |
Mohamed Soliman Halawa Rebeca P. Diaz Redondo Ana Fernandez Vilas |
author_facet |
Mohamed Soliman Halawa Rebeca P. Diaz Redondo Ana Fernandez Vilas |
author_sort |
Mohamed Soliman Halawa |
title |
KPIs-Based Clustering and Visualization of HPC Jobs: A Feature Reduction Approach |
title_short |
KPIs-Based Clustering and Visualization of HPC Jobs: A Feature Reduction Approach |
title_full |
KPIs-Based Clustering and Visualization of HPC Jobs: A Feature Reduction Approach |
title_fullStr |
KPIs-Based Clustering and Visualization of HPC Jobs: A Feature Reduction Approach |
title_full_unstemmed |
KPIs-Based Clustering and Visualization of HPC Jobs: A Feature Reduction Approach |
title_sort |
kpis-based clustering and visualization of hpc jobs: a feature reduction approach |
publisher |
IEEE |
publishDate |
2021 |
url |
https://doaj.org/article/15158ac5b42c43208be23fbca09f6197 |
work_keys_str_mv |
AT mohamedsolimanhalawa kpisbasedclusteringandvisualizationofhpcjobsafeaturereductionapproach AT rebecapdiazredondo kpisbasedclusteringandvisualizationofhpcjobsafeaturereductionapproach AT anafernandezvilas kpisbasedclusteringandvisualizationofhpcjobsafeaturereductionapproach |
_version_ |
1718420594388631552 |