Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) [version 1; peer review: 2 approved]

Background: As the standard for the exchange of data over the World Wide Web, it is important to ensure that the eXtensible Markup Language (XML) database is capable of supporting not only efficient query processing but also capable of enduring frequent data update operations over the dynamic change...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Su-Cheng Haw, Aisyah Amin, Chee-Onn Wong, Samini Subramaniam
Formato: article
Lenguaje:EN
Publicado: F1000 Research Ltd 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/1f1b1f7a22e546f7b94dd98987b0995f
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:1f1b1f7a22e546f7b94dd98987b0995f
record_format dspace
spelling oai:doaj.org-article:1f1b1f7a22e546f7b94dd98987b0995f2021-11-22T12:41:06ZImproving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) [version 1; peer review: 2 approved]2046-140210.12688/f1000research.69108.1https://doaj.org/article/1f1b1f7a22e546f7b94dd98987b0995f2021-09-01T00:00:00Zhttps://f1000research.com/articles/10-907/v1https://doaj.org/toc/2046-1402Background: As the standard for the exchange of data over the World Wide Web, it is important to ensure that the eXtensible Markup Language (XML) database is capable of supporting not only efficient query processing but also capable of enduring frequent data update operations over the dynamic changes of Web content. Most of the existing XML annotation is based on a labeling scheme to identify each hierarchical position of the XML nodes. This computation is costly as any updates will cause the whole XML tree to be re-labelled. This impact can be observed on large datasets. Therefore, a robust labeling scheme that avoids re-labeling is crucial. Method: Here, we present ORD-GAP (named after Order Gap), a robust and persistent XML labeling scheme that supports dynamic updates. ORD-GAP assigns unique identifiers with gaps in-between XML nodes, which could easily identify the level, Parent-Child (P-C), Ancestor-Descendant (A-D) and sibling relationship. ORD-GAP adopts the OrdPath labeling scheme for any future insertion. Results: We demonstrate that ORD-GAP is robust enough for dynamic updates, and have implemented it in three use cases: (i) left-most, (ii) in-between and (iii) right-most insertion. Experimental evaluations on DBLP dataset demonstrated that ORD-GAP outperformed existing approaches such as ORDPath and ME Labeling concerning database storage size, data loading time and query retrieval. On average, ORD-GAP has the best storing and query retrieval time. Conclusion: The main contributions of this paper are: (i) A robust labeling scheme named ORD-GAP that assigns certain gap between each node to support future insertion, and (ii) An efficient mapping scheme, which built upon ORD-GAP labeling scheme to transform XML into RDB effectively.Su-Cheng HawAisyah AminChee-Onn WongSamini SubramaniamF1000 Research LtdarticleMedicineRScienceQENF1000Research, Vol 10 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Su-Cheng Haw
Aisyah Amin
Chee-Onn Wong
Samini Subramaniam
Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) [version 1; peer review: 2 approved]
description Background: As the standard for the exchange of data over the World Wide Web, it is important to ensure that the eXtensible Markup Language (XML) database is capable of supporting not only efficient query processing but also capable of enduring frequent data update operations over the dynamic changes of Web content. Most of the existing XML annotation is based on a labeling scheme to identify each hierarchical position of the XML nodes. This computation is costly as any updates will cause the whole XML tree to be re-labelled. This impact can be observed on large datasets. Therefore, a robust labeling scheme that avoids re-labeling is crucial. Method: Here, we present ORD-GAP (named after Order Gap), a robust and persistent XML labeling scheme that supports dynamic updates. ORD-GAP assigns unique identifiers with gaps in-between XML nodes, which could easily identify the level, Parent-Child (P-C), Ancestor-Descendant (A-D) and sibling relationship. ORD-GAP adopts the OrdPath labeling scheme for any future insertion. Results: We demonstrate that ORD-GAP is robust enough for dynamic updates, and have implemented it in three use cases: (i) left-most, (ii) in-between and (iii) right-most insertion. Experimental evaluations on DBLP dataset demonstrated that ORD-GAP outperformed existing approaches such as ORDPath and ME Labeling concerning database storage size, data loading time and query retrieval. On average, ORD-GAP has the best storing and query retrieval time. Conclusion: The main contributions of this paper are: (i) A robust labeling scheme named ORD-GAP that assigns certain gap between each node to support future insertion, and (ii) An efficient mapping scheme, which built upon ORD-GAP labeling scheme to transform XML into RDB effectively.
format article
author Su-Cheng Haw
Aisyah Amin
Chee-Onn Wong
Samini Subramaniam
author_facet Su-Cheng Haw
Aisyah Amin
Chee-Onn Wong
Samini Subramaniam
author_sort Su-Cheng Haw
title Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) [version 1; peer review: 2 approved]
title_short Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) [version 1; peer review: 2 approved]
title_full Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) [version 1; peer review: 2 approved]
title_fullStr Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) [version 1; peer review: 2 approved]
title_full_unstemmed Improving the support for XML dynamic updates using a hybridization labeling scheme (ORD-GAP) [version 1; peer review: 2 approved]
title_sort improving the support for xml dynamic updates using a hybridization labeling scheme (ord-gap) [version 1; peer review: 2 approved]
publisher F1000 Research Ltd
publishDate 2021
url https://doaj.org/article/1f1b1f7a22e546f7b94dd98987b0995f
work_keys_str_mv AT suchenghaw improvingthesupportforxmldynamicupdatesusingahybridizationlabelingschemeordgapversion1peerreview2approved
AT aisyahamin improvingthesupportforxmldynamicupdatesusingahybridizationlabelingschemeordgapversion1peerreview2approved
AT cheeonnwong improvingthesupportforxmldynamicupdatesusingahybridizationlabelingschemeordgapversion1peerreview2approved
AT saminisubramaniam improvingthesupportforxmldynamicupdatesusingahybridizationlabelingschemeordgapversion1peerreview2approved
_version_ 1718417598221123584