Addressing the batch effect issue for LC/MS metabolomics data in data preprocessing
Abstract With the growth of metabolomics research, more and more studies are conducted on large numbers of samples. Due to technical limitations of the Liquid Chromatography–Mass Spectrometry (LC/MS) platform, samples often need to be processed in multiple batches. Across different batches, we often...
Guardado en:
Autores principales: | , , , , , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Nature Portfolio
2020
|
Materias: | |
Acceso en línea: | https://doaj.org/article/b7e4a35c4a4445a4849e3d300e5b7365 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:b7e4a35c4a4445a4849e3d300e5b7365 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:b7e4a35c4a4445a4849e3d300e5b73652021-12-02T18:51:41ZAddressing the batch effect issue for LC/MS metabolomics data in data preprocessing10.1038/s41598-020-70850-02045-2322https://doaj.org/article/b7e4a35c4a4445a4849e3d300e5b73652020-08-01T00:00:00Zhttps://doi.org/10.1038/s41598-020-70850-0https://doaj.org/toc/2045-2322Abstract With the growth of metabolomics research, more and more studies are conducted on large numbers of samples. Due to technical limitations of the Liquid Chromatography–Mass Spectrometry (LC/MS) platform, samples often need to be processed in multiple batches. Across different batches, we often observe differences in data characteristics. In this work, we specifically focus on data generated in multiple batches on the same LC/MS machinery. Traditional preprocessing methods treat all samples as a single group. Such practice can result in errors in the alignment of peaks, which cannot be corrected by post hoc application of batch effect correction methods. In this work, we developed a new approach that address the batch effect issue in the preprocessing stage, resulting in better peak detection, alignment and quantification. It can be combined with down-stream batch effect correction methods to further correct for between-batch intensity differences. The method is implemented in the existing workflow of the apLCMS platform. Analyzing data with multiple batches, both generated from standardized quality control (QC) plasma samples and from real biological studies, the new method resulted in feature tables with better consistency, as well as better down-stream analysis results. The method can be a useful addition to the tools available for large studies involving multiple batches. The method is available as part of the apLCMS package. Download link and instructions are at https://mypage.cuhk.edu.cn/academics/yutianwei/apLCMS/ .Qin LiuDouglas WalkerKaran UppalZihe LiuChunyu MaViLinh TranShuzhao LiDean P. JonesTianwei YuNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 10, Iss 1, Pp 1-13 (2020) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Medicine R Science Q |
spellingShingle |
Medicine R Science Q Qin Liu Douglas Walker Karan Uppal Zihe Liu Chunyu Ma ViLinh Tran Shuzhao Li Dean P. Jones Tianwei Yu Addressing the batch effect issue for LC/MS metabolomics data in data preprocessing |
description |
Abstract With the growth of metabolomics research, more and more studies are conducted on large numbers of samples. Due to technical limitations of the Liquid Chromatography–Mass Spectrometry (LC/MS) platform, samples often need to be processed in multiple batches. Across different batches, we often observe differences in data characteristics. In this work, we specifically focus on data generated in multiple batches on the same LC/MS machinery. Traditional preprocessing methods treat all samples as a single group. Such practice can result in errors in the alignment of peaks, which cannot be corrected by post hoc application of batch effect correction methods. In this work, we developed a new approach that address the batch effect issue in the preprocessing stage, resulting in better peak detection, alignment and quantification. It can be combined with down-stream batch effect correction methods to further correct for between-batch intensity differences. The method is implemented in the existing workflow of the apLCMS platform. Analyzing data with multiple batches, both generated from standardized quality control (QC) plasma samples and from real biological studies, the new method resulted in feature tables with better consistency, as well as better down-stream analysis results. The method can be a useful addition to the tools available for large studies involving multiple batches. The method is available as part of the apLCMS package. Download link and instructions are at https://mypage.cuhk.edu.cn/academics/yutianwei/apLCMS/ . |
format |
article |
author |
Qin Liu Douglas Walker Karan Uppal Zihe Liu Chunyu Ma ViLinh Tran Shuzhao Li Dean P. Jones Tianwei Yu |
author_facet |
Qin Liu Douglas Walker Karan Uppal Zihe Liu Chunyu Ma ViLinh Tran Shuzhao Li Dean P. Jones Tianwei Yu |
author_sort |
Qin Liu |
title |
Addressing the batch effect issue for LC/MS metabolomics data in data preprocessing |
title_short |
Addressing the batch effect issue for LC/MS metabolomics data in data preprocessing |
title_full |
Addressing the batch effect issue for LC/MS metabolomics data in data preprocessing |
title_fullStr |
Addressing the batch effect issue for LC/MS metabolomics data in data preprocessing |
title_full_unstemmed |
Addressing the batch effect issue for LC/MS metabolomics data in data preprocessing |
title_sort |
addressing the batch effect issue for lc/ms metabolomics data in data preprocessing |
publisher |
Nature Portfolio |
publishDate |
2020 |
url |
https://doaj.org/article/b7e4a35c4a4445a4849e3d300e5b7365 |
work_keys_str_mv |
AT qinliu addressingthebatcheffectissueforlcmsmetabolomicsdataindatapreprocessing AT douglaswalker addressingthebatcheffectissueforlcmsmetabolomicsdataindatapreprocessing AT karanuppal addressingthebatcheffectissueforlcmsmetabolomicsdataindatapreprocessing AT ziheliu addressingthebatcheffectissueforlcmsmetabolomicsdataindatapreprocessing AT chunyuma addressingthebatcheffectissueforlcmsmetabolomicsdataindatapreprocessing AT vilinhtran addressingthebatcheffectissueforlcmsmetabolomicsdataindatapreprocessing AT shuzhaoli addressingthebatcheffectissueforlcmsmetabolomicsdataindatapreprocessing AT deanpjones addressingthebatcheffectissueforlcmsmetabolomicsdataindatapreprocessing AT tianweiyu addressingthebatcheffectissueforlcmsmetabolomicsdataindatapreprocessing |
_version_ |
1718377407397756928 |