BreakNet: detecting deletions using long reads and a deep learning approach

Abstract Background Structural variations (SVs) occupy a prominent position in human genetic diversity, and deletions form an important type of SV that has been suggested to be associated with genetic diseases. Although various deletion calling methods based on long reads have been proposed, a new a...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Junwei Luo, Hongyu Ding, Jiquan Shen, Haixia Zhai, Zhengjiang Wu, Chaokun Yan, Huimin Luo
Formato: article
Lenguaje:EN
Publicado: BMC 2021
Materias:
Acceso en línea:https://doaj.org/article/9283c944fedc46f89e94543b6169d296
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:9283c944fedc46f89e94543b6169d296
record_format dspace
spelling oai:doaj.org-article:9283c944fedc46f89e94543b6169d2962021-12-05T12:08:41ZBreakNet: detecting deletions using long reads and a deep learning approach10.1186/s12859-021-04499-51471-2105https://doaj.org/article/9283c944fedc46f89e94543b6169d2962021-12-01T00:00:00Zhttps://doi.org/10.1186/s12859-021-04499-5https://doaj.org/toc/1471-2105Abstract Background Structural variations (SVs) occupy a prominent position in human genetic diversity, and deletions form an important type of SV that has been suggested to be associated with genetic diseases. Although various deletion calling methods based on long reads have been proposed, a new approach is still needed to mine features in long-read alignment information. Recently, deep learning has attracted much attention in genome analysis, and it is a promising technique for calling SVs. Results In this paper, we propose BreakNet, a deep learning method that detects deletions by using long reads. BreakNet first extracts feature matrices from long-read alignments. Second, it uses a time-distributed convolutional neural network (CNN) to integrate and map the feature matrices to feature vectors. Third, BreakNet employs a bidirectional long short-term memory (BLSTM) model to analyse the produced set of continuous feature vectors in both the forward and backward directions. Finally, a classification module determines whether a region refers to a deletion. On real long-read sequencing datasets, we demonstrate that BreakNet outperforms Sniffles, SVIM and cuteSV in terms of their F1 scores. The source code for the proposed method is available from GitHub at https://github.com/luojunwei/BreakNet . Conclusions Our work shows that deep learning can be combined with long reads to call deletions more effectively than existing methods.Junwei LuoHongyu DingJiquan ShenHaixia ZhaiZhengjiang WuChaokun YanHuimin LuoBMCarticleComputer applications to medicine. Medical informaticsR858-859.7Biology (General)QH301-705.5ENBMC Bioinformatics, Vol 22, Iss 1, Pp 1-13 (2021)
institution DOAJ
collection DOAJ
language EN
topic Computer applications to medicine. Medical informatics
R858-859.7
Biology (General)
QH301-705.5
spellingShingle Computer applications to medicine. Medical informatics
R858-859.7
Biology (General)
QH301-705.5
Junwei Luo
Hongyu Ding
Jiquan Shen
Haixia Zhai
Zhengjiang Wu
Chaokun Yan
Huimin Luo
BreakNet: detecting deletions using long reads and a deep learning approach
description Abstract Background Structural variations (SVs) occupy a prominent position in human genetic diversity, and deletions form an important type of SV that has been suggested to be associated with genetic diseases. Although various deletion calling methods based on long reads have been proposed, a new approach is still needed to mine features in long-read alignment information. Recently, deep learning has attracted much attention in genome analysis, and it is a promising technique for calling SVs. Results In this paper, we propose BreakNet, a deep learning method that detects deletions by using long reads. BreakNet first extracts feature matrices from long-read alignments. Second, it uses a time-distributed convolutional neural network (CNN) to integrate and map the feature matrices to feature vectors. Third, BreakNet employs a bidirectional long short-term memory (BLSTM) model to analyse the produced set of continuous feature vectors in both the forward and backward directions. Finally, a classification module determines whether a region refers to a deletion. On real long-read sequencing datasets, we demonstrate that BreakNet outperforms Sniffles, SVIM and cuteSV in terms of their F1 scores. The source code for the proposed method is available from GitHub at https://github.com/luojunwei/BreakNet . Conclusions Our work shows that deep learning can be combined with long reads to call deletions more effectively than existing methods.
format article
author Junwei Luo
Hongyu Ding
Jiquan Shen
Haixia Zhai
Zhengjiang Wu
Chaokun Yan
Huimin Luo
author_facet Junwei Luo
Hongyu Ding
Jiquan Shen
Haixia Zhai
Zhengjiang Wu
Chaokun Yan
Huimin Luo
author_sort Junwei Luo
title BreakNet: detecting deletions using long reads and a deep learning approach
title_short BreakNet: detecting deletions using long reads and a deep learning approach
title_full BreakNet: detecting deletions using long reads and a deep learning approach
title_fullStr BreakNet: detecting deletions using long reads and a deep learning approach
title_full_unstemmed BreakNet: detecting deletions using long reads and a deep learning approach
title_sort breaknet: detecting deletions using long reads and a deep learning approach
publisher BMC
publishDate 2021
url https://doaj.org/article/9283c944fedc46f89e94543b6169d296
work_keys_str_mv AT junweiluo breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach
AT hongyuding breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach
AT jiquanshen breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach
AT haixiazhai breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach
AT zhengjiangwu breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach
AT chaokunyan breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach
AT huiminluo breaknetdetectingdeletionsusinglongreadsandadeeplearningapproach
_version_ 1718372205804388352