High information capacity DNA-based data storage with augmented encoding characters using degenerate bases

Abstract DNA-based data storage has emerged as a promising method to satisfy the exponentially increasing demand for information storage. However, practical implementation of DNA-based data storage remains a challenge because of the high cost of data writing through DNA synthesis. Here, we propose t...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Yeongjae Choi, Taehoon Ryu, Amos C. Lee, Hansol Choi, Hansaem Lee, Jaejun Park, Suk-Heung Song, Seojoo Kim, Hyeli Kim, Wook Park, Sunghoon Kwon
Formato: article
Lenguaje:EN
Publicado: Nature Portfolio 2019
Materias:
R
Q
Acceso en línea:https://doaj.org/article/1850e94c43934ed380bb8abe924a3e09
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:1850e94c43934ed380bb8abe924a3e09
record_format dspace
spelling oai:doaj.org-article:1850e94c43934ed380bb8abe924a3e092021-12-02T15:09:54ZHigh information capacity DNA-based data storage with augmented encoding characters using degenerate bases10.1038/s41598-019-43105-w2045-2322https://doaj.org/article/1850e94c43934ed380bb8abe924a3e092019-04-01T00:00:00Zhttps://doi.org/10.1038/s41598-019-43105-whttps://doaj.org/toc/2045-2322Abstract DNA-based data storage has emerged as a promising method to satisfy the exponentially increasing demand for information storage. However, practical implementation of DNA-based data storage remains a challenge because of the high cost of data writing through DNA synthesis. Here, we propose the use of degenerate bases as encoding characters in addition to A, C, G, and T, which augments the amount of data that can be stored per length of DNA sequence designed (information capacity) and lowering the amount of DNA synthesis per storing unit data. Using the proposed method, we experimentally achieved an information capacity of 3.37 bits/character. The demonstrated information capacity is more than twice when compared to the highest information capacity previously achieved. The proposed method can be integrated with synthetic technologies in the future to reduce the cost of DNA-based data storage by 50%.Yeongjae ChoiTaehoon RyuAmos C. LeeHansol ChoiHansaem LeeJaejun ParkSuk-Heung SongSeojoo KimHyeli KimWook ParkSunghoon KwonNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 9, Iss 1, Pp 1-7 (2019)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Yeongjae Choi
Taehoon Ryu
Amos C. Lee
Hansol Choi
Hansaem Lee
Jaejun Park
Suk-Heung Song
Seojoo Kim
Hyeli Kim
Wook Park
Sunghoon Kwon
High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
description Abstract DNA-based data storage has emerged as a promising method to satisfy the exponentially increasing demand for information storage. However, practical implementation of DNA-based data storage remains a challenge because of the high cost of data writing through DNA synthesis. Here, we propose the use of degenerate bases as encoding characters in addition to A, C, G, and T, which augments the amount of data that can be stored per length of DNA sequence designed (information capacity) and lowering the amount of DNA synthesis per storing unit data. Using the proposed method, we experimentally achieved an information capacity of 3.37 bits/character. The demonstrated information capacity is more than twice when compared to the highest information capacity previously achieved. The proposed method can be integrated with synthetic technologies in the future to reduce the cost of DNA-based data storage by 50%.
format article
author Yeongjae Choi
Taehoon Ryu
Amos C. Lee
Hansol Choi
Hansaem Lee
Jaejun Park
Suk-Heung Song
Seojoo Kim
Hyeli Kim
Wook Park
Sunghoon Kwon
author_facet Yeongjae Choi
Taehoon Ryu
Amos C. Lee
Hansol Choi
Hansaem Lee
Jaejun Park
Suk-Heung Song
Seojoo Kim
Hyeli Kim
Wook Park
Sunghoon Kwon
author_sort Yeongjae Choi
title High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
title_short High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
title_full High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
title_fullStr High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
title_full_unstemmed High information capacity DNA-based data storage with augmented encoding characters using degenerate bases
title_sort high information capacity dna-based data storage with augmented encoding characters using degenerate bases
publisher Nature Portfolio
publishDate 2019
url https://doaj.org/article/1850e94c43934ed380bb8abe924a3e09
work_keys_str_mv AT yeongjaechoi highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT taehoonryu highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT amosclee highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT hansolchoi highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT hansaemlee highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT jaejunpark highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT sukheungsong highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT seojookim highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT hyelikim highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT wookpark highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
AT sunghoonkwon highinformationcapacitydnabaseddatastoragewithaugmentedencodingcharactersusingdegeneratebases
_version_ 1718387736617943040