||
Highly accurate fluorogenic DNA sequencing with information theory–based error correction
Corresponding author: Yanyi Huang (黄岩宜)
Eliminating (消除) errors in next-generation DNA sequencing has proved challenging. Here we present error-correction code (ECC) sequencing, a method to greatly improve sequencing accuracy by combining fluorogenic sequencing-by-synthesis (SBS; 边合成边测序) with an information theory–based error-correction (纠错) algorithm. ECC embeds (嵌入) redundancy in sequencing reads by creating three orthogonal degenerate sequences (三重正交简并序列), generated by alternate dual-base (对偶碱基) reactions. This is similar to encoding and decoding strategies that have proved effective in detecting and correcting errors in information communication and storage. We show that, when combined with a fluorogenic SBS chemistry with raw accuracy of 98.1%, ECC sequencing provides single-end, error-free sequences up to 200 bp. ECC approaches should enable accurate identification of extremely rare genomic variations in various applications in biology and medicine.
消除NGS中的错误是非常具有挑战的事情,同时也是十分有意义的工作。本文提出了一种纠错编码(ECC)的测序技术,该技术可以通过结合荧光发生边测序边合成(SBS)和一个基于信息理论的纠错算法来大幅度提高测序的精确度。ECC通过对偶碱基的两两组合反应得到的三条正交简并序列上的冗余信息来对测序进行纠错。这种纠错方法类似于已经成功并广泛应用于信息通讯和存储的的编码和解码策略,该策略可有效检测和矫正错误。本文表明结合了精确度约为98.1%的荧光发生SBS技术,ECC测序技术可以提供单端、200bp的免错序列。ECC方法能够在生物学及医药学中精确鉴定极端罕见的基因组变异。
个人简介:2002-2005,美国加州理工学院应用物理系,博士后;2005-2006,美国斯坦福大学生物工程系,博士后;2006-2013,北京大学工学院,特聘研究员;2007-现在,北京大学化学与分子工程学院,兼职研究员;2010-现在,北京大学BIOPIC,研究员;2013-现在,北京大学工学院,教授;2013-现在,北京大学高通量测序中心常务副主任;2013-现在, 北京大学BIOPIC, 中心副主任;2015-现在,北大-清华生命科学联合中心,研究员;2016.8-现在,北大未来基因诊断高精尖创新中心,研究员。
doi: 10.1038/nbt.3982
Archiver|手机版|科学网 ( 京ICP备07017567号-12 )
GMT+8, 2024-11-26 20:43
Powered by ScienceNet.cn
Copyright © 2007- 中国科学报社