TY - JOUR
T1 - SOAP3-dp
T2 - Fast, Accurate and Sensitive GPU-Based Short Read Aligner
AU - Luo, Ruibang
AU - Wong, Thomas
AU - Zhu, Jianqiao
AU - Liu, Chi Man
AU - Zhu, Xiaoqian
AU - Wu, Edward
AU - Lee, Lap Kei
AU - Lin, Haoxiang
AU - Zhu, Wenjuan
AU - Cheung, David W.
AU - Ting, Hing Fung
AU - Yiu, Siu Ming
AU - Peng, Shaoliang
AU - Yu, Chang
AU - Li, Yingrui
AU - Li, Ruiqiang
AU - Lam, Tak Wah
PY - 2013/5/31
Y1 - 2013/5/31
AB - To tackle the exponentially increasing throughput of Next-Generation Sequencing (NGS), most existing short-read aligners can be configured to favor speed at the expense of accuracy and sensitivity. SOAP3-dp, by leveraging the computational power of both CPU and GPU with optimized algorithms, delivers high speed and sensitivity simultaneously. Compared with widely adopted aligners including BWA, Bowtie2, SeqAlto, CUSHAW2, GEM and the GPU-based aligners BarraCUDA and CUSHAW, SOAP3-dp was found to be two to tens of times faster, while maintaining the highest sensitivity and lowest false discovery rate (FDR) on Illumina reads of different lengths. Unlike its predecessor SOAP3, which does not allow gapped alignment, SOAP3-dp by default tolerates alignment similarity as low as 60%. Real-data evaluation using the human genome demonstrates SOAP3-dp's power to enable more authentic variants and longer indels to be discovered. Fosmid sequencing shows a 9.1% FDR on newly discovered deletions. SOAP3-dp natively supports the BAM file format and provides the same scoring scheme as BWA, which enables it to be integrated into existing analysis pipelines. SOAP3-dp has been deployed on Amazon-EC2, NIH-Biowulf and Tianhe-1A.
UR - http://www.scopus.com/inward/record.url?scp=84878532952&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0065632
DO - 10.1371/journal.pone.0065632
M3 - Article
SN - 1932-6203
VL - 8
JO - PLoS ONE
JF - PLoS ONE
IS - 5
M1 - e65632
ER -