Abstract
In the sequencing process, reads of the sequence are generated, then assembled to form contigs. New technologies can produce reads faster with lower cost and higher coverage. However, these reads are shorter. With errors, short reads make the assembly step more difficult. Chaisson et al. (2004) proposed an algorithm to correct the reads prior to the assembly step. The result is not satisfactory when the error rate is high (e.g., ≥ 3%). We improve their approach to handle reads of higher error rates. Experimental results show that our approach is much more effective in correcting errors, producing contigs of higher quality.
Original language | English |
---|---|
Pages (from-to) | 224-237 |
Number of pages | 14 |
Journal | International Journal of Bioinformatics Research and Applications |
Volume | 5 |
Issue number | 2 |
DOIs | |
Publication status | Published - Mar 2009 |