Technical Name Kart – Ultra-fast mapping for NGS short and long reads
Project Operator Institute of Information Science, Academia Sinica
Project Host 許聞廉
Summary
Kart adopts divide-and-conquer strategy. It finds an alignment by clustering all seeds on the reference genome. Kart divides a read into fragments that are easy to align (without INDELs) and those that require the dynamic programming algorithm, and align each fragment to compose the final alignment. Kart is the first method which use divide and conquer technique on the problem of NGS read mapping.
Scientific Breakthrough
We simulated read libraries of the human genome and generates synthetic reads with random sequencing errors. We compared Kart with several existing nine popular read aligners, like BWA-MEM, Bowtie2 and HISAT2, etc. The experiment result showed that most selected aligners produced comparable alignments. However, Kart is the fastest aligner among all considered algorithms.
Industrial Applicability
Our algorithm can not only handle NGS short read alignment and long read alignment, but it can also handle RNA-Seq read alignment, and genome sequence comparisons. It can be also applied to Metagenomics data analysis. Metagenomics data analysis involves huge amount of microbial sequences. We found that our algorithm can efficiently handle Metagenomics data and achieve strain-level resolution.
Keyword Next generation sequencing Third generation sequencing sequence alignment sequence variation divide-and-conquer seed-and-extend genome sequence alignment Metagenomics Computational biology Bioinformatics
other people also saw