I was looking for DNA assemblers that work on GPU. I found only this paper http://www.cs.gmu.edu/~tr-admin/pape...-TR-2011-1.pdf - GPU Euler. But I was not satisfied with the concept and results provided in the paper. It works by finding the whole Euler tour without any graph transformation and error correction, still it is getting results comparable to well established assemblers like Euler SR. Like max-length of 40,000 , N-50 value 8000, etc. No other assembler works by finding the whole Euler tour then how this paper is mentioning such good results. Does anyone have read or worked with this paper?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
I would say GPU is a no-go for genome assembly. We looked at various options for doing genome assembly in GPUs last year, and could not make the algorithms scale well. Genome assembly programs need very large memory bandwidth, and it is not possible to scale the programs well in the GPUs, whose greatest benefit is access to many 'parallel' processors. Late last year, I attended BGI's booth at HPC conference (Salt Lake City) and saw a number of GPU solutions being presented for various bioinformatics problems, but the genome assembly program did not seem to give any performance boost. At present our group is working on implementing a genome assembler in FPGA, where we can get the performance boost.
I will forward your question to BGI's Ruibang, who can probably shed more light on the current status.
-
According to the tech report, 90% of total time goes to "I/O". If I understand correctly, this "I/O" phase, unusually, includes k-mer counting and is done purely with CPU. K-mer counting is one of the slowest and most memory hungry steps in the construction of de Bruijn graph. If we cannot parallelize this step with GPU, we will not get much speed up.
In addition, the reported assembly speed is slower than what I would expect with velvet. I think velvet can usually get the results in a minute or so given 20X error-free data for a ~2Mbp genome. That is in par with GPU-Euler.
In all, I think the tech report does not prove that a GPU-based de Bruijn assembler is much better than CPU-based ones.
Comment
-
Thanks Samanta and lh3. I'm not closing the possibility of implementing an ultra-fast assembler on GPU, but the reduction nature of genome assembly problem constrained it from scaling well on GPU. For the latest GPU model nVidia GTX Titan, which has 2600+ cores but only ~300G memory bandwidth, every core will only get ~100MB/s memory bandwidth, not mentioning the optimal can only be achieve by coalesced memory access, which is almost impossible to be fulfilled no matter using DBG, String Graph or Greedy. Another problem is that GPU has only limited amount of on-board memory (3G-12G), swapping between host memory and GPU memory is possible but ultimately slow.
Differently, the problem of alignment is mainly "mapping" problem in MapReduce scheme, which makes it suitable for GPU or other HPC accelerator like FPGA and MIC. Plenty investigations have been done: "SOAP3-dp" (http://arxiv.org/abs/1302.5507) and "CUSHAW2-GPU" (http://cushaw2.sourceforge.net/homepage.htm#latest) has achieved more than 10x acceleration to CPU aligners, the most important, much higher sensitivity and accuracy in opening large gaps provide much more computational power.
BTW, frankly speaking, CPU assemblers, say SOAPdenovo2 and ALLPATH-LG, still have a large space to be improved. Samanta has a very good discussion on the hash function used in assemblers (http://homolog.us/blogs). A question is that, why we have to use standard, general hash functions in assembler? The only feature assemblers require the hash functions to have is the evenness, why shall we care that much about avalanche test.
Comment
-
-
Originally posted by davispeter View Posthmmmm. Thanks for reply. But what about the Euler approach. In the paper (that I mentioned) the Euler approach is implemented parallely. Does that mean Euler approach is not good?
-mark
Comment
-
Latest Articles
Collapse
-
by seqadmin
Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...-
Channel: Articles
09-23-2024, 06:35 AM -
-
by seqadmin
During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.
Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...-
Channel: Articles
09-09-2024, 10:59 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 10-02-2024, 04:51 AM
|
0 responses
13 views
0 likes
|
Last Post
by seqadmin
10-02-2024, 04:51 AM
|
||
Started by seqadmin, 10-01-2024, 07:10 AM
|
0 responses
22 views
0 likes
|
Last Post
by seqadmin
10-01-2024, 07:10 AM
|
||
Started by seqadmin, 09-30-2024, 08:33 AM
|
0 responses
26 views
0 likes
|
Last Post
by seqadmin
09-30-2024, 08:33 AM
|
||
Started by seqadmin, 09-26-2024, 12:57 PM
|
0 responses
18 views
0 likes
|
Last Post
by seqadmin
09-26-2024, 12:57 PM
|
Comment