Dear all,
I just received 1 Flow cell of illumina plant sequencing data (75bp PE reads, ~2 X 26 million reads/lane). As we don't have in most cases a reference sequence available, I will have to do some de novo assembly.
I started doing some tests using velvet. First I did an assembly on 1 lane, next I combined 2 lanes. In att you can see the virtual memory that was used during this process (I only show the velvetg part as this was the hardest to do) (horizontal you see the time needed, vertical you have the G of RAM needed). As you can see, the more datasets you combine, the more memory that you need (additive). As I have 14 sets to combine, I will never be able to perform this assembly using velvet (2 lanes already required 91G of RAM).
My questions:
- is there an other way to use velvet (to reduce this memory issue)?
- Are there other (well performing) assembly tools that use less memory? (I tested the CLCBio assembly tool and this one requires much less memory. But, this is of course a commercial tool)
- All suggestions are welcome
Thanks
Steven
I just received 1 Flow cell of illumina plant sequencing data (75bp PE reads, ~2 X 26 million reads/lane). As we don't have in most cases a reference sequence available, I will have to do some de novo assembly.
I started doing some tests using velvet. First I did an assembly on 1 lane, next I combined 2 lanes. In att you can see the virtual memory that was used during this process (I only show the velvetg part as this was the hardest to do) (horizontal you see the time needed, vertical you have the G of RAM needed). As you can see, the more datasets you combine, the more memory that you need (additive). As I have 14 sets to combine, I will never be able to perform this assembly using velvet (2 lanes already required 91G of RAM).
My questions:
- is there an other way to use velvet (to reduce this memory issue)?
- Are there other (well performing) assembly tools that use less memory? (I tested the CLCBio assembly tool and this one requires much less memory. But, this is of course a commercial tool)
- All suggestions are welcome
Thanks
Steven
Comment