Does anyone have experience with merging scaffolds from a few assemblies into a single consensus assembly? When I run SOAPdenovo with different k cutoffs I get different resulting scaffold characteristics. Higher k values tend to give me a few longer scaffolds with better n50, but overall a lower mean scaffold size and more shorter scaffolds than with smaller values of k. It seems like a good idea to merge assemblies generated with a few of these kmer values into a single consensus assembly, right?
I have looked at a few tools to do this. One of them is called Reconciliator http://www.genome.umd.edu/reconcilia...structions.htm but it would require me to convert a few SOAPdenovo assembly output files into "Sanger/WashU" format which looks like it could involve quite a bit of work. I am not sure the amount of work required to generate all of these required input files would be worth it?
I also found minimus2 which looks like it is primarily designed to merge two assemblies at a time, and I have 4 I would like to merge. I could do pairwise merging with the previous consensus, but that seems like it could lead to problems... I am also finding some complaints on forums about how the program deals with N's.
The last program I found is called MAIA, but it is distributed as a Matlab package with dependencies on a few matlab distributed toolkits (yuck) and it looks like it also requires a "closely related reference genome" which I definitely do not have.
Thanks for any suggestions or experiences with this.
-John
I have looked at a few tools to do this. One of them is called Reconciliator http://www.genome.umd.edu/reconcilia...structions.htm but it would require me to convert a few SOAPdenovo assembly output files into "Sanger/WashU" format which looks like it could involve quite a bit of work. I am not sure the amount of work required to generate all of these required input files would be worth it?
I also found minimus2 which looks like it is primarily designed to merge two assemblies at a time, and I have 4 I would like to merge. I could do pairwise merging with the previous consensus, but that seems like it could lead to problems... I am also finding some complaints on forums about how the program deals with N's.
The last program I found is called MAIA, but it is distributed as a Matlab package with dependencies on a few matlab distributed toolkits (yuck) and it looks like it also requires a "closely related reference genome" which I definitely do not have.
Thanks for any suggestions or experiences with this.
-John
Comment