I have de novo assembled a 1.8 Mbp genome from a paired-end (PE) 100 bp library with a 360 bp insert size using ABySS, resulting in 69 contigs. After that I scaffolded these contigs with a mate-pair (MP) 100 bp library with a 3300 bp insert size using SSPACE-Premium, resulting in 7 scaffolds.
My problem is that SSPACE makes connections between contigs even though a contig links to multiple other contigs. This is the command I used to run SSPACE:
Code:
SSPACE_Premium_v2.2.pl -l libSSPACE.txt -s contigs.fna -k 20 -a 0.9 -x0 -b outdir -T 12 -p1
Library file libSSPACE.txt:
lib4000 bwa rawseq_MP_1.txt.gz rawseq_MP_2.txt.gz 3356 0.2 RF
When I run this, I get a connection between contig 37 and contig 15, although contig 37 has links to 4 different contigs:
r37 has 1118 links with r14 and gap of 351 bases
r37 has 1051 links with r36 and gap of 488 bases
r37 has 595 links with f12 and gap of 334 bases
f37 has 542 links with f15 and gap of -242 bases
I guess that this depends on the scaffold-building step, which uses two different methods/ratios for generating reliable scaffolds.
I do not trust these results. How can I set the parameters to get reliable scaffolds with my long-insert library?
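To make the link counts easier to compare, here is a minimal Python sketch that groups the reported links by contig end (f37 and r37 are treated as separate ends) and computes the ratio of the second-best to the best link count for each end. It assumes the log line format shown above. The names parse_links and link_ratios are hypothetical helpers, not part of SSPACE, and comparing this ratio to -a is only my reading of how that option works.

```python
import re
from collections import defaultdict

# Matches lines such as: "r37 has 1118 links with r14 and gap of 351 bases"
LINK_RE = re.compile(
    r"^([fr]\d+) has (\d+) links with ([fr]\d+) and gap of (-?\d+) bases$"
)

def parse_links(lines):
    """Group link records by the contig end they start from (e.g. 'r37')."""
    ends = defaultdict(list)
    for line in lines:
        match = LINK_RE.match(line.strip())
        if match is None:
            continue  # skip anything that is not a link line
        source, n_links, target, gap = match.groups()
        ends[source].append((int(n_links), target, int(gap)))
    return ends

def link_ratios(ends):
    """Ratio of second-best to best link count per contig end.

    An end with a single candidate gets 0.0 (no competing link).
    Assumption: this is the kind of ratio that -a is compared against.
    """
    ratios = {}
    for end, links in ends.items():
        counts = sorted((n for n, _, _ in links), reverse=True)
        ratios[end] = counts[1] / counts[0] if len(counts) > 1 else 0.0
    return ratios

LOG = """\
r37 has 1118 links with r14 and gap of 351 bases
r37 has 1051 links with r36 and gap of 488 bases
r37 has 595 links with f12 and gap of 334 bases
f37 has 542 links with f15 and gap of -242 bases
"""

ends = parse_links(LOG.splitlines())
ratios = link_ratios(ends)
for end in sorted(ratios):
    print(end, len(ends[end]), "candidate(s), ratio", round(ratios[end], 2))
```

With the four lines above, the r37 end has three candidates with a ratio of about 0.94 between the two best, while the f37 end has only one candidate (f15). That would explain why only the f37 to f15 connection looks unambiguous to the scaffolder, if the ratio test is applied per contig end.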