Greetings.
I am doing some de novo assembly of a 23 Mb genome using MiSeq paired end Illumina reads (250bp reads, 400bp insert (SD 130)). These reads, however, have been trimmed for quality and range widely in their finished size, with most at about 190bp. Assembly using -long/-longPaired vs -short/shortPaired gives surprisingly different final results. Any ideas why this is happening or which results are more reliable?
Thanks!
Commands:
Results, short:
Results, long:
I am doing some de novo assembly of a 23 Mb genome using MiSeq paired end Illumina reads (250bp reads, 400bp insert (SD 130)). These reads, however, have been trimmed for quality and range widely in their finished size, with most at about 190bp. Assembly using -long/-longPaired vs -short/shortPaired gives surprisingly different final results. Any ideas why this is happening or which results are more reliable?
Thanks!
Commands:
Code:
velveth Genome1_71 71 -short -fastq reads_R1.trimmed.fastq.se reads_R2.trimmed.fastq.se -shortPaired -separate -fastq reads_R1.trimmed.fastq.pe reads_R2.trimmed.fastq.pe velvetg Genome1_71 -exp_cov 43 -ins_length 407 -ins_length_sd 130 velveth Genome1_71 71 -long -fastq reads_R1.trimmed.fastq.se reads_R2.trimmed.fastq.se -longPaired -separate -fastq reads_R1.trimmed.fastq.pe reads_R2.trimmed.fastq.pe velvetg Genome1_71 -exp_cov 43 -ins_length_long 407 -ins_length_long_sd 130
Code:
Final graph has 128642 nodes and n50 of 17324, max 332339, total 26267561, using 6042595/7501247 reads
Code:
Final graph has 148426 nodes and n50 of 1610, max 28675, total 26984545, using 6083488/7501247 reads
Comment