**Brian Bushnell** · 06-20-2014, 01:25 PM

Have you tried varying the k-mer length when assembling? It would also help to know more about your data, such as the read length, the total amount of sequence, and quality metrics.

I encourage you to read this thread:

de novo assembly of 1.7Gb reptile - Best Practices?

http://seqanswers.com/forums/showthread.php?t=42555

**luc** · 06-20-2014, 04:01 PM

... and the amount of contaminating sequences?

**arundurvasula** · 06-23-2014, 10:44 AM

Thanks for the replies.

I used VelvetOptimiser to determine optimal k-mer length. Our data contains a mixture of grape and virus reads, but we removed the reads that aligned to the grape reference genome. Our read length is 50 bp and we have 7,764,190 reads after filtering out the grape reads.

Here is the quast output from the optimal velvet run:

All statistics are based on contigs of size >= 100 bp, unless otherwise noted (e.g., "# contigs (>= 0 bp)" and "Total length (>= 0 bp)" include all contigs).

| Assembly | contigs |
|---|---|
| # contigs (>= 0 bp) | 3547 |
| # contigs (>= 1000 bp) | 1 |
| Total length (>= 0 bp) | 326445 |
| Total length (>= 1000 bp) | 1073 |
| # contigs | 941 |
| Largest contig | 1073 |
| Total length | 156559 |
| GC (%) | 46.84 |
| N50 | 162 |
| N75 | 122 |
| L50 | 305 |
| L75 | 584 |
| # N's per 100 kbp | 0.00 |
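
For readers unfamiliar with the N50 and L50 rows in a report like this one, here is a minimal Python sketch of how these values are commonly defined: sort the contigs from longest to shortest and walk down until the running total reaches half (or three quarters) of the assembly length. The function name `nx_lx` and the toy contig lengths are illustrative assumptions. This is not QUAST's code, and the lengths are not the poster's contigs.

```python
def nx_lx(lengths, fraction=0.5):
    """Return (Nx, Lx) for a list of contig lengths.

    Nx is the length of the contig at which the running total, taken from
    the longest contig down, first reaches `fraction` of the total length.
    Lx is the number of contigs needed to get there.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)
    target = fraction * sum(ordered)
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= target:
            return length, count


# Toy example with made-up lengths (hypothetical data).
toy_lengths = [1000, 500, 300, 200, 100, 100]
n50, l50 = nx_lx(toy_lengths)
n75, l75 = nx_lx(toy_lengths, fraction=0.75)
print("N50:", n50, "L50:", l50)
print("N75:", n75, "L75:", l75)
```

Read this way, a low N50 together with a high L50 means that most of the assembled sequence sits in many short contigs.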

**mastal** · 06-23-2014, 11:13 AM

Are you trying to assemble genomic data or transcriptomic data?

What is the expected genome size of the virus genome you are trying to assemble?

What kmer length have you used?

As Brian already mentioned above, I would experiment with the k-mer length when using Velvet to see which value gives you the best N50.

Have you done any QC, adapter trimming or quality trimming on your reads?
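
The expected genome size matters because it determines sequencing depth. A rough sketch of that arithmetic, using the read count and read length reported earlier in the thread, is below. The genome size is a hypothetical placeholder (the thread does not state one at this point), and the result is only an upper bound, since not every read that survived the grape filter is necessarily viral.

```python
def total_bases(read_count, read_length):
    """Total sequenced bases, assuming every read has the same length."""
    return read_count * read_length


def mean_coverage(read_count, read_length, genome_size):
    """Average depth if all reads came from a genome of `genome_size` bp."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return total_bases(read_count, read_length) / genome_size


READ_COUNT = 7_764_190             # reported in the thread
READ_LENGTH = 50                   # bp, reported in the thread
HYPOTHETICAL_GENOME_SIZE = 10_000  # bp, an assumption for illustration only

print("total bases:", total_bases(READ_COUNT, READ_LENGTH))
print("mean coverage (upper bound):",
      mean_coverage(READ_COUNT, READ_LENGTH, HYPOTHETICAL_GENOME_SIZE))
```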

**SNPsaurus** · 06-23-2014, 12:42 PM

Do you think the viral genome will be divergent within a sample from replication errors? That could cause issues for assembly if there are lots of related kmers at a location instead of just one or two alleles and a low level of sequencing error.
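
To illustrate the concern about many related k-mers at one location, here is a small Python sketch that counts the distinct k-mers produced by a set of sequences differing at a single site. The sequence, the variant position, and the helper names are all made up for illustration. Real viral populations and assembler graphs are far more complex.

```python
def kmers(seq, k):
    """Set of all k-mers (substrings of length k) in one sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def distinct_kmers(sequences, k):
    """Union of k-mers across several sequences."""
    found = set()
    for seq in sequences:
        found |= kmers(seq, k)
    return found


# Hypothetical 16 bp "genome" and two variants that differ at position 8.
reference = "ACGTTGCAAGCTTAGC"
variant_t = reference[:8] + "T" + reference[9:]
variant_c = reference[:8] + "C" + reference[9:]

K = 5
for label, population in [
    ("reference only", [reference]),
    ("plus one variant", [reference, variant_t]),
    ("plus two variants", [reference, variant_t, variant_c]),
]:
    print(label, "->", len(distinct_kmers(population, K)), "distinct k-mers")
```

In this toy case each extra variant adds up to k new k-mers around the variable site, so a de Bruijn graph built from a diverse population has more branches to resolve at that location.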

**arundurvasula** · 07-01-2014, 09:20 AM

I was able to assemble my data using IDBA_UD. I set it to cycle through k-mer sizes smaller than my read length, and it produced a contig of about 15,000 bp:

`idba_ud -r ../data/trimmed-reads/LV89-02.fa -o ../results/contigs/008 --mink 19 --maxk 49 --step 2`
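
As a rough illustration of what a `--mink 19 --maxk 49 --step 2` setting implies with 50 bp reads, the Python sketch below lists the k values such a sweep would step through. It assumes the assembler simply increments k from the minimum to the maximum by the step size. The helper `kmer_schedule` is hypothetical and does not model IDBA_UD's internals.

```python
def kmer_schedule(mink, maxk, step, read_length):
    """List the k values from mink to maxk (inclusive) in increments of step.

    Raises ValueError if any k would not be shorter than the reads, since a
    k-mer at least as long as a read cannot come from that read.
    """
    if step <= 0:
        raise ValueError("step must be positive")
    if not 0 < mink <= maxk:
        raise ValueError("need 0 < mink <= maxk")
    if maxk >= read_length:
        raise ValueError("maxk must be smaller than the read length")
    return list(range(mink, maxk + 1, step))


# Values taken from the command in the post; read length from earlier in the thread.
schedule = kmer_schedule(mink=19, maxk=49, step=2, read_length=50)
print(len(schedule), "k values:", schedule)
```

Starting at an odd k with an even step keeps every k in the sweep odd, and the largest value, 49, stays just under the 50 bp read length.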

**jpummil** · 07-01-2014, 11:07 AM

Could you post the QUAST quality stats for the assembly?

**arundurvasula** · 07-01-2014, 11:18 AM

All statistics are based on contigs of size >= 100 bp, unless otherwise noted (e.g., "# contigs (>= 0 bp)" and "Total length (>= 0 bp)" include all contigs).

| Assembly | contig |
|---|---|
| # contigs (>= 0 bp) | 295 |
| # contigs (>= 1000 bp) | 37 |
| Total length (>= 0 bp) | 199556 |
| Total length (>= 1000 bp) | 86293 |
| # contigs | 295 |
| Largest contig | 15124 |
| Total length | 199556 |
| GC (%) | 46.27 |
| N50 | 759 |
| N75 | 426 |
| L50 | 53 |
| L75 | 141 |
| # N's per 100 kbp | 0.00 |

Increasing contig lengths