Hi all,
I've done a bit of E. coli resequencing but I am new to the art of de novo and could use some input on what strategies would give me a fully contiguous genome (and at what cost).
Trying to assemble a 5.2Mb bacteria (estimated from pulsed field) with 220bp SE MiSeq filtered reads (Q>=25) @ ~150x coverage. Velvet Optimiser tells me:
k=147
numContigs = 161
n50 = 81.6kb
longest contig = 279kb
num contigs > 1kb = 107
total bases in contigs = 5.1Mb (~98.9% covered)
I don't necessarily need a fully contiguous genome but what would it take? According to what I've read, a mate-paired run could help but it seems that I am at a point of diminishing returns.
Thanks for any suggestions!
I've done a bit of E. coli resequencing but I am new to the art of de novo and could use some input on what strategies would give me a fully contiguous genome (and at what cost).
Trying to assemble a 5.2Mb bacteria (estimated from pulsed field) with 220bp SE MiSeq filtered reads (Q>=25) @ ~150x coverage. Velvet Optimiser tells me:
k=147
numContigs = 161
n50 = 81.6kb
longest contig = 279kb
num contigs > 1kb = 107
total bases in contigs = 5.1Mb (~98.9% covered)
I don't necessarily need a fully contiguous genome but what would it take? According to what I've read, a mate-paired run could help but it seems that I am at a point of diminishing returns.
Thanks for any suggestions!
Comment