Seqanswers Leaderboard Ad

**colindaven** · 10-25-2011, 03:39 AM

We have done > 20 genomes on 454 titanium but also have a lot of Illumina data. The advantage with 454 is you can do de novo assembly or reference based.

With Illumina reads the reference based approach would most likely be more relevant for E. coli, because you'd end up with a lot of small contigs after de novo assembly.

However ref based assembly means you can have difficulties finding new components of the accessory genome. SNP detection vs a reference is very nice though.

**SHB** · 10-25-2011, 04:33 AM

We are really looking for possible new components in the acessory genome and less for SNPs so we are going for de novo sequencing and then genome comparison between strains and ref. genomes.

**pmiguel** · 10-25-2011, 04:59 AM

Another issue here is that of late I have seen some astounding improvements in de novo assemblers that are real game changers for small genome assembly. Using 10% of a lane of sequence from a HiScanSQ 2x100 run on a simple fragment (PE) TruSeq library assembled with ABySS-PE using kmer 70 we get a reasonable draft sequence.

By "reasonable" I mean that for 3 Salmonella strains our N50 was >220 kb with 50% of their respective genomes in 8 or 9 contigs. Between 60 and 70 total contigs with sizes 1 kb or larger.

This is without gap filling or mate-end libraries. Also, these are completely de novo assemblies. (Although, obviously, reference-based assemblies could have been undertaken.)

--
Phillip

**krobison** · 10-27-2011, 06:12 PM

Originally posted by pmiguel View Post

Another issue here is that of late I have seen some astounding improvements in de novo assemblers that are real game changers for small genome assembly. Using 10% of a lane of sequence from a HiScanSQ 2x100 run on a simple fragment (PE) TruSeq library assembled with ABySS-PE using kmer 70 we get a reasonable draft sequence.

About what fold coverage of reads does this work out to?

**pmiguel** · 10-28-2011, 03:47 AM

Generally >100X base coverage. In some cases we have overshot and ended up with >200X with a smaller (1 megabase) bacterial genomes--which leads to "embarrassment of riches" with the assembler. (ABySS-PE).

--
Phillip

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Today, 11:49 AM	0 responses 12 views 0 likes	Last Post by seqadmin Today, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

E coli de novo sequencing

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News