Hi all,
I have 454 transcriptome data, and I am a bit puzzled by some assemblies I have been doing. In a first assembly (on CLC Workbench), the ~600,000 reads assembled into 13,000 contigs and 157,000 singletons. Just out of curiosity, I took the set of singletons and ran a new assembly on them alone. Since they were supposedly unmatched reads, I expected them not to assemble well, if at all. Instead, the singletons assembled into a new set of 14,000 contigs, leaving only 70,000 remaining singletons! I used the same stringency for both assemblies (0.5 length fraction and 0.90 identity).
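For reference, here is roughly how I understand those two cutoffs. This is just an illustrative sketch of the semantics I am assuming (length fraction = portion of the read that must align, identity = match rate within the aligned part), not anything taken from CLC itself:

def read_passes_thresholds(read_len, aligned_len, matches,
                           min_length_fraction=0.5, min_identity=0.90):
    # Assumed semantics: 'length fraction' is the portion of the read that
    # must align to a contig, and 'identity' is the match rate within that
    # aligned region. Purely illustrative, not CLC's implementation.
    if aligned_len == 0:
        return False
    length_fraction = aligned_len / read_len
    identity = matches / aligned_len
    return length_fraction >= min_length_fraction and identity >= min_identity

# e.g. a 400 bp read aligning over 220 bp with 205 matching bases passes:
print(read_passes_thresholds(400, 220, 205))  # True (0.55 >= 0.5, ~0.93 >= 0.90)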
When I look at the coverage of the contigs built from the re-assembled singletons, they seem to have coverage and length distributions comparable to those from the original assembly, so they look OK.
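In case it helps to see what I compared: this is roughly how I pulled the length distributions out of the two exported contig FASTA files (the file names are just placeholders for my exports):

import statistics

def contig_lengths(fasta_path):
    # Collect sequence lengths from a (possibly multi-line) FASTA file.
    lengths, current = [], 0
    with open(fasta_path) as handle:
        for line in handle:
            if line.startswith(">"):
                if current:
                    lengths.append(current)
                current = 0
            else:
                current += len(line.strip())
    if current:
        lengths.append(current)
    return lengths

# Placeholder file names for the contigs exported from each assembly.
for label, path in [("first-pass contigs", "first_assembly_contigs.fasta"),
                    ("singleton re-assembly contigs", "singleton_contigs.fasta")]:
    lengths = contig_lengths(path)
    print(label, "n =", len(lengths),
          "median =", statistics.median(lengths),
          "max =", max(lengths))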
I guess I don't really understand the assembly algorithm, because I am puzzled as to why it would leave so many reads unassembled in the first pass when they evidently did match other reads.
So my question is: is re-assembly of 'left-over' singletons from a first assembly a reasonable approach? Or does that somehow force 'bad' contigs to be formed?
Any insight would be extremely helpful, since the structure of my dataset will vary tremendously depending on the answer!
Thanks!
Felipe