  • Re-assembly of singletons?

    Hi, all

    I have 454 transcriptome data, and I am a bit puzzled by some assemblies I have been doing. When I did a first assembly (on CLC Workbench), the ~600,000 reads assembled into 13,000 contigs and 157,000 singletons. Just out of curiosity, I used the set of singletons and did a new assembly of them. Since they were supposedly unmatched reads, I expected them to not assemble very well, if at all. Instead, the singletons assembled into a new set of 14,000 contigs, leaving a remaining 70,000 singletons!! I used the same stringency for both assemblies (0.5 length fraction, and 0.90 identity).

    When I look at the coverage for the re-assembled singletons, it seems they have comparable coverage and length distribution to the original assembly. So they seem to be OK.

    I guess I don't really understand the assembly algorithm, because I am puzzled as to why it would leave so many reads unassembled in a first pass when they evidently matched other reads.

    So my question is: is re-assembly of 'left-over' singletons from a first assembly a reasonable approach? Or does that somehow force 'bad' contigs to be formed?

    Any insight would be extremely helpful, since the structure of my dataset will vary tremendously depending on the answer!

    Thanks!

    Felipe
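A quick tally of the numbers quoted in the post can help frame the question. This is only a sketch using the rounded figures given above (the ~600,000 total is approximate), and all variable names are illustrative. Note that reads per contig is a crude proxy and is not the same thing as coverage, which also depends on contig and read lengths.

```python
# Approximate counts quoted in the post; the variable names are hypothetical.
total_reads = 600_000            # "~600,000 reads" (rounded in the post)
first_pass_contigs = 13_000
first_pass_singletons = 157_000
second_pass_contigs = 14_000
second_pass_singletons = 70_000


def pct(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Reads that ended up in contigs in each pass.
reads_in_first_contigs = total_reads - first_pass_singletons
reads_rescued = first_pass_singletons - second_pass_singletons

summary = {
    # Share of all reads left as singletons after the first assembly.
    "first_pass_singleton_pct": pct(first_pass_singletons, total_reads),
    # Share of those singletons that joined a contig in the second assembly.
    "rescued_pct_of_singletons": pct(reads_rescued, first_pass_singletons),
    # Share of all reads still unassembled after both passes.
    "final_singleton_pct": pct(second_pass_singletons, total_reads),
    # Crude proxy only: average number of reads per contig in each pass.
    "reads_per_contig_first": round(reads_in_first_contigs / first_pass_contigs, 1),
    "reads_per_contig_second": round(reads_rescued / second_pass_contigs, 1),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```

With these rounded inputs, roughly a quarter of the reads are singletons after the first pass, and a little over half of those are placed into contigs by the second pass.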

  • #2
    I have the same concern...



    • #3
      Keep in mind that CLC GWB is not a transcriptome assembler but a genome assembler. That might lead to unforeseen results. You may want to ask CLC support to comment on the results you have gotten.
