  • Minimum contig & Coverage

    Hello,

    I have some 454 sequence reads from a cDNA library for a novel organism (non-sequenced genome).

    I have processed the data with CLC Genomics Workbench and done four different de novo assemblies with minimum contig lengths of 200 bp, 300 bp, 400 bp and 500 bp. As one would expect, increasing the minimum contig length decreases the total number of contigs (or assembled cDNAs).

    I have two questions:

    1) Is there any consensus on the minimum contig length for cDNAs?

    2) What is considered a minimal amount of coverage for any given cDNA within the final assembled contigs (cDNAs)? I would assume that rare cDNAs would have much lower coverage than abundant cDNAs.

    Thanks,
    CH
    Last edited by cement_head; 08-23-2012, 05:03 AM. Reason: clarity

  • #2
    Originally posted by cement_head:
    1) Is there any consensus on the minimum contig length for cDNAs?
    No. But if you throw away contigs < 500 bp, say, then you won't discover any transcripts shorter than 500 bp.
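
    To see how many contigs each cutoff would discard, something like the following rough sketch could help. The contig lengths are made-up numbers for illustration, and the function name is hypothetical; in practice the lengths would come from the assembly output.

```python
def count_contigs_by_min_length(contig_lengths, thresholds):
    """Map each minimum-length threshold to the number of contigs kept."""
    return {
        t: sum(1 for length in contig_lengths if length >= t)
        for t in thresholds
    }


# Hypothetical contig lengths in bp (not real data)
lengths = [150, 220, 310, 450, 480, 520, 900, 1500]

counts = count_contigs_by_min_length(lengths, [200, 300, 400, 500])
print(counts)  # {200: 7, 300: 6, 400: 5, 500: 3}
```

    Every contig that drops out between two cutoffs is a candidate short transcript that the stricter assembly would never report.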

    2) What is considered a minimal amount of coverage for any given cDNA within the final assembled contigs (cDNAs)?
    Same as for genomic DNA. That is, there is no hard rule. 1x is probably too low. 100x is probably overkill. Somewhere in between is a reasonable trade-off between getting long transcripts, and not getting false positives.
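
    As a rough illustration, mean coverage of a contig can be estimated as the total number of read bases assigned to it divided by the contig length. The sketch below assumes you already know which reads went into each contig; the contig names, read lengths, and the 2x cutoff are all made up for the example, since there is no hard rule.

```python
def mean_coverage(read_lengths, contig_length):
    """Estimate mean coverage as total read bases / contig length."""
    if contig_length <= 0:
        raise ValueError("contig_length must be positive")
    return sum(read_lengths) / contig_length


def low_coverage_contigs(contigs, min_cov):
    """Return names of contigs whose estimated coverage is below min_cov.

    `contigs` maps a contig name to (contig_length, list_of_read_lengths).
    """
    return [
        name
        for name, (length, reads) in contigs.items()
        if mean_coverage(reads, length) < min_cov
    ]


# Hypothetical example data (not from a real assembly)
contigs = {
    "abundant_cDNA": (1000, [250] * 12),  # 3000 read bases over 1000 bp
    "rare_cDNA": (500, [250] * 2),        # 500 read bases over 500 bp
}

print(mean_coverage([250] * 12, 1000))     # 3.0
print(low_coverage_contigs(contigs, 2.0))  # ['rare_cDNA']
```

    Raising or lowering the cutoff moves the trade-off mentioned above: a low cutoff keeps rare transcripts but admits more dubious contigs, a high one does the reverse.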

    I would assume that rare cDNAs would have much lower coverage than abundant cDNAs.
    Yes. The assumption is that the number of reads is proportional to the number of templates in the library.

    • #3
      Thanks - this is what I suspected.

      Regards,
      CH
