Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • cDNA read map parameter for gsAssembler

    When I use gsAssembler from 454 to do cDNA assembling, I was confused about this parameter '-rip'.
    '-rip' --> Flag to output each read in only one contig.
    and this option default false.
    As I known for genome assembly, one read should be assembled to only one contig.
    I want to know, for cDNA assembly, reads should be assembled to multiple contigs or not? If yes, i can't understand, who can tell me why. If not, in what situation reads should be assembled to multiple contigs?
    what message can we get by assembling one read to multiple contigs?

    urgent for answer...

  • #2
    Interestingly, in Newbler 2.5 (recently released) there is a warning message when using both the -cdna and the -rip switch:

    The -rip option has no effect for cDNA assembly projects.
    After looking at the 2.5 output, it appears to be ripping reads apart. I still need to play around with the 2.5 results a bit more in order to be sure. Thus the answer to your question of "reads should be assembled to multiple contigs or not?" is, at least for Newbler, "yes". As for why, I suspect that it is because Newbler likes making Isotigs which, more or less, represent alternative splicing.

    Comment


    • #3
      Originally posted by westerman View Post
      Interestingly, in Newbler 2.5 (recently released) there is a warning message when using both the -cdna and the -rip switch:



      After looking at the 2.5 output, it appears to be ripping reads apart. I still need to play around with the 2.5 results a bit more in order to be sure. Thus the answer to your question of "reads should be assembled to multiple contigs or not?" is, at least for Newbler, "yes". As for why, I suspect that it is because Newbler likes making Isotigs which, more or less, represent alternative splicing.
      Thanks for reply and sorry for being offline due to the time difference. I still work with Newbler 2.3, maybe I need a uprade.
      I ran a set of sff file using Newbler 2.3 with '-rip' and parallel without '-rip',then compare the result '454IsotigsLayout.txt' and '454ReadStatus.txt' file. Profound differences were found between the two '454IsotigsLayout.txt' file, I am confused by which is more close to the truth.
      Another aspect, even using '-rip' option, '454ReadStatus.txt' file still contain reads assembled to multi contigs, cause difficult to get the expression level.
      I'm still confused about this.
      GK033JC03GCSGP Assembled contig00261 7 + contig00262 281 -
      GK033JC03G0PHY Assembled contig00261 7 + contig00262 436 -
      GK033JC03GNYQQ Assembled contig00261 7 + contig00262 470 -
      GK033JC03G292T Assembled contig00261 1 + contig00262 434 -
      GK033JC03GT171 Assembled contig00261 1 + contig00262 378 -

      Comment


      • #4
        Originally posted by shaojingwang View Post
        I want to know, for cDNA assembly, reads should be assembled to multiple contigs or not? If yes, i can't understand, who can tell me why.
        Both for genome assemblies and cDNA assemblies (and contrary to many other assemblers), newbler sometimes places parts of reads in different contigs. The reason why I try to explain in my blog on newbler:
        I thought to start by explaining briefly how newbler works. I’ll do this by following the output newbler generates during the assembly process. This information is displayed during assembly, …


        The -rip option has been discussed previously:
        Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc

        Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc


        Best of luck,

        flxlex

        Comment


        • #5
          That's just what I want to know, thank you.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin




            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
            04-22-2024, 07:01 AM
          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-25-2024, 11:49 AM
          0 responses
          19 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-24-2024, 08:47 AM
          0 responses
          18 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          62 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          60 views
          0 likes
          Last Post seqadmin  
          Working...
          X