Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • CompBio
    Member
    • Aug 2009
    • 26

    Why Do a Gene's Reads Appear on Both Strands?

    I've got a naive question about Illumina short reads (though it may apply to other technologies as well). I'm a computer scientist, so there are some serious gaps in my understanding of molecular biology.

    When we map short reads to a genome, we see clusters on both strands, not just the strand where a gene is annotated. In fact, the read depth appears to be almost perfectly symmetrical. I've seen the same phenomenon in published papers, though I've yet to find an explanation. What would cause this to happen?
  • ECO
    --Site Admin--
    • Oct 2007
    • 1360

    #2
    Hey Compbio,

    I'm assuming you're referring to an RNA-seq library. The read depth you're describing is a consequence of how the library is constructed. While the RNA is indeed strand specific, the conversion to cDNA can result in the loss of strand information.

    It is possible to construct RNA-seq libraries that retain the strand information, but from a molecular biology standpoint it is much easier to discard it. See the following two papers for a couple of methods for maintaining strand information:

    Lister, et al: http://www.ncbi.nlm.nih.gov/pubmed/18423832

    Cloonan, et al: http://www.ncbi.nlm.nih.gov/pubmed/18516046

    Hope that at least helps a bit.

    Comment

    • CompBio
      Member
      • Aug 2009
      • 26

      #3
      Originally posted by ECO View Post
      Hey Compbio,

      I'm assuming you're referring to an RNA-seq library. The read depth you're describing is a consequence of how the library is constructed. While the RNA is indeed strand specific, the conversion to cDNA can result in the loss of strand information.

      It is possible to construct RNA-seq libraries that retain the strand information, but from a molecular biology standpoint it is much easier to discard it. See the following two papers for a couple of methods for maintaining strand information:

      Lister, et al: http://www.ncbi.nlm.nih.gov/pubmed/18423832

      Cloonan, et al: http://www.ncbi.nlm.nih.gov/pubmed/18516046

      Hope that at least helps a bit.
      That was a fast reply - thanks! However, I'm still a bit confused by what I've been reading.

      For example, one review covers both the papers you mention (http://www.ncbi.nlm.nih.gov/pubmed/18587314). The author states that in the procedure used by Lister et al. "directional information is captured", just as in the Cloonan paper. But the figures in the Lister paper show symmetrical read distributions on both strands, while the figures in the Cloonan paper, reads appear only on one strand.

      If directional information is captured, shouldn't that give the strand implicitly?

      Comment

      • ECO
        --Site Admin--
        • Oct 2007
        • 1360

        #4
        It's been a while since I've read Lister's paper, but I think they have two different approaches buried in there...one which is very labor intensive and retains the strand info...

        Ryan lurks on this forum occasionally...maybe he'll chime in.

        Comment

        • ScottC
          Senior Member
          • Jan 2008
          • 244

          #5
          Also, there's this paper which deals with strand-specific RNA-seq data from a bacterium.

          A strand-specific RNA-Seq analysis of the transcriptome of the typhoid bacillus Salmonella typhi.

          Perkins et al., PLoS Genet. 2009 Jul;5(7):e1000569.

          High-density, strand-specific cDNA sequencing (ssRNA-seq) was used to analyze the transcriptome of Salmonella enterica serovar Typhi (S. Typhi). By mapping sequence data to the entire S. Typhi genome, we analyzed the transcriptome in a strand-specific manner and further defined transcribed regions e …

          Comment

          • marrykonta

            #6
            Re:Why Do a Gene's Reads Appear on Both Strands? Reload this Page

            Hi CompBio,

            Well,Yes they can appear on two strands.Thanks for asking these query with all of us.
            A different way to answer is to start with the estimate that two unrelated humans differ by about 3 million nucleotides (plus some copy number differences etc). This is 1 difference about every 1000 nucleotides.
            Assume that the two parents of the siblings are unrelated, then each sibling receives a genome which is essentially the consensus (the 999 nucleotides which are present in both parents and then a random choice between 2 options for the 1000th nucleotide. The two siblings should therefore differ by about 1.5 million SNPs (plus some other differences.
            These 1.5 million SNP differences far outweigh the approximately 100 new mutations for each sibling (200 total differences).

            My conclusion is that difference between siblings has very little to do with mutation and is mainly due to random inheritance of different sequences from the two parents.
            Mutation is important over spans of hundreds and thousands of generations.

            Thanks

            Comment

            • ECO
              --Site Admin--
              • Oct 2007
              • 1360

              #7
              Originally posted by marrykonta View Post
              Hi CompBio,

              Well,Yes they can appear on two strands.Thanks for asking these query with all of us.
              A different way to answer is to start with the estimate that two unrelated humans differ by about 3 million nucleotides (plus some copy number differences etc). This is 1 difference about every 1000 nucleotides.
              Assume that the two parents of the siblings are unrelated, then each sibling receives a genome which is essentially the consensus (the 999 nucleotides which are present in both parents and then a random choice between 2 options for the 1000th nucleotide. The two siblings should therefore differ by about 1.5 million SNPs (plus some other differences.
              These 1.5 million SNP differences far outweigh the approximately 100 new mutations for each sibling (200 total differences).

              My conclusion is that difference between siblings has very little to do with mutation and is mainly due to random inheritance of different sequences from the two parents.
              Mutation is important over spans of hundreds and thousands of generations.

              Thanks
              Nice sneaky spammer. Copy pasting text from other similarly related websites. Bye!

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Pathogen Surveillance with Advanced Genomic Tools
                by seqadmin




                The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
                03-24-2025, 11:48 AM
              • seqadmin
                New Genomics Tools and Methods Shared at AGBT 2025
                by seqadmin


                This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                The Headliner
                The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                03-03-2025, 01:39 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 03-20-2025, 05:03 AM
              0 responses
              49 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-19-2025, 07:27 AM
              0 responses
              57 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-18-2025, 12:50 PM
              0 responses
              50 views
              0 reactions
              Last Post seqadmin  
              Started by seqadmin, 03-03-2025, 01:15 PM
              0 responses
              201 views
              0 reactions
              Last Post seqadmin  
              Working...