Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Interpreting 5'/3' bias

    Hi All,
    We have 100bp PE RNAseq data generated on a HiSeq2000 from a library prepped with the Nugen Ovation v2 kit. The samples are from human and rat, with the human samples of lesser quality (RIN<7) whilst the rat are fine. Reads were aligned with bowtie using default options.

    Using RSeQC we see the following profiles of counts across transcripts:

    Click image for larger version

Name:	rseqc_gene_coverage.png
Views:	1
Size:	142.2 KB
ID:	307786

    Does anyone have any ideas on how to interpret this? I was expecting a possible 3' bias from the degraded(?) samples and/or the library prep, but wasn't expecting to see both a 3' and 5' bias...

    What would you do on the back of this, anything?

    Thanks.
    @sidderb

  • #2
    Can anyone help?
    @sidderb

    Comment


    • #3
      What %of reads were mapped?

      Comment


      • #4
        ~60% in all samples
        @sidderb

        Comment


        • #5
          I suspect the library prep method was not a stranded protocol. This would result in ~1/2 of the PE data being in one orientation (i.e. reads are 5'-3' relative to transcript) and the other 1/2 being in the opposite orientation (3'-5'). I have heard the ovation kit has a 3' bias, so in your case the 5' bias is simply the reads in the antisense direction.

          I am not sure this is correct, just a hunch.

          Comment


          • #6
            I would suspect at the library prep phase as scooter.But since the data is out - I would try to improve mapping by using other aligners.
            Bowtie is good for RNA seq with end to end mapping (sensitive mode) but in that way it is more stringent and I personally dont want to use it as my first aligner of choice.
            If you like Bowtie's performance - I would suggest you to use Bowtie2 - which allows gapped alignment.
            I would also suggest you to use bwa just to have another comparison of mapping.

            By the way have you done pre-mapping QC?Did you had to trim the reads?
            what were you mapping to?

            -Aparna

            Comment


            • #7
              Originally posted by sidderb View Post
              Hi All,
              We have 100bp PE RNAseq data generated on a HiSeq2000 from a library prepped with the Nugen Ovation v2 kit. The samples are from human and rat, with the human samples of lesser quality (RIN<7) whilst the rat are fine. Reads were aligned with bowtie using default options.

              Using RSeQC we see the following profiles of counts across transcripts:

              [ATTACH]1333[/ATTACH]

              Does anyone have any ideas on how to interpret this? I was expecting a possible 3' bias from the degraded(?) samples and/or the library prep, but wasn't expecting to see both a 3' and 5' bias...

              What would you do on the back of this, anything?

              Thanks.
              Doesn't this kit do ligations to the end (or extensions from the end)? If so, it might be the result of a failure of the RNA in the human sample to fragment prior to adding adapters.

              If you had provided a link to the protocol I might have read it and been able to give you a more targeted answer.

              --
              Phillip

              Comment


              • #8
                Originally posted by sidderb View Post
                Hi All,
                We have 100bp PE RNAseq data generated on a HiSeq2000 from a library prepped with the Nugen Ovation v2 kit. The samples are from human and rat, with the human samples of lesser quality (RIN<7) whilst the rat are fine. Reads were aligned with bowtie using default options.

                Using RSeQC we see the following profiles of counts across transcripts:

                [ATTACH]1333[/ATTACH]

                Does anyone have any ideas on how to interpret this? I was expecting a possible 3' bias from the degraded(?) samples and/or the library prep, but wasn't expecting to see both a 3' and 5' bias...

                What would you do on the back of this, anything?

                Thanks.
                Maybe you can try "read_distribution.py" in RSeQC package to reaffirm this observation. You are expecting to see higher coverage in both 3'UTR and 5'UTR than the internal CDS exons.

                Comment


                • #9
                  Did you map to a genome or transcriptome?
                  If you mapped to a genome, you should try mapping with something specifically made for RNA-Seq data like Tophat or STAR so that you can map reads split across an exon-exon junction. I have noticed a larger 3'UTR bias sometimes when I map RNA-Seq data with Bowtie or BWA to a genome. The 3'UTR is often the longest part of a gene and you'll get more reads mapping there since there are fewer split reads.

                  Comment


                  • #10
                    I have seen a similar large 3' bias in rRNA-depleted libraries, but not poly-A selected ones. I don't know how to explain it, though.

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Recent Developments in Metagenomics
                      by seqadmin





                      Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
                      09-23-2024, 06:35 AM
                    • seqadmin
                      Understanding Genetic Influence on Infectious Disease
                      by seqadmin




                      During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

                      Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
                      09-09-2024, 10:59 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 10-02-2024, 04:51 AM
                    0 responses
                    11 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 10-01-2024, 07:10 AM
                    0 responses
                    19 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 09-30-2024, 08:33 AM
                    0 responses
                    24 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 09-26-2024, 12:57 PM
                    0 responses
                    17 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X