Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • lihairi
    Junior Member
    • Oct 2009
    • 5

    question about RNA-seq

    Hi everyone,

    I have a question about RNA-seq.

    "Some manipulations during library construction also complicate the analysis
    of RNA-Seq results. For example, many shorts reads that are identical to each
    other can be obtained from cDNA libraries that have been amplified. These could be a genuine reflection of abundant RNA species, or they could be PCR artefacts. One way to discriminate between these possibilities is to determine whether the same sequences are observed in different biological replicates.(Nat Rev Genet. 2009 Jan;10(1):57-63)"

    Is there any other way in the above situation to discriminate between a genuine reflection of abundant RNA or PCR artefacts?

    Thanks,

    Hai-Ri Li
  • simonandrews
    Simon Andrews
    • May 2009
    • 870

    #2
    If your library has been randomly fragmented then it should be possible to look for fragments which appear way more frequently than would be expected by chance. In even a short mRNA you should have a range of different possible fragments and if this is an abundant transcript most or all of these should appear more frequently. PCR artefacts usually affect only a small subset of all possible fragments and would produce a very uneven distribution of fragments over the transcript.

    Comment

    • lihairi
      Junior Member
      • Oct 2009
      • 5

      #3
      Thank you for interpretation. You mean we can ignore PCR artefacts because they affect only a small subsets of fragments and it will not affect final analysis results. However, sometimes we found so many tags hit the same positions so that we cannot ingore them.

      Again, is there any other way in the above situation to discriminate between a genuine reflection of abundant RNA or PCR artefacts?

      Thanks,

      Hai-Ri Li

      Comment

      • simonandrews
        Simon Andrews
        • May 2009
        • 870

        #4
        Originally posted by lihairi View Post
        You mean we can ignore PCR artefacts because they affect only a small subsets of fragments and it will not affect final analysis results.
        No, that's not what I'm saying. What I was trying to say was that you can normally distinguish PCR artefacts from expression changes because expression changes normally involved the even enrichment of a large number of different fragments over the expressed region, whereas PCR artefacts usually take only a small number of fragments and amplify them to an unnatural degree when viewed in the context of the surrounding fragments.

        We usually filter our data by measuring the percentage of reads in a region which come from exact overlaps. If this value is above 5-10% then we reject it as a likely PCR artefact. I'm intending to move this to an observed/expected calculation though as this is less prone to errors in very short regions with high coverage.

        Comment

        • lihairi
          Junior Member
          • Oct 2009
          • 5

          #5
          Last time you mentioned "We usually filter our data by measuring the percentage of reads in a region which come from exact overlaps". Here region must mean a window, 100 base? 200 base?

          How to do observed/expected calculation?

          Thanks.

          Comment

          • simonandrews
            Simon Andrews
            • May 2009
            • 870

            #6
            Originally posted by lihairi View Post
            Last time you mentioned "We usually filter our data by measuring the percentage of reads in a region which come from exact overlaps". Here region must mean a window, 100 base? 200 base?
            In our case region is pretty generic - sometimes we use fixed size windows (with a size which depends normally on our data density), in other cases we construct contigs from sets of overlapping reads, or we might design probes over particular classes of annotation feature (genes, exons, microRNAs, whatever). These things change depending on what kind of experiment you're running.


            Originally posted by lihairi View Post
            How to do observed/expected calculation?
            We're actually not using a proper O/E calculation at the moment (though it would be nice to move to that). Our filter calculates what percentage of reads which overlap a particular region come from exact overlaps, with the same start and end position. For randomly placed reads this value is usually very low (below 5%), but in some cases you will see towers of exactly duplicated reads which usually indicate a mapping or PCR problem, and we filter these out. You can also get high values from low absolute numbers of reads, so you either need to account for this, or ignore it if you're going to filter those regions anyway.

            I hope that makes things a bit clearer.

            Comment

            • lihairi
              Junior Member
              • Oct 2009
              • 5

              #7
              Recently I downloaded a lot of RNA-seq data from NCBI and mapped to Reference RNA using eland_25. I found around 30-40% of tags were mapped to the exactly same postions as others (even though removing the effect of RNA isorforms), much higher than 5%. I do not know how to interpretate this results.

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM
              • SEQadmin2
                Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                by SEQadmin2


                With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                Introduction

                Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                05-22-2026, 06:42 AM
              • SEQadmin2
                Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                by SEQadmin2

                Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                05-06-2026, 09:04 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, Today, 08:59 AM
              0 responses
              9 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-02-2026, 12:03 PM
              0 responses
              21 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-02-2026, 11:40 AM
              0 responses
              17 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 05-28-2026, 11:40 AM
              0 responses
              30 views
              0 reactions
              Last Post SEQadmin2  
              Working...