Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • travelk
    Member
    • Jul 2013
    • 20

    Genomic Contamination with Clontech PolyA selection?

    Hi Everyone,

    I am working with single cell RNA-Seq and am currently analyzing my data. I've noticed that I often get a lot of reads aligning to the UTR but not at all to the exons. When I look closer, I see that there is often a poly A region here.

    So, my question: Poly A tail selection using the Clontech SMARTer technology is supposed to only target mRNA (and lncRNA). However, is it possible it also sometimes catches genomic DNA? Or will the fact that it's double stranded prevent the Clontech adapters from binding to the region? Anybody else have problems with genomic contamination? Or perhaps you have another theory why I'm seeing a lot of specific alignment to to multiple areas with genomic polyA regions?

    Thanks for your insight!
  • morgane51
    Junior Member
    • Sep 2015
    • 4

    #2
    Hey I don't know about this kit but this happened to me with a Truseq kit.
    Anyway one should avoid to have genomic DNA in the starting sample...
    We noticed it before loading on the Hiseq because the genomic DNA hadn't been fragmented like the RNA. Was obvious the Bioanalyzer, we got excessively long fragments. Did you check this out?
    Did you do a DNase step before prepararing the libraries?
    If you still have RNA (I guess not...) you could make an intron test.
    what do you mean with " I'm seeing a lot of specific alignment to to multiple areas with genomic polyA regions?"
    Cheers

    Comment

    • travelk
      Member
      • Jul 2013
      • 20

      #3
      Thanks for the response!

      I don't do a DNase step because I use the Fluidigm C1 system and it's done within the chip in the machine. The tech sheet specifically states: "Ribosomal*RNA*(rRNA)*removal*or*DNase*treatment*of*RNA*samples*is*not*required*for*these*kits.These*kits*selectively*and*efficiently*amplify*polyA+*RNA*regardless*of*the*presence*of*rRNA*or*genomic*DNA."

      We do run all our samples on a Fragment Analyzer but the upper marker is set at 6000 bp and none of our samples get up that high so it didn't set off any flags. At the same time, when I look at the output, it doesn't show anything higher than that so I don't know if the genomic DNA would be missed because it is off the scale. I will look further into that. Thanks for the suggestion.

      In regards to the alignment, I use StringTie to look for potential new lncRNA and of course I do a differential expression analysis (with DESeq) between two different cell types. Occasionally, I'll find a DE target that statistically looks great, but when I look closer on IGV, there aren't any reads that align to the exons, there are only reads that align to the UTR. At first I thought maybe it was an interesting regulatory RNA element but I noticed the same pattern in numerous genes. When I looked closer at the actual sequence, I often found a polyA region. This led me to be suspicious that during our polyA selection, it might be picking up genomic DNA. The same with a lot of the new transcripts found by StringTie. They often end with a poly A region in the genome sequence (although I know this can be possible with non-coding RNA transcripts).

      Or, if it's not genomic DNA, perhaps the data is being biased by polyA sequences found in UTRs of mRNA (although I would have thought that this would cut off the end of the UTR rather than the translated exons).

      I'm not sure if that made it any clearer or more confusing...

      Comment

      Latest Articles

      Collapse

      • SEQadmin2
        Nine Things a Sample Prep Scientist Thinks About Before Sequencing
        by SEQadmin2


        I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

        Here are nine questions we think about, in roughly the order they matter, before...
        06-18-2026, 07:11 AM
      • SEQadmin2
        From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
        by SEQadmin2


        Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


        The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
        ...
        06-02-2026, 10:05 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by SEQadmin2, 06-17-2026, 06:09 AM
      0 responses
      39 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-09-2026, 11:58 AM
      0 responses
      102 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-05-2026, 10:09 AM
      0 responses
      123 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-04-2026, 08:59 AM
      0 responses
      114 views
      0 reactions
      Last Post SEQadmin2  
      Working...