Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Genomic Contamination with Clontech PolyA selection?

    Hi Everyone,

    I am working with single cell RNA-Seq and am currently analyzing my data. I've noticed that I often get a lot of reads aligning to the UTR but not at all to the exons. When I look closer, I see that there is often a poly A region here.

    So, my question: Poly A tail selection using the Clontech SMARTer technology is supposed to only target mRNA (and lncRNA). However, is it possible it also sometimes catches genomic DNA? Or will the fact that it's double stranded prevent the Clontech adapters from binding to the region? Anybody else have problems with genomic contamination? Or perhaps you have another theory why I'm seeing a lot of specific alignment to to multiple areas with genomic polyA regions?

    Thanks for your insight!

  • #2
    Hey I don't know about this kit but this happened to me with a Truseq kit.
    Anyway one should avoid to have genomic DNA in the starting sample...
    We noticed it before loading on the Hiseq because the genomic DNA hadn't been fragmented like the RNA. Was obvious the Bioanalyzer, we got excessively long fragments. Did you check this out?
    Did you do a DNase step before prepararing the libraries?
    If you still have RNA (I guess not...) you could make an intron test.
    what do you mean with " I'm seeing a lot of specific alignment to to multiple areas with genomic polyA regions?"
    Cheers

    Comment


    • #3
      Thanks for the response!

      I don't do a DNase step because I use the Fluidigm C1 system and it's done within the chip in the machine. The tech sheet specifically states: "Ribosomal*RNA*(rRNA)*removal*or*DNase*treatment*of*RNA*samples*is*not*required*for*these*kits.These*kits*selectively*and*efficiently*amplify*polyA+*RNA*regardless*of*the*presence*of*rRNA*or*genomic*DNA."

      We do run all our samples on a Fragment Analyzer but the upper marker is set at 6000 bp and none of our samples get up that high so it didn't set off any flags. At the same time, when I look at the output, it doesn't show anything higher than that so I don't know if the genomic DNA would be missed because it is off the scale. I will look further into that. Thanks for the suggestion.

      In regards to the alignment, I use StringTie to look for potential new lncRNA and of course I do a differential expression analysis (with DESeq) between two different cell types. Occasionally, I'll find a DE target that statistically looks great, but when I look closer on IGV, there aren't any reads that align to the exons, there are only reads that align to the UTR. At first I thought maybe it was an interesting regulatory RNA element but I noticed the same pattern in numerous genes. When I looked closer at the actual sequence, I often found a polyA region. This led me to be suspicious that during our polyA selection, it might be picking up genomic DNA. The same with a lot of the new transcripts found by StringTie. They often end with a poly A region in the genome sequence (although I know this can be possible with non-coding RNA transcripts).

      Or, if it's not genomic DNA, perhaps the data is being biased by polyA sequences found in UTRs of mRNA (although I would have thought that this would cut off the end of the UTR rather than the translated exons).

      I'm not sure if that made it any clearer or more confusing...

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin




        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
        04-22-2024, 07:01 AM
      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-25-2024, 11:49 AM
      0 responses
      19 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-24-2024, 08:47 AM
      0 responses
      19 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      62 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      60 views
      0 likes
      Last Post seqadmin  
      Working...
      X