Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How can I remove DNA from RNA dataset by bioinformatics?

    Dear all,

    I am a new starter as I just start to do research related to bioinformatics.
    And I am now facing a big problem.

    My demonstrator told me that my RNA data contain DNA by blasting the whole dataset. And now he would like me to find out a tool to remove DNA data.

    my research is related to metagenomics research, using RNA but not DNA.
    As I have no reference to filter DNA that may come from difference species, could anyone give me a hand?

    Is there any tools to remove DNA from RNA data that with higher accuracy than removal of blast result which indicate DNA features ?

  • #2
    Perhaps someone else will chime in with a bright idea, but I seriously doubt you can distinguish reads coming from DNA and RNA. Just randomly blasting stuff and saying, "Gee, this read falls in the middle of no where in all genomes to which it matches...must just be DNA contamination", seems like a really bad idea. I certainly hope that your instructor/demonstrator/whatever did something vastly more clever than that, but I suspect not.

    Next time, just tell whomever is preparing the samples to DNase treat things.

    Comment


    • #3
      Perhaps hellingwyk is looking to retain reads that match rRNA and discard the rest.

      If that is so look into getting the appropriate sequences databases (http://www.arb-silva.de/ or http://rdp.cme.msu.edu/) for the search.
      Last edited by GenoMax; 09-26-2012, 09:33 AM.

      Comment


      • #4
        Thank you very much for answer my question~

        However, I am now focusing on all kinds of RNA from microbe and virus in order to search novel information...............

        that's why I cannot easily find a reference to filter out DNA....................

        Comment


        • #5
          Overall, if you are looking for all kinds of RNA than you need to trust your library preps and assume that all of the sequences you are seeing are RNAs and not DNAs (and if stuff is mapping to things that have not previously known to be transcribed that doesn't mean it's DNA, it means that perhaps there is more transcription in your system than previously characterized). BLASTing to the genome and assuming that something is DNA because it hasn't been shown to be transcribed before is not evidence of contamination (although it could be, I suppose; hard to say without knowing more of your experimental setup).

          In eukaryotes, a "rough" way of doing this would be to look for evidence of unspliced transcripts, and if their numbers are extremely(!) high - to go back and redo the library preps.

          Comment


          • #6
            I basically agree with dvanic. There is not going to be any reliable way of distinguishing DNA reads from RNA reads when they are mixed together in an RNA library. If this distinction is important to your analysis then you need to verify that efforts were made to remove the DNA that will contaminate any RNA prep from that prep during library construction. In most cases this should include a DNAse treatment of the purified RNA.

            If the RNA library is strand-specific you could re-assure yourself that this was the case by making sure that strand-specificity is reflected in the data. Not sure what a reasonable ratio of +strand to -strand is in a pure RNA library, but it should be pretty high, overall.

            --
            Phillip

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Latest Developments in Precision Medicine
              by seqadmin



              Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

              Somatic Genomics
              “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
              Yesterday, 01:16 PM
            • seqadmin
              Recent Advances in Sequencing Analysis Tools
              by seqadmin


              The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
              05-06-2024, 07:48 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Yesterday, 07:15 AM
            0 responses
            13 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-23-2024, 10:28 AM
            0 responses
            17 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-23-2024, 07:35 AM
            0 responses
            20 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-22-2024, 02:06 PM
            0 responses
            10 views
            0 likes
            Last Post seqadmin  
            Working...
            X