Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Amative
    Member
    • Dec 2011
    • 45

    RNA-Seq alignment issue

    Hello all,

    I have been given paired-end RNA-Seq files to align against a couple of references. I used Bowtie2 to do the job. The alignment results were very low in most of the cases (less than 5% overall alignment rate).Now, we are thinking this might be caused by either contamination or mix up samples.

    Any suggestion what to do in such case?
    Thank you in advance
  • swbarnes2
    Senior Member
    • May 2008
    • 910

    #2
    Bioinformatically, there's nothing you can do, other than help the people to know what went wrong.

    For starters, spot-check some random high quality reads, BLAST them against nr, see if you can determine what they are.

    Try aligning to the whole genome to see how much of the library was genomic.

    See if there are certain highly repetitive reads (like Illumina adapters) taking up a lot of reads.

    And of course see if the run overall was of good enough quality for you to believe that your reads are accurate.

    Comment

    • rboettcher
      Member
      • Oct 2010
      • 71

      #3
      Hi Amative,

      what kind of reference did you provide? Bowtie2 is not splicing aware, so it is not able to deal with reads spanning splice junctions. Therefore, it can only be used to align against the transcriptome (for RNAseq). This is why TopHat was created to align against the whole genome.

      Regards

      Comment

      • Amative
        Member
        • Dec 2011
        • 45

        #4
        Thanks swbarnes2 & rboettcher,

        @swbarnes2
        • I tried to blast the first ten reads from one of the samples I have, blast results were not that good. I tried to align against the available sequences of the two of the top blast hits. Same low alignment rate.
        • I checked for adapters, sequences are already trimmed.


        @rboettcher
        Yes, I am aligning against the transcriptome sequences.

        Comment

        • GenoMax
          Senior Member
          • Feb 2008
          • 7142

          #5
          Originally posted by Amative View Post
          I tried to blast the first ten reads from one of the samples I have, blast results were not that good.
          You probably want to go into the file some ways. With illumina atleast the first hundred (or more) sequences may not represent the best of the lot since they are generally from the edge of the flowcell/start of the lane.

          You may also want to use this tool to do some screening: http://www.bioinformatics.babraham.a.../fastq_screen/

          Comment

          • Amative
            Member
            • Dec 2011
            • 45

            #6
            Thanks GenoMax, for the suggestion I am working on it.

            I like the fastq_screen It saves some time!

            Comment

            Latest Articles

            Collapse

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, Today, 06:09 AM
            0 responses
            11 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-09-2026, 11:58 AM
            0 responses
            33 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-05-2026, 10:09 AM
            0 responses
            38 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-04-2026, 08:59 AM
            0 responses
            43 views
            0 reactions
            Last Post SEQadmin2  
            Working...