Header Leaderboard Ad

Collapse

Percentage of mapped reads ?

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Percentage of mapped reads ?

    Hi all,

    I am new to the RNA seq world. I have processed bunch of illumina paired end reads. The reported alignment is only 49%.. I was wondering if this is normal. Here's the command and output. Thanks.

    bowtie -n 3 -p 10 --best -e 200 --trim5 15 --trim3 25 --sam BowtieIndexes/mm9 -1 Raw_files/sample-1_export.fq -2 sample-2_export.fq sample.sam


    # reads processed: 21482797
    # reads with at least one reported alignment: 10637665 (49.52%)
    # reads that failed to align: 10845132 (50.48%)
    Reported 10637665 paired-end alignments to 1 output stream(s)

  • #2
    If you could provide us with a little more information about the data you are analyzing that would be helpful. What are the reads from?
    I have had some samples that have had 90+% accuracy, but with things that are likely high in indels like cancer, I have gotten as low as 60ish %..

    Comment


    • #3
      This is a mouse cancer dataset. However not a lot indels are expected. Some of the samples are normal controls. Any more information needed?

      Comment


      • #4
        That's fairly low then. Try rerunning a set of the samples instead of as paired-end, as separate single reads. Also, try trimming less off of the reads, because there is a possibility that that decreases the number of possible locations for the paired ends to match up.

        Let me know the results.

        Comment


        • #5
          If you are analyzing RNAseq data why are you using bowtie and not tophat? Your percent aligned will be low with bowtie alone as only reads aligning to an exons will map, while all reads crossing a exon-exon junction will not align. Depending on the read length this can be a large percentage or your reads.

          Comment


          • #6
            Plassaaw,

            I used mm9_with_junctions index which supposedly takes care of the exon-exon junctions .

            When I run single reads on these two paired_end reads _1 and _2, 80% and 78% of the reads are aligned.

            However, when I run paired-end alignment, only 50% of the reads are aligned... Any idea?



            # reads processed: 21482797
            # reads with at least one reported alignment: 17370381 (80.86%)
            # reads that failed to align: 4112416 (19.14%)
            Reported 17370381 alignments to 1 output stream(s)

            Comment


            • #7
              single read mapping can reach 80%

              # reads processed: 34439174
              # reads with at least one reported alignment: 27167675 (78.89%)
              # reads that failed to align: 7271499 (21.11%)

              Well paired mapping is only 52%

              Any idea? Thanks!

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Improved Targeted Sequencing: A Comprehensive Guide to Amplicon Sequencing
                by seqadmin



                Amplicon sequencing is a targeted approach that allows researchers to investigate specific regions of the genome. This technique is routinely used in applications such as variant identification, clinical research, and infectious disease surveillance. The amplicon sequencing process begins by designing primers that flank the regions of interest. The DNA sequences are then amplified through PCR (typically multiplex PCR) to produce amplicons complementary to the targets. RNA targets...
                03-21-2023, 01:49 PM
              • seqadmin
                Targeted Sequencing: Choosing Between Hybridization Capture and Amplicon Sequencing
                by seqadmin




                Targeted sequencing is an effective way to sequence and analyze specific genomic regions of interest. This method enables researchers to focus their efforts on their desired targets, as opposed to other methods like whole genome sequencing that involve the sequencing of total DNA. Utilizing targeted sequencing is an attractive option for many researchers because it is often faster, more cost-effective, and only generates applicable data. While there are many approaches...
                03-10-2023, 05:31 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Yesterday, 11:44 AM
              0 responses
              10 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-24-2023, 02:45 PM
              0 responses
              18 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-22-2023, 12:26 PM
              0 responses
              22 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-17-2023, 12:32 PM
              0 responses
              19 views
              0 likes
              Last Post seqadmin  
              Working...
              X