Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Bowtie: How to retain only uniquely mapped reads?

    Hi!

    I have been searching for the answer to this questions on the internet (google, this forum, bowtie manual), but can't find a clear answer.

    I am mapping RNASeq reads to the mouse genome . If one read aligns to multiple positions on the genome I want it to not be mapped at all.
    I used the --best option for this, but I am not sure if it's right. The bowtie manual only says: "--best: Make Bowtie guarantee that reported singleton alignments are "best" in terms of stratum [...] and in terms of the quality values at the mismatched position(s)." But what happens if two alignments are equally good?

    If it is not right to use --best for this purpose, which combination of output options can I use?

    Thanks for any help.

    Cheers,
    Esther

  • #2
    -m 1

    exemple on bowtie website :

    Example 7: -a -m 3

    $ ./bowtie -a -m 3 -v 2 e_coli -c ATGCATCATGCGCCAT
    No alignments
    Specifying -m 3 instructs bowtie to refrain from reporting any alignments for reads having more than 3 reportable alignments. The -m option is useful when the user would like to guarantee that reported alignments are "unique", for some definition of unique.

    Comment


    • #3
      Thank you very much, Nico. I'll try with that one.

      I am still wondering though, what happens with the output using --best if there are two (or more) best (equally good) alignments.

      And am I right if I say that it doesn't make sense to combine -m 1 with -n 2 (allow 2 mismatches), because this would throw out a read entirely if it matches one time perfect and another time with mismatches? I am having a dilemma here, because I want to allow for mismatches.

      If --best threw out a read entirely if it has two equally good best alignments and still reported reads that have more than one alignment (including one best) this would solve my problem, wouldn't it? Does --best behave this way?

      Cheers,
      Esther

      Comment


      • #4
        yeah indeed allowing two mismatches and only one alignment in the whole genome will not give you good results ( I guess, try it to see the results ).

        Comment


        • #5
          I think the solution to your problem would be a combination of several options, such as -n 2 -l {your_read_length} --best -m 1.

          If you do it this way --best is trying to report the best alignment while allowing up to 2 mismatches. If there are more equally good alignments which would be considered 'best', none of them will be reported.

          Comment


          • #6
            Thanks very much to the both of you!

            I now figured out entirely how all the different bowtie output options work together. Sorry for my question, I could have answered it by reading the manual very carefully, but as a starter in this topic I was very confused.

            So, thanks again!

            Comment


            • #7
              Hi there
              For human analysis I used -n 2 and -m 1 but I obtained variations within repeat area. What do you think I should do to avoid such results?
              Thanks

              Comment


              • #8
                I forget to say :
                My reads are 76 bases long

                Comment


                • #9
                  No idea?
                  Thanks for your time

                  Comment


                  • #10
                    It is not a good idea to be using Bowtie to map RNA-seq reads. Why aren't you using TopHat ? You are mapping mouse reads and they do have introns, because a mouse is a Eukaryote.

                    Comment


                    • #11
                      I may be mistaken. But I think you also need the --strata option to do what you want. E.g.
                      -a -n 2 -m 1 --best --strata

                      You need -a otherwise -m 1 is meaningless (e.g. -k 1 ensures more than 1 alignment will never be reported anyway, so -m 1 will never get evaluated.)
                      In that case --best by itself just lists all the alignments from best to worst, whereas --strata will cause it to only report alignments with the best strata, then -m 1 can filter out cases where there are more than 1 alignment matching the best strata.
                      Last edited by dhirallin; 03-24-2018, 11:42 PM.

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Recent Advances in Sequencing Analysis Tools
                        by seqadmin


                        The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
                        Today, 07:48 AM
                      • seqadmin
                        Essential Discoveries and Tools in Epitranscriptomics
                        by seqadmin




                        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                        04-22-2024, 07:01 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, Today, 07:17 AM
                      0 responses
                      11 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 05-02-2024, 08:06 AM
                      0 responses
                      19 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-30-2024, 12:17 PM
                      0 responses
                      20 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-29-2024, 10:49 AM
                      0 responses
                      28 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X