Header Leaderboard Ad

Collapse

bowtie2 parameters for chip-seq

Collapse

Announcement

Collapse

SEQanswers June Challenge Has Begun!

The competition has begun! We're giving away a $50 Amazon gift card to the member who answers the most questions on our site during the month. We want to encourage our community members to share their knowledge and help each other out by answering questions related to sequencing technologies, genomics, and bioinformatics. The competition is open to all members of the site, and the winner will be announced at the beginning of July. Best of luck!

For a list of the official rules, visit (https://www.seqanswers.com/forum/sit...wledge-and-win)
See more
See less
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • bowtie2 parameters for chip-seq

    Working with the new bowtie2, anybody here done alignment for chip-seq using it. Please comment of choice of parameters

    It is common to choose reads if they match uniquely, for old --best and -m 1 did the trick, not sure about bowtie2, will try -M 1.

  • #2
    Is there any suggestion about this question?

    Comment


    • #3
      From Bowtie2's Manual

      it is said:
      =======
      Mapping quality: higher = more unique

      Accurate mapping qualities are useful for downstream tools like variant callers. For instance, a variant caller might choose to ignore evidence from alignments with mapping quality less than, say, 10. A mapping quality of 10 or less indicates that there is at least a 1 in 10 chance that the read truly originated elsewhere.
      =======

      Could we chose a threshold for Mapping quality for Chip-seq? For example, all reads with

      Mapping quality higher than 30 are considered as uniquely mapping reads ?

      tks

      Comment


      • #4
        I think I find the answer from this paper.

        The answer is YES.

        "...Reads were filtered by removing those with a BWA alignment quality score less than 15..."

        Differential oestrogen receptor binding is associated with clinical outcome in breast cancer
        Nature, Vol. advance online publication (4 January 2012) doi:10.1038/nature10730


        Originally posted by harryzs View Post
        From Bowtie2's Manual

        it is said:
        =======
        Mapping quality: higher = more unique

        Accurate mapping qualities are useful for downstream tools like variant callers. For instance, a variant caller might choose to ignore evidence from alignments with mapping quality less than, say, 10. A mapping quality of 10 or less indicates that there is at least a 1 in 10 chance that the read truly originated elsewhere.
        =======

        Could we chose a threshold for Mapping quality for Chip-seq? For example, all reads with

        Mapping quality higher than 30 are considered as uniquely mapping reads ?

        tks

        Comment


        • #5
          What does a mapping quality of 0 mean then? That the read may have originated anywhere in the genome?
          And if I understand well, reads with low Mapq should be filtered before calling peaks, right?
          Luca

          Comment


          • #6
            In bowtie2, a MAPQ of 0 means one of the following:
            1. The reported alignment and the next best alignment are both equivalently good, but neither are exact matches (if they're exact matches, the MAPQ is set to 1).
            2. The absolute difference in alignment score between the best and second best alignment is >= 10% (and <30%) of the maximum possible difference in alignment scores and the best alignment's score is itself <67% of the maximum difference in alignment scores.

            This is for end-to-end alignments. For local alignments, only #1 will produce this (#2 would produce MAPQs of 9, 12, 14, or 17, depending).

            Yes, this is highly confusing and no, it's not documented (unless you consider source code to be documentation).

            Comment


            • #7
              Originally posted by dpryan View Post
              Yes, this is highly confusing


              Right, then the reasonable way to proceed is to keep the aligned tags with say MAPQ>10 and call the peaks with them. Does it make sense?
              Is it possible that the low-scoring tags are still informative, e.g. on the binding to repetitive sequences?

              Comment


              • #8
                Originally posted by crepaldi View Post


                Right, then the reasonable way to proceed is to keep the aligned tags with say MAPQ>10 and call the peaks with them. Does it make sense?
                Is it possible that the low-scoring tags are still informative, e.g. on the binding to repetitive sequences?
                That seems like a sensible MAPQ threshold. I agree that the multimappers can still be quite informative. It's likely a good idea to look at them in IGV and bring up a repeatmasker track to see if these might turn out to be interesting or not. The last thing you want to do is throw out multimappers if it turns out that your protein does bind to a repeat region!

                Comment


                • #9
                  Yes. I agree.
                  See how people from ENCODE are doing:
                  samtools view -b -F 1548 -q 30 chipSampleRep1.bam

                  they(Anshul Kundaje) use -q 30 in their guideline.

                  Comment

                  Latest Articles

                  Collapse

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, Today, 01:08 PM
                  0 responses
                  5 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 06-01-2023, 08:56 PM
                  0 responses
                  12 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 06-01-2023, 07:33 AM
                  0 responses
                  88 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 05-31-2023, 07:50 AM
                  0 responses
                  126 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X