Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • bowtie2 parameters for chip-seq

    Working with the new bowtie2, anybody here done alignment for chip-seq using it. Please comment of choice of parameters

    It is common to choose reads if they match uniquely, for old --best and -m 1 did the trick, not sure about bowtie2, will try -M 1.

  • #2
    Is there any suggestion about this question?

    Comment


    • #3
      From Bowtie2's Manual

      it is said:
      =======
      Mapping quality: higher = more unique

      Accurate mapping qualities are useful for downstream tools like variant callers. For instance, a variant caller might choose to ignore evidence from alignments with mapping quality less than, say, 10. A mapping quality of 10 or less indicates that there is at least a 1 in 10 chance that the read truly originated elsewhere.
      =======

      Could we chose a threshold for Mapping quality for Chip-seq? For example, all reads with

      Mapping quality higher than 30 are considered as uniquely mapping reads ?

      tks

      Comment


      • #4
        I think I find the answer from this paper.

        The answer is YES.

        "...Reads were filtered by removing those with a BWA alignment quality score less than 15..."

        Differential oestrogen receptor binding is associated with clinical outcome in breast cancer
        Nature, Vol. advance online publication (4 January 2012) doi:10.1038/nature10730


        Originally posted by harryzs View Post
        From Bowtie2's Manual

        it is said:
        =======
        Mapping quality: higher = more unique

        Accurate mapping qualities are useful for downstream tools like variant callers. For instance, a variant caller might choose to ignore evidence from alignments with mapping quality less than, say, 10. A mapping quality of 10 or less indicates that there is at least a 1 in 10 chance that the read truly originated elsewhere.
        =======

        Could we chose a threshold for Mapping quality for Chip-seq? For example, all reads with

        Mapping quality higher than 30 are considered as uniquely mapping reads ?

        tks

        Comment


        • #5
          What does a mapping quality of 0 mean then? That the read may have originated anywhere in the genome?
          And if I understand well, reads with low Mapq should be filtered before calling peaks, right?
          Luca

          Comment


          • #6
            In bowtie2, a MAPQ of 0 means one of the following:
            1. The reported alignment and the next best alignment are both equivalently good, but neither are exact matches (if they're exact matches, the MAPQ is set to 1).
            2. The absolute difference in alignment score between the best and second best alignment is >= 10% (and <30%) of the maximum possible difference in alignment scores and the best alignment's score is itself <67% of the maximum difference in alignment scores.

            This is for end-to-end alignments. For local alignments, only #1 will produce this (#2 would produce MAPQs of 9, 12, 14, or 17, depending).

            Yes, this is highly confusing and no, it's not documented (unless you consider source code to be documentation).

            Comment


            • #7
              Originally posted by dpryan View Post
              Yes, this is highly confusing


              Right, then the reasonable way to proceed is to keep the aligned tags with say MAPQ>10 and call the peaks with them. Does it make sense?
              Is it possible that the low-scoring tags are still informative, e.g. on the binding to repetitive sequences?

              Comment


              • #8
                Originally posted by crepaldi View Post


                Right, then the reasonable way to proceed is to keep the aligned tags with say MAPQ>10 and call the peaks with them. Does it make sense?
                Is it possible that the low-scoring tags are still informative, e.g. on the binding to repetitive sequences?
                That seems like a sensible MAPQ threshold. I agree that the multimappers can still be quite informative. It's likely a good idea to look at them in IGV and bring up a repeatmasker track to see if these might turn out to be interesting or not. The last thing you want to do is throw out multimappers if it turns out that your protein does bind to a repeat region!

                Comment


                • #9
                  Yes. I agree.
                  See how people from ENCODE are doing:
                  samtools view -b -F 1548 -q 30 chipSampleRep1.bam

                  they(Anshul Kundaje) use -q 30 in their guideline.

                  Access Google Sites with a personal Google account or Google Workspace account (for business use).

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Exploring the Dynamics of the Tumor Microenvironment
                    by seqadmin




                    The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
                    07-08-2024, 03:19 PM
                  • seqadmin
                    Exploring Human Diversity Through Large-Scale Omics
                    by seqadmin


                    In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
                    06-25-2024, 06:43 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, Today, 11:09 AM
                  0 responses
                  16 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 07-19-2024, 07:20 AM
                  0 responses
                  148 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 07-16-2024, 05:49 AM
                  0 responses
                  121 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 07-15-2024, 06:53 AM
                  0 responses
                  111 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X