Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • wangzkai
    Member
    • Feb 2010
    • 11

    Picard MarkDuplicates

    This may be a naive question, but I was trying to figure out whether I should set "REMOVE_DUPLICATES" to true or false when using picard's "MarkDuplicates" to remove duplicate reads. Since I want to subsequently call variants using samtools pileup, I am not sure whether samtools pileup will then remove from consideration these duplicate reads that are marked by flags when it calls SNPs.

    By setting the "REMOVE_DUPLICATES=true", my understanding is that the duplicates read will not even be written to the output file, which sounds a bit safer ...

    Thanks for any insight on this!
  • nilshomer
    Nils Homer
    • Nov 2008
    • 1283

    #2
    Originally posted by wangzkai View Post
    This may be a naive question, but I was trying to figure out whether I should set "REMOVE_DUPLICATES" to true or false when using picard's "MarkDuplicates" to remove duplicate reads. Since I want to subsequently call variants using samtools pileup, I am not sure whether samtools pileup will then remove from consideration these duplicate reads that are marked by flags when it calls SNPs.

    By setting the "REMOVE_DUPLICATES=true", my understanding is that the duplicates read will not even be written to the output file, which sounds a bit safer ...

    Thanks for any insight on this!
    Save the duplicates so you never lose any data. You can them use the "-m" option in 'samtools pileup' to filter reads based on their flag (including duplicates: 0x400). If you need to make sure that your "FLAG" is set correctly, see http://picard.sourceforge.net/explain-flags.html.

    Comment

    • xguo
      Member
      • Jul 2008
      • 48

      #3
      Originally posted by nilshomer View Post
      Save the duplicates so you never lose any data. You can them use the "-m" option in 'samtools pileup' to filter reads based on their flag (including duplicates: 0x400). If you need to make sure that your "FLAG" is set correctly, see http://picard.sourceforge.net/explain-flags.html.
      Is the read flag included in pileup file? I'm thinking to call SNP using all reads and then filter them based on the number of non-duplicate reads supporting a SNP.

      Comment

      Latest Articles

      Collapse

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by SEQadmin2, Yesterday, 10:09 AM
      0 responses
      9 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-04-2026, 08:59 AM
      0 responses
      17 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-02-2026, 12:03 PM
      0 responses
      26 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-02-2026, 11:40 AM
      0 responses
      21 views
      0 reactions
      Last Post SEQadmin2  
      Working...