Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • foxyg
    Member
    • May 2010
    • 54

    at which step should I remove duplicate

    I tend to think I should run picard to remove duplicate after I use GATK to do realign and recalibrate, then remove duplicate. Or should I remove duplicate first.

    I fail to understand the algorithm Picard uses to remove duplicates. Can someone explains how does Picard determine if a read is duplicate?

    Thanks
  • drio
    Senior Member
    • Oct 2008
    • 323

    #2
    I don't think it should affect if you do it before or after (I do it before).

    Picard looks for reads that are aligned in the same position and same direction and marks them as duplicates based on the basecall quality. If working with PE/MP data, both ends are taken into account.
    -drd

    Comment

    Latest Articles

    Collapse

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by SEQadmin2, 06-05-2026, 10:09 AM
    0 responses
    12 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-04-2026, 08:59 AM
    0 responses
    24 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-02-2026, 12:03 PM
    0 responses
    28 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-02-2026, 11:40 AM
    0 responses
    22 views
    0 reactions
    Last Post SEQadmin2  
    Working...