Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Picard CalculateHsMetrics Function - what do all the output abbreviations mean?

    Hi,

    I am using the CalculateHsMetrics in Picard to estimate the per cent of on and off target reads for some next generation sequence data that I am aligning against a reference genome. I am running Picard via the Galaxy server (usegalaxy.org/).

    I am having trouble understanding some of the program outputs. A list of my specific queries is given below. Any advice anyone can give me would be greatly appreciated.


    What do the following outputs stand for?
    PF_READS
    PF_UQ_READS_ALIGNED 5403054
    PCT_PF_UQ_READS_ALIGNED
    PF_UQ_BASES_ALIGNED
    ON_BAIT_BASES
    ON_BAIT_VS_SELECTED
    FOLD_ENRICHMENT
    ZERO_CVG_TARGETS_PCT
    FOLD_80_BASE_PENALTY
    HS_PENALTY

    I am also unsure about exactly what these outputs refer to:
    BAIT_TERRITORY and TARGET_TERRITORY - are these references to the total number of base pairs captured by the baits and targets?
    NEAR_BAIT_BASES - how near the baits are these bases? I understand that they cover a 250bp interval around each target or bait. Is this what the output is referring to?
    PCT_SELECTED_BASES - does this refer to the percent of the referene genome captured by the targets or baits?
    OFF_BAIT_BASES vs NEAR_BAIT_BASES - does the count of off-target bases include those bases that are near the baits/targets?

  • #2
    refer to this page http://picard.sourceforge.net/picard...html#HsMetrics

    Comment


    • #3
      Thank you ersgupta, much appreciated.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Recent Developments in Metagenomics
        by seqadmin





        Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
        09-23-2024, 06:35 AM
      • seqadmin
        Understanding Genetic Influence on Infectious Disease
        by seqadmin




        During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

        Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
        09-09-2024, 10:59 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 10-02-2024, 04:51 AM
      0 responses
      13 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 10-01-2024, 07:10 AM
      0 responses
      21 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 09-30-2024, 08:33 AM
      0 responses
      25 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 09-26-2024, 12:57 PM
      0 responses
      18 views
      0 likes
      Last Post seqadmin  
      Working...
      X