Unconfigured Ad

**N311V** · 07-31-2014, 02:05 AM

In regards to your first question, I do think GATK UnifiedGenotyper would have treated each different read group as a different sample (http://gatkforums.broadinstitute.org...bout-bam-files).

Is there a particular reason you're using the UnifiedGenotyper? HaplotypeCaller is it's successor (http://www.broadinstitute.org/gatk/g...-discovery-ovw).

**Jolin** · 07-31-2014, 05:24 AM

Hi N311V,

Thank you very much. If they treat different read groups as different samples, then the read groups of each lane are supposed to be the same, right? But this is not mentioned at all in GATK website.

I just called SNPs not indels. So unified genotyper seems to be faster. Did HaplotyperCaller run better than Unified Genotyper in your project?

**westerman** · 07-31-2014, 08:08 AM

From the GATK web page:

The HaplotypeCaller is a more recent and sophisticated tool than the UnifiedGenotyper. Its ability to call SNPs is equivalent to that of the UnifiedGenotyper, and its ability to call indels is far superior. We recommend using HaplotypeCaller in all cases, with only a few exceptions:

If you want to analyze more than 100 samples at a time (for performance reasons)
If you are working with non-diploid organisms (UG can handle different levels of ploidy while HC cannot)
If you are working with pooled samples (also due to the HC’s limitation regarding ploidy)
In those cases, we recommend using UnifiedGenotyper instead of HaplotypeCaller.

Personally I am not sure which is better. Getting different results bioinformatically is not a proof of correctness.

**athomson** · 07-31-2014, 11:42 AM

Originally posted by N311V View Post

In regards to your first question, I do think GATK UnifiedGenotyper would have treated each different read group as a different sample (http://gatkforums.broadinstitute.org...bout-bam-files).

If you look at the desc of the SM tag in that page, its seems GATK would treat all read groups with the same SM as coming from the same sample

GATK tools treat all read groups with the same SM value as containing sequencing data for the same sample. Therefore it's critical that the SM field be correctly specified, especially when using multi-sample tools like the Unified Genotyper.

**N311V** · 07-31-2014, 02:19 PM

Originally posted by Jolin View Post

If they treat different read groups as different samples, then the read groups of each lane are supposed to be the same, right? But this is not mentioned at all in GATK website.

I did read somewhere on the GATK website that each sample needs a unique read group, sorry don't have a link right now. To keep track of lane perhaps you could use picard tools AddOrReplaceReadGroups.jar and specify the library name as the lane.

Originally posted by Jolin View Post

I just called SNPs not indels. So unified genotyper seems to be faster. Did HaplotyperCaller run better than Unified Genotyper in your project?

I was interested in SNPs and indels which made HaplotypeCaller an great all-in-one solution. Also, I was only interested in a couple of genes so speed was not a concern. I haven't compared the SNP results from HaplotypeCaller to UnifiedGenotyper so can't say if they're the same. I assume so but better check.

**Jolin** · 07-31-2014, 07:05 PM

Hi Westerman, Thank you. Actually our lab used Unified Genotyper all the time and did some PCR validation on the predicted SNVs. It seems that UG works well in SNV detection.

**Jolin** · 07-31-2014, 07:06 PM

Thanks a lot, N311V

Topics	Statistics	Last Post
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, Today, 05:37 AM	0 responses 5 views 0 reactions	Last Post by SEQadmin2 Today, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 16 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 50 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 109 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM

Unconfigured Ad

SNV calling using GATK with data from multiple lanes

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News