
**dschika** · 09-27-2012, 11:51 PM

Hi krueml,

If you want to filter the BAM file by minimum mapping quality, have a look at the -q option of samtools view:

samtools view -bq 20 your.bam > out.bam

If you want to see the "List of Phred-scaled genotype likelihoods", you should look at the output of your pipeline before it goes through the awk command. The awk command also discards some of the header lines starting with '##', e.g.:

Code:

##FORMAT=<ID=GL,Number=3,Type=Float,Description="Likelihoods for RR,RA,AA genotypes (R=ref,A=alt)">
##FORMAT=<ID=DP,Number=1,Type=Integer,Description="# high-quality bases">
##FORMAT=<ID=SP,Number=1,Type=Integer,Description="Phred-scaled strand bias P-value">
##FORMAT=<ID=PL,Number=G,Type=Integer,Description="List of Phred-scaled genotype likelihoods">
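If the awk step is what drops the '##' lines, one way around it is to print every header line unconditionally and only apply the filter to the records. This is just a minimal sketch: I am assuming a filter on QUAL (column 6) for illustration, and the function name keep_header_filter and the threshold are hypothetical, not taken from your pipeline.

```shell
# Hypothetical helper: filter VCF records on QUAL while keeping all header lines.
# Usage: keep_header_filter MIN_QUAL in.vcf > out.vcf
keep_header_filter() {
    min_qual="$1"
    vcf="$2"
    awk -F'\t' -v min="$min_qual" '
        /^#/ { print; next }   # "##" meta lines and the "#CHROM" line pass through
        $6 + 0 >= min          # QUAL is column 6; a "." QUAL is treated as 0
    ' "$vcf"
}
```

For example, `keep_header_filter 50 calls.vcf > calls.q50.vcf` (file names made up) would keep the FORMAT/INFO descriptions in the output.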

Btw: Your link points to the description of samtools pileup, but you use mpileup. Afaik there are some differences between the two; for example, mpileup does not provide a consensus...

**krueml** · 09-28-2012, 05:36 AM

Yes, sorry, I posted the wrong link. What I posted was my mpileup output.
Sorry that I wasn't clear: I was just asking how I can set the minimum consensus quality. After the whole pipeline, there is an FQ value listed in the INFO field of the VCF file.

Code:

##INFO=<ID=FQ,Number=1,Type=Float,Description="Phred probability of all samples being the same">

#CHROM  POS     ID      REF     ALT     QUAL    FILTER  INFO    FORMAT  bar
1     9778579   .       C       T       62      .       DP=17;VDB=0.0000;AF1=1;AC1=2;DP4=0,0,13,0;MQ=60;FQ=-66  GT:PL:GQ        1/1:95,39,0:75
1     9781412   .       C       G       214     .       DP=66;VDB=0.0768;AF1=1;AC1=2;DP4=0,0,37,15;MQ=55;FQ=-184        GT:PL:GQ        1/1:247,157,0:99
1     9782556   .       C       T       222     .       DP=368;VDB=0.0599;AF1=1;AC1=2;DP4=2,0,263,42;MQ=59;FQ=-282;PV4=1,0.27,0.36,1    GT:PL:GQ        1/1:255,255,0:99

I have since modified the varFilter script, but I wanted to know whether there is another way (without changing the script), because the paper made no such change, and again, they used the same version of samtools. So I was just curious.
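For anyone who wants to threshold on FQ without touching varFilter, a post-filter on the INFO column is one option. This is only a rough sketch under my own assumptions: that FQ is the value to filter on, that its absolute value is the relevant magnitude (the records above all have negative FQ), and that the VCF is tab-separated. The function name filter_by_fq and the cutoff are hypothetical.

```shell
# Hypothetical helper: keep header lines and records whose |FQ| >= MIN_FQ.
# Records without an FQ tag in INFO are dropped.
# Usage: filter_by_fq MIN_FQ in.vcf > out.vcf
filter_by_fq() {
    min_fq="$1"
    vcf="$2"
    awk -F'\t' -v min="$min_fq" '
        /^#/ { print; next }                  # pass all header lines through
        {
            n = split($8, info, ";")          # INFO is column 8
            for (i = 1; i <= n; i++) {
                if (info[i] ~ /^FQ=/) {
                    fq = substr(info[i], 4) + 0
                    if (fq < 0) fq = -fq      # compare on the absolute value
                    if (fq >= min) print
                    break
                }
            }
        }
    ' "$vcf"
}
```

With a cutoff of 100, the first record above (FQ=-66) would be dropped and the other two (FQ=-184, FQ=-282) kept.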

variant calling with SAMtools
