Hi,
I filter variants from a WGS data with different databases like genomicSuperDups. However, many variants (more than 70%) get filtered by genomicSuperDups. As it is not normal, how is it possible to find out why so many variants get filtered? The problem is that annovar provides a list of dropped variants for different filters except genomicSuperDups. So there is no way to find out the variants that get filtered by this db.
another question is that if snp129 is more clean than snp137 in using snp137NonFlagged,snp137?
variants_reduction.pl myfile -protocol genomicSuperDups,1000g2012apr_all,snp137NonFlagged,snp137,dgv -buildver hg19 humandb/ -operation r,f,f,f,r -aaf_threshold .01 -remove
NOTICE: Processing operation=r protocol=genomicSuperDups
NOTICE: Running step 1 with system command <annotate_variation.pl -regionanno -dbtype genomicSuperDups -buildver hg19 -outfile myfile.step1 myfile.step0.varlist /humandb/>
NOTICE: Reading annotation database humandb/hg19_genomicSuperDups.txt ... Done with 51599 regions
NOTICE: Finished region-based annotation on 2033 genetic variants in myfile.step0.varlist
NOTICE: Output files were written to myfile.step1.hg19_genomicSuperDups
NOTICE: After step 1, 61 variants are left in analysis.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Yesterday, 11:49 AM
|
0 responses
15 views
0 likes
|
Last Post
by seqadmin
Yesterday, 11:49 AM
|
||
Started by seqadmin, 04-24-2024, 08:47 AM
|
0 responses
16 views
0 likes
|
Last Post
by seqadmin
04-24-2024, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
61 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
60 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|