I have a question about syndip dataset : https://github.com/lh3/CHM-eval . I'm struggling to find the syndip vcf.
In the release ( https://github.com/lh3/CHM-eval/releases ), we have a file named : rep2.37.broad.hc.raw.vcf.gz, that i don't know what it is. And we have a file named CHM-evalkit-20180222.tar wich contain full.37m.vcf and other files ( bed, eval ...). So i did my search and according to this file: they mentionned that full.37m.vcf is the truth dataset. ( https://www.biorxiv.org/content/bior...1/456103-1.pdf Page 16).
The problem is that the file rep2.37.broad.hc.raw.vcf.gz contain variants with MQ, DP, GQ ... that i need to extract. But the full.37m.vcf doesn't contain this information.. ( just Chrom pos ref alt and QUAL.)
So i tried to intersect rep2.37.broad.hc.raw.vcf.gz with full.37m.vcf and take the variant that present in two files, with the DP MQ GQ in rep2.37.broad.hc.raw.vcf.gz. Is that okay ? Since I don't know what is rep2.37.broad.hc.raw.vcf.gz.
And i also noticed that the QUAL in the full.37m.vcf is always 30 .. Is it normal ? Thank's
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
Latest Articles
Collapse
-
by seqadmin
Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.
Long-Read Sequencing
Long-read sequencing has seen remarkable advancements,...-
Channel: Articles
12-02-2024, 01:49 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 12-02-2024, 09:29 AM
|
0 responses
158 views
0 likes
|
Last Post
by seqadmin
12-02-2024, 09:29 AM
|
||
Started by seqadmin, 12-02-2024, 09:06 AM
|
0 responses
57 views
0 likes
|
Last Post
by seqadmin
12-02-2024, 09:06 AM
|
||
Started by seqadmin, 12-02-2024, 08:03 AM
|
0 responses
48 views
0 likes
|
Last Post
by seqadmin
12-02-2024, 08:03 AM
|
||
Started by seqadmin, 11-22-2024, 07:36 AM
|
0 responses
77 views
0 likes
|
Last Post
by seqadmin
11-22-2024, 07:36 AM
|