Thousand genome SNP counts

BetterPrimate

Member

Join Date: May 2010

Posts: 15
- Share
- Tweet
#1

Thousand genome SNP counts

09-09-2010, 08:21 PM

Can anyone shed some light on the VCF files on the thousand genomes site? I downloaded these two files:

ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/pilot_data/release/2010_07/trio/snps/CEU.trio.2010_03.genotypes.vcf.gz
ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/pilot_data/release/2010_07/low_coverage/snps/CEU.low_coverage.2010_07.genotypes.vcf.gz

After decompressing I counted the lines expecting that the low-coverage data which is taken from 60 individuals would list considerably more SNPs than the trio data which by definition is taken from 3 individuals. Here's what I found:

low-coverage: 277,123 lines
trio: 3,646,774 lines

Why are there so few SNPs for the low-coverage data?

Last edited by BetterPrimate; 09-09-2010, 11:17 PM.
Tags: None

Previous template Next

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad