I am a bit confused about the chromStart and chromEnd positions in the BED format.
According to UCSC:
chromStart - The starting position of the feature in the chromosome or scaffold. The first base in a chromosome is numbered 0.
chromEnd - The ending position of the feature in the chromosome or scaffold. The chromEnd base is not included in the display of the feature. For example, the first 100 bases of a chromosome are defined as chromStart=0, chromEnd=100, and span the bases numbered 0-99.
Assuming I download a bed file for a gene from UCSC as below:
chromStart: 300
chromEnd: 500
Now, I get a set of SNPs by mapping reads to hg18 and calling SNPs using whatever SNPcaller. I want to know how many SNPs were called within the above gene. Should I compare each SNP position with the gene range as
300<=SNP_POSITION<=500
or
301<=SNP_POSITION<=500
or
301<=SNP_POSITION<=499
?
Does anyone know which is correct?
Thanks
According to UCSC:
chromStart - The starting position of the feature in the chromosome or scaffold. The first base in a chromosome is numbered 0.
chromEnd - The ending position of the feature in the chromosome or scaffold. The chromEnd base is not included in the display of the feature. For example, the first 100 bases of a chromosome are defined as chromStart=0, chromEnd=100, and span the bases numbered 0-99.
Assuming I download a bed file for a gene from UCSC as below:
chromStart: 300
chromEnd: 500
Now, I get a set of SNPs by mapping reads to hg18 and calling SNPs using whatever SNPcaller. I want to know how many SNPs were called within the above gene. Should I compare each SNP position with the gene range as
300<=SNP_POSITION<=500
or
301<=SNP_POSITION<=500
or
301<=SNP_POSITION<=499
?
Does anyone know which is correct?
Thanks
Comment