Hi Everyone, I've got a question about the UMD3.1 reference genome for cattle.
The (I think original?) version from Maryland can be obtained here (1) ftp://ftp.cbcb.umd.edu/pub/data/asse...aurus_UMD_3.1/.
There are, however, other reference .fa files available for UMD3.1, for example, from
(2) ftp://ftp.ncbi.nlm.nih.gov/genomes/Bos_taurus/CHR_25/ or (3) ftp://ftp.cbcb.umd.edu/pub/data/asse...aurus_UMD_3.1/
The problem is, these files are not identical. Just looking at the (unzipped) file size for one chromosome (25 – it’s the smallest), considerable differences can be observed. For example:
(Source) Size FileName
(1) 43,619,264 bytes Chr25.fa
(2) 43,517,209 bytes bt_ref_Bos_taurus_UMD_3.1_chr25(1).fa
(3) 43,517,969 bytes bt_ref_Bos_taurus_UMD_3.1_chr25(2).fa
A quick "diff" in Unix shows that the files really aren't the same.
I haven’t found a reason or an explanation for this. As long as we know which reference was used for alignment, we should be ok. Nevertheless, it’s a little unnerving to see that three different “References” that should technically be the same, aren’t…
Does anyone know why this is?
Cheers,
Chris
The (I think original?) version from Maryland can be obtained here (1) ftp://ftp.cbcb.umd.edu/pub/data/asse...aurus_UMD_3.1/.
There are, however, other reference .fa files available for UMD3.1, for example, from
(2) ftp://ftp.ncbi.nlm.nih.gov/genomes/Bos_taurus/CHR_25/ or (3) ftp://ftp.cbcb.umd.edu/pub/data/asse...aurus_UMD_3.1/
The problem is, these files are not identical. Just looking at the (unzipped) file size for one chromosome (25 – it’s the smallest), considerable differences can be observed. For example:
(Source) Size FileName
(1) 43,619,264 bytes Chr25.fa
(2) 43,517,209 bytes bt_ref_Bos_taurus_UMD_3.1_chr25(1).fa
(3) 43,517,969 bytes bt_ref_Bos_taurus_UMD_3.1_chr25(2).fa
A quick "diff" in Unix shows that the files really aren't the same.
I haven’t found a reason or an explanation for this. As long as we know which reference was used for alignment, we should be ok. Nevertheless, it’s a little unnerving to see that three different “References” that should technically be the same, aren’t…
Does anyone know why this is?
Cheers,
Chris
Comment