Currently, I'm using the NCBI taxonomy and NCBI genomes that are tied to it to analyse pathogen genomes and reads in human samples. I am particularly interested in looking at EBV (Human herpesvirus 4) and my understanding is there are 8-9 different reference genomes floating around. At least, this is what I've understood from a couple of papers. However, NCBI only contains two genome references in their Virus folder. I was curious if anyone new where to find the other reference sequences for various EBV strains. Ideally, I would like them in fasta format so that I can manually incorporate them into my database.
Also, I'm curious why NCBI does not have the other reference sequences? After all, many of the viral and bacterial genomes are not completely annotated or built, but still manage to make it into the database?
Any thoughts on EBV references would be much appreciated!
Also, I'm curious why NCBI does not have the other reference sequences? After all, many of the viral and bacterial genomes are not completely annotated or built, but still manage to make it into the database?
Any thoughts on EBV references would be much appreciated!
Comment