Hi,
I have a set of Illumina Hiseq 2000 bacteria transcriptome data set. From NCBI, I noticed that they have 2 same species but different strain genome sequence is available.
I decide to choose a closely related genome for my bacteria transcriptome data set as a genome-guide assembly.
Can I know how I determine which available genome is more closely related to my data set?
I have try to map the clean read of my bacteria transcriptome data set against those 2 genome separately.
It shown 97% of baceteria transcriptome read map to genome A (genome size is 4Mb) and 96% of bacteria transcriptome read map to genome B (genome size is 3.8Mb).
It seems like my reads map more in genome A than B.
But genome size of A is slightly larger than B.
I not sure whether more reads map to genome A is due to larger genome size in genome A.
Thanks for any advice that able to determine which genome is more closely related to my bacteria transcriptome data set.
I have a set of Illumina Hiseq 2000 bacteria transcriptome data set. From NCBI, I noticed that they have 2 same species but different strain genome sequence is available.
I decide to choose a closely related genome for my bacteria transcriptome data set as a genome-guide assembly.
Can I know how I determine which available genome is more closely related to my data set?
I have try to map the clean read of my bacteria transcriptome data set against those 2 genome separately.
It shown 97% of baceteria transcriptome read map to genome A (genome size is 4Mb) and 96% of bacteria transcriptome read map to genome B (genome size is 3.8Mb).
It seems like my reads map more in genome A than B.
But genome size of A is slightly larger than B.
I not sure whether more reads map to genome A is due to larger genome size in genome A.
Thanks for any advice that able to determine which genome is more closely related to my bacteria transcriptome data set.
Comment