Hello,
I have a 5 yeast genomes (Illumina MiSeq) -- I will call them A, B, C, D, and E.
I grew a colony of yeast over a period of a couple days under four different conditions in order to see how each particular condition would affect the genome of the yeast. These conditions should induce mutations in the DNA of the yeast.
Thus, A is the genome of the yeast I started with, and B-E (one for each condition) are the "altered" genomes of the yeast that I ended up at the end of the experiment.
I have aligned A-E to the S288C S. cerevisiae reference genome using BWA and called SNPs through two methods (with the mpileup function in SAMtools; also, via GATK and VCFtools), but these methods haven't quite given me the results I wanted.
To be brief, when mapped to the reference genome, Genomes B-E show fewer mutations than when Genome A is mapped to the reference genome. I would like to align B-E to A to call SNPs/INDELs. I hope, in this way, that I can get a better fit of B-E onto A (since B-E are more closely related to A than they are to the reference genome, and thus should provide a better fit).
How do I go about mapping B-E onto A? Do I need to process A to serve as a reference genome, and, if so, how would I do that? I have A-E as .fastq files, as well as all the .SAM and .BAM files after aligning to the reference genome.
I will readily admit I am not particularly good at working with computers, but if you request any other information, please let me know.
I have a 5 yeast genomes (Illumina MiSeq) -- I will call them A, B, C, D, and E.
I grew a colony of yeast over a period of a couple days under four different conditions in order to see how each particular condition would affect the genome of the yeast. These conditions should induce mutations in the DNA of the yeast.
Thus, A is the genome of the yeast I started with, and B-E (one for each condition) are the "altered" genomes of the yeast that I ended up at the end of the experiment.
I have aligned A-E to the S288C S. cerevisiae reference genome using BWA and called SNPs through two methods (with the mpileup function in SAMtools; also, via GATK and VCFtools), but these methods haven't quite given me the results I wanted.
To be brief, when mapped to the reference genome, Genomes B-E show fewer mutations than when Genome A is mapped to the reference genome. I would like to align B-E to A to call SNPs/INDELs. I hope, in this way, that I can get a better fit of B-E onto A (since B-E are more closely related to A than they are to the reference genome, and thus should provide a better fit).
How do I go about mapping B-E onto A? Do I need to process A to serve as a reference genome, and, if so, how would I do that? I have A-E as .fastq files, as well as all the .SAM and .BAM files after aligning to the reference genome.
I will readily admit I am not particularly good at working with computers, but if you request any other information, please let me know.
Comment