Hi all,
I have raw fastq files of paired-end short reads for two samples. I would like to tag the reads of the two samples with some identifier and pool them into a single fastqfile and perform the alignment and variant calling. Below is the detailed explanation of what i want to achieve.
In general, we align the reads of the two samples independently and add read groups to the bam files using samtools or picard tools and perform variant calling using GATK or samtools. Here the variant calling algorithm will treat them as two different samples based on the readgroup information.
But, i would like to tag the reads of both samples with two different read groups before doing alignment which would produce a single alignment bam file carrying the read group information of both the samples. And this bam file would be used for variant calling, where the algorithm treats it as two samples from read group information.
Could anyone help!!!
I have raw fastq files of paired-end short reads for two samples. I would like to tag the reads of the two samples with some identifier and pool them into a single fastqfile and perform the alignment and variant calling. Below is the detailed explanation of what i want to achieve.
In general, we align the reads of the two samples independently and add read groups to the bam files using samtools or picard tools and perform variant calling using GATK or samtools. Here the variant calling algorithm will treat them as two different samples based on the readgroup information.
But, i would like to tag the reads of both samples with two different read groups before doing alignment which would produce a single alignment bam file carrying the read group information of both the samples. And this bam file would be used for variant calling, where the algorithm treats it as two samples from read group information.
Could anyone help!!!
Comment