Hi all,
I'm seeing tophat2 output bam files of different sizes (~800mb vs ~300mb) for two fastq files from the same strain (different biological replicates) with similar total read numbers for each (~14 million from 100bp single end illumina hiseq 2000 run). both fastqs are being mapped against the same reference genome but each sample was sequenced at different times and on different lanes. I don't believe this is a problem but would like to understand the size discrepancy.
Cheers,
I'm seeing tophat2 output bam files of different sizes (~800mb vs ~300mb) for two fastq files from the same strain (different biological replicates) with similar total read numbers for each (~14 million from 100bp single end illumina hiseq 2000 run). both fastqs are being mapped against the same reference genome but each sample was sequenced at different times and on different lanes. I don't believe this is a problem but would like to understand the size discrepancy.
Cheers,
Comment