I am trying to use Cufflinks (Cuffquant and Cuffnorm) to generate gene/transcript expression levels from the CCLE RNAseq data on cghub. The RNAseq data is provided as bam files aligned to genome assembly hg19_Broad_variant. The transcriptome assembly I would like to use is based on hg19/GRCh37. I have not found much information on hg19_Broad_variant to understand how different it is from hg19/GRCh37.
Do I need to convert the CCLE bam files to hg19/GRCh37 before putting them through a Cufflinks workflow? Or is hg19_Broad_variant close enough that it will work without conversion?
If I have to convert to hg19/GRCh37, is it possible to liftOver bam files? Or do I have to do bam->fastq using hg19_Broad_variant, then start over with alignment to hg19/GRCh37?
Thanks in advance for any advice.
Do I need to convert the CCLE bam files to hg19/GRCh37 before putting them through a Cufflinks workflow? Or is hg19_Broad_variant close enough that it will work without conversion?
If I have to convert to hg19/GRCh37, is it possible to liftOver bam files? Or do I have to do bam->fastq using hg19_Broad_variant, then start over with alignment to hg19/GRCh37?
Thanks in advance for any advice.
Comment