Hi every one can anybody tell me
after performing mapping based approach for RNA seq data with tophat-cufflink pipeline i mapped my reads to reference genome and gff file for annotation. then how can i calculate my transcriptome coverage ?? after mapping and assembled transcript by cufflinks how can i know my RNA seq data covered transcriptome? or how many number of genes are covered in my RNA seq analysis with tophat -cufflink pipeline.
And in lastly anybody know how to extract gene length from a GFF or even GenBank file? BEDTools does it but I have duplicated values (for gene/CDS/exons). which one i consider for make fasta sequence from RNA seq analysis output? for that consider Gene cordinates or CDS cordinated or exon cordinates? what can i do for in case of replicate? Thanks
after performing mapping based approach for RNA seq data with tophat-cufflink pipeline i mapped my reads to reference genome and gff file for annotation. then how can i calculate my transcriptome coverage ?? after mapping and assembled transcript by cufflinks how can i know my RNA seq data covered transcriptome? or how many number of genes are covered in my RNA seq analysis with tophat -cufflink pipeline.
And in lastly anybody know how to extract gene length from a GFF or even GenBank file? BEDTools does it but I have duplicated values (for gene/CDS/exons). which one i consider for make fasta sequence from RNA seq analysis output? for that consider Gene cordinates or CDS cordinated or exon cordinates? what can i do for in case of replicate? Thanks
Comment