Unconfigured Ad

**dpryan** · 03-26-2015, 12:26 AM

Just quantitate over genes, rather than transcripts. This is simplest with Ensembl's annotation files.

**neokao** · 03-26-2015, 12:37 AM

Originally posted by dpryan View Post

Just quantitate over genes, rather than transcripts. This is simplest with Ensembl's annotation files.

Thanks for your reply. Could you shed more light? Do you mean I should use Ensembl annotation file for my reference genome/transcriptome?
At which step were you suggesting to change?

I used the UCSC file refMrna.fa as reference transcriptome.
Then I used bwa for alignment and a perl script to count the reads.

I finally used the biomaRt package to update my refseqID to MGI symbol, etc.

useDataset("mmusculus_gene_ensembl",mart=ensembl)

Thanks.

**dpryan** · 03-26-2015, 12:43 AM

Ah, ditch UCSC and transcriptome alignments. The best method for RNAseq data is to use STAR or HISAT (or tophat2 if you enjoy wasting time) and align to the genome. These tools can be supplied with an annotation file (GTF or GFF format). The resulting SAM/BAM file can then be processed with featureCounts to produce gene-level counts. This is the process I personally use for my mouse datasets and it works quite well. I recommend Ensembl's reference sequence and annotation files, they're more convenient than UCSC's.

**neokao** · 03-26-2015, 01:29 AM

I thought STAR is adapted to align long reads. Mine are short reads. I guess I might be wrong.
Regarding the Ensembl reference genome/transcriptome for mouse RNAseq, is the Mus_musculus.GRCm38.79.gtf.gz the right one to use for now?

Thanks.

**dpryan** · 03-26-2015, 01:34 AM

STAR work great with short reads, even small RNAs (e.g. miRNAs).

Edit: Yes, that's the correct file. Get the fasta file too, since chromosome names differ between Ensembl and UCSC.

**neokao** · 04-05-2015, 04:22 PM

Originally posted by dpryan View Post

STAR work great with short reads, even small RNAs (e.g. miRNAs).

Edit: Yes, that's the correct file. Get the fasta file too, since chromosome names differ between Ensembl and UCSC.

Thanks dpryan. I finally did it with STAR and featurecounts.
I have some following questions posted with a different topic.

Topics	Statistics	Last Post
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity by SEQadmin2 Started by SEQadmin2, 07-02-2026, 11:08 AM	0 responses 7 views 0 reactions	Last Post by SEQadmin2 07-02-2026, 11:08 AM
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, 06-30-2026, 05:37 AM	0 responses 12 views 0 reactions	Last Post by SEQadmin2 06-30-2026, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 20 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 54 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM

Unconfigured Ad

issues of DE genes vs DE transcripts

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News