Seqanswers Leaderboard Ad

**krobison** · 06-18-2010, 06:51 AM

I would think you would want to use mapped tags as the denominator; otherwise poorly sequenced/prepared libraries will artificially have lower expression values

**Wolfgang Huber** · 06-19-2010, 01:46 AM

Form follows function

Hi Xhuister

when asking 'how' to do normalisation it is a good idea to first ask 'what for'. In RNA-Seq, a typical reason is to avoid spurious differential expression calls just because of differential library coverage. Another consideration is that you want to keep track of the actual counts when assessing statistical confidence in differential expression calls, since the counting noise is relatively more important when the numbers are small, even for the same fold-change.

A paper by M. Robinson and A. Oshlack discusses especially the first aspect in quite some detail: http://genomebiology.com/2010/11/3/R25

A paper by S. Anders and myself combines a very similar normalisation method with the error modeling needed for confidence computations: http://precedings.nature.com/documents/4282/version/2

Best wishes
Wolfgang Huber

**xhuister** · 06-19-2010, 04:30 AM

Thank you Wolfgang and Krobison,

I'm reading the DEG paper now. But I'm not sure whether it is suitable for my case.

In my case, for each gene, there are only some (normally <5) locations with reads and most of the locations are in 3'-UTR, not like the case that the reads are distributed along the transcripts.

Do you think it's OK to use the normalization method using DESeq or just use 'Tag per million' to normalize by the total count in each library? Thank you!

**Wolfgang Huber** · 06-19-2010, 11:39 AM

Dear Xhuister

as long as you have reason to believe that your counts are roughly proportional to the true target gene abundance (with unknown proportionality factors that depend on sample, lane and gene), these normalisations are in principle suitable. (And if that were not the case, then I am not sure what you want to normalise.)

Best wishes
Wolfgang

**xhuister** · 06-19-2010, 03:17 PM

Thank you Wolfgang. Maybe I'll have a try both TPM and DESeq.

Topics	Statistics	Last Post
Advanced Epigenome Editing Platform Explores Gene Regulation Mechanisms by seqadmin Started by seqadmin, Yesterday, 02:46 PM	0 responses 11 views 0 likes	Last Post by seqadmin Yesterday, 02:46 PM
Telomere Maintenance by PARP1: A New Perspective in Cancer Research by seqadmin Started by seqadmin, 05-07-2024, 06:57 AM	0 responses 13 views 0 likes	Last Post by seqadmin 05-07-2024, 06:57 AM
Enhanced Neoantigen Detection: Introducing NeoHunter by seqadmin Started by seqadmin, 05-06-2024, 07:17 AM	0 responses 17 views 0 likes	Last Post by seqadmin 05-06-2024, 07:17 AM
A Close Examination at Probiotic-Related Bacteremia by seqadmin Started by seqadmin, 05-02-2024, 08:06 AM	0 responses 23 views 0 likes	Last Post by seqadmin 05-02-2024, 08:06 AM

Seqanswers Leaderboard Ad

Announcement

How to Normalize NGS data? Tags per million?

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News