  • Using the cqn package with edgeR

    I am doing an RNA-seq differential expression analysis, and am trying to use the cqn package with edgeR to correct for GC content effects on expression estimates. I have a few questions about using this package:

    1. How should the GC content for each gene be calculated (in concept)? I guess the simplest way would be to calculate it over the exonic regions of the gene. However, this seems flawed to me, as different transcripts from the same gene may have different GC contents and different expression levels. Say, for example, that you have a gene with three exons and two transcripts: one (T1) with the first two exons and one (T2) with the last two. Say the GC content across all the exons is 50%, T2 is the predominant isoform, hardly any T1 is expressed, and the GC content of T2 is 60%. If you base the correction factor on all the exons in the gene, the correction to the expression estimate for this gene is going to be way out. But considering transcript-level expression when calculating the correction factor seems like it would be very complicated, and somewhat circular, as the correction factor is needed to accurately estimate expression...

    2. How should the GC content for each gene be calculated (in practice)? I have read this thread (https://support.bioconductor.org/p/58846/), but I can't see how to get the method suggested there to work in my case - half of my genes get excluded from the txdb due to "bad strand information" (I'm using a gtf file produced by Cufflinks and all the novel single exon genes predicted by Cufflinks have no strand information).

    3. Same as 1 above but for gene lengths - transcript length may vary a lot between different isoforms of the same gene, leading to the same problem as for GC content.

    4. Same as 2 above but for gene lengths.

    5. Do people commonly do RNA-seq analysis in edgeR without this correction? I'm not convinced it will make that much difference, and it seems very complicated to implement!
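
To make the three-exon example in question 1 concrete, here is a minimal sketch in plain Python (not R, and not the cqn or edgeR API). It computes gene length and GC content over the union of exons, and compares that with an abundance-weighted average over transcripts. The toy sequence, coordinates, function names, and the 5%/95% isoform abundances are all made up for illustration. The GC fraction of a sequence equals that of its reverse complement, so this calculation does not need strand information, which may help with the strandless single-exon genes mentioned in question 2.

```python
def merge_intervals(intervals):
    """Merge overlapping or adjacent 1-based inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [(s, e) for s, e in merged]


def length_and_gc(exons, chrom_seq):
    """Return (length, GC fraction) over the union of the given exons.

    Strand is ignored: G+C counts are identical on both strands.
    """
    length = 0
    gc = 0
    for start, end in merge_intervals(exons):
        seq = chrom_seq[start - 1:end].upper()
        length += len(seq)
        gc += seq.count("G") + seq.count("C")
    return length, gc / length


def abundance_weighted(values, abundances):
    """Average per-transcript values, weighted by (assumed known) abundances."""
    total = sum(abundances)
    return sum(v * a for v, a in zip(values, abundances)) / total


# Hypothetical toy chromosome: three 10 bp exons separated by 5 bp introns.
exon1, exon2, exon3 = "ATATGCAGTT", "GCGCATGCAT", "CCGGATCGAT"
chrom = exon1 + "AAAAA" + exon2 + "AAAAA" + exon3

t1_exons = [(1, 10), (16, 25)]   # T1: first two exons
t2_exons = [(16, 25), (31, 40)]  # T2: last two exons

gene_len, gene_gc = length_and_gc(t1_exons + t2_exons, chrom)  # union of all exons
t1_len, t1_gc = length_and_gc(t1_exons, chrom)
t2_len, t2_gc = length_and_gc(t2_exons, chrom)

# Assumed isoform abundances (made up): T1 at 5%, T2 at 95%.
weighted_gc = abundance_weighted([t1_gc, t2_gc], [0.05, 0.95])
weighted_len = abundance_weighted([t1_len, t2_len], [0.05, 0.95])

print(gene_len, gene_gc)              # union-exon length and GC
print(round(weighted_gc, 4), weighted_len)
```

In this toy case the union-exon GC is 50% while the abundance-weighted value sits close to T2's 60%, which is the discrepancy described in question 1. The weighted version still needs isoform abundances from somewhere, so it does not remove the circularity raised there.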
