Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Help required with edgeR contrasts

    Hi,

    I am using edgeR for analyzing a batch of RNA-Seq data made of differement cell compartment extracts (nuclear and cytoplasmic) before stimulation (00h) and several timepoints after stimulation (02h and 04h). Each condition has two replicates.
    The goal is to analyze whether the stimulation has an effect on the nuclear export of RNAs.

    For that, I would like to study :
    -Which annotations are significantly compartimented in cytoplasm or nucleus for each timepoint (including 00h)
    -The effect of stimulation (using 00h as reference) on the compartimentalization
    ....first by comparing 00h to individual timepoints 02h and 04h separately
    ....then trying to see what are the effect of stimulation independently of the time after stimulation by comparing 00h to the mix of all timepoints after stimulation

    I built a matrix containing counts of all experiments and a design matrix 'edgerGlmDesign' as follow (no intercept) :

    Code:
    >colnames(edgerGlmDesign)
    "Cyto.00h" "Cyto.02h" "Cyto.04h" "Nucl.00h" "Nucl.02h" "Nucl.04h"
    Before using the results of the analysis I would like to be sure that I am indeed comparing the right things (I doubt it actually) with the following setup :

    Code:
       
        # Prepare the 'contrasts' summarizing the conditions we want to compare
        myContrasts <- makeContrasts(
                # First Nuclear and Cytoplasmic will be compared for each timepoint individually
                NuclearVsCytoplasmic_00h     = Nucl.00h-Cyto.00h,
                NuclearVsCytoplasmic_02h     = Nucl.02h-Cyto.02h,
                NuclearVsCytoplasmic_04h     = Nucl.04h-Cyto.04h,
    
                # Then Nuclear significant from 02h will be compared to Nuclear significant of 00h
                Nuclear_00vs02    = (Nucl.02h-Cyto.02h)-(Nucl.00h-Cyto.00h),
                # Then Nuclear significant from 04h will be compared to Nuclear significant of 00h
                Nuclear_00vs04    = (Nucl.04h-Cyto.04h)-(Nucl.00h-Cyto.00h),
    
                # Then Cytoplasmic significant from 02h will be compared to Cytoplasmic significant of 00h
                Cytoplasmic_00vs02    = (Cyto.02h-Nucl.02h)-(Cyto.00h-Nucl.00h),
                # Then Cytoplasmic significant from 04h will be compared to Cytoplasmic significant of 00h
                Cytoplasmic_00vs04    = (Cyto.04h-Nucl.04h)-(Cyto.00h-Nucl.00h),
    
                # Then Nuclear of 00h compared to the mix of nuclear specific from all other timepoints
                Nuclear_00vsTotal = (((Nucl.02h-Cyto.02h)+(Nucl.04h-Cyto.04h))/2)-(Nucl.00h-Cyto.00h),
    
                # Then Cytoplasmic of 00h compared to the mix of nuclear specific from all other timepoints
                Cytoplasmic_00vsTotal = (((Cyto.02h-Nucl.02h)+(Cyto.04h-Nucl.04h))/2)-(Cyto.00h-Nucl.00h),
    
                levels=edgerGlmDesign);
    I am pretty sure that my conception of contrasts is wrong since results from 'Nuclear_00vsTotal' and 'Cytoplasmic_00vsTotal' are symmetric.

    I took inspiration from documentation chapter 3.3.1 where a similar setup is used to compared DrugVsPlacebo.1h, but I probably extrapolated too much from it...


    Thank you for your help,
    Regards.

  • #2
    It seems to me that Nuclear_00vs02 is equivalent to Nuclear_00vs02, you should get the same results just with opposite sign. The same for Nuclear_00vs04 and Cytoplasmic_00vs04 and the same for Nuclear_00vsTotal and Cytoplasmic_00vsTotal.

    I would suggest to run the analysis, pick the top hits and see on the normalized data (i.e. cpm or rpkm) if the result is consistent with what you expect the model to do. Also as a test, add to the raw data some dummy genes that for each contrast should come up as significant or non significant (sort of positive and negative controls), just to make sure the contrasts are doing what you expect.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      04-22-2024, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-25-2024, 11:49 AM
    0 responses
    19 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-24-2024, 08:47 AM
    0 responses
    19 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    62 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    60 views
    0 likes
    Last Post seqadmin  
    Working...
    X