Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • DESeq : without replication

    Hello

    I have a question regarding to the replicates for DESeq.

    Here is my data design..

    > head(data)

    KYM_pre KYM_post GDY_pre GDY_post HKS_pre HKS_post KWK_pre KWK_post PYM_pre PYM_post SHM_pre SHM_post
    A1BG 13 24 25 10 47 22 19 12 27 45 50 61
    A2M 3 4 4 8 37 8 19 6 4 3 7 11
    A2M-AS1 7 1 1 1 10 1 7 7 0 0 0 8
    ......


    > meta

    patient drug
    KYM_pre 1 pre
    KYM_post 1 post
    GDY_pre 2 pre
    GDY_post 2 post
    HKS_pre 3 pre
    HKS_post 3 post
    KWK_pre 4 pre
    KWK_post 4 post
    PYM_pre 5 pre
    PYM_post 5 post
    SHM_pre 6 pre
    SHM_post 6 post


    =====================================
    My goal is to find DEG responsing to the drug..

    I tried two strategy neither of them worked..

    1) Frankly, I don't have replicates.. so..
    d_rare <- newCountDataSet(data, meta)
    d_rare <- estimateSizeFactors(d_rare)
    d_rare <- estimateDispersions(d_rare, method="blind", sharingMode="fit-only")

    dh_fit1 = fitNbinomGLMs(d_rare, count ~ patient + drug)
    dh_fit0 = fitNbinomGLMs(d_rare, count ~ patient)

    Unfortunately, this model return NO DEG based on adjusted pvalue.

    2) I assumes 6 patients having same disease are biological replicates for more solid estimation for dispersion


    d_rare <- newCountDataSet(data, meta)
    d_rare <- estimateSizeFactors(d_rare)
    d_rare <- estimateDispersions(d_rare) # which means using default option (maximum likelihood estimation)

    Again, in this second trial I got the following error
    "None of your conditions is replicated. Use method='blind' to estimate across conditions, or 'pooled-CR', if you have crossed factors "

    Does anyone have an idea? I have replications for patient(e.g. 1 1 2 2 3 3 44 ...etc) and drug(pre/post) too. why I am getting this message?? My suspection is that when I consider the cross design.. for example, for fixed factor(e.g. pre) I only have one patient 1, one patient 2, one patient 3... etc.. Is this the problem??

    I also refer the DESeq manual example for pasilla Dataset.
    > pasillaDesign
    condition libType
    untreated1 untreated single-end
    untreated2 untreated single-end
    untreated3 untreated paired-end
    untreated4 untreated paired-end
    treated1 treated single-end
    treated2 treated paired-end
    treated3 treated paired-end

    In this design, they could sucessfully run. But in same way(like my model), for fixed single-end, they also only have one treated sample..

    > cdsFull = newCountDataSet( pasillaCountTable, pasillaDesign )
    > cdsFull = estimateDispersions( cdsFull )


    Maybe, I am missing something... please help me..
    Last edited by younko; 08-12-2014, 12:38 AM.

  • #2
    Firstly, use DESeq2 (in fact, you'll see many people using designs very similar to yours with DESeq2).

    Secondly, you do have replicates, you just also have a patient effect that you want to compensate for. It's been a while since I've used DESeq1, but the 'pooled-CR' method is probably what you want.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin


      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
      Yesterday, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    39 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    41 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    35 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    55 views
    0 likes
    Last Post seqadmin  
    Working...
    X