Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • rndouglas
    replied
    I just realized I'm only seeing this in my smallRNA libraries (adapter removed, t/rRNA removed, size-selected for 20-25nt in sRNA Workbench).

    I map the reads using bowtie (-v 0).

    I generate read counts with htseq-count, then build my count table(s).

    I run DESeq following along with the vignette section 3.1.

    So far, every 1v1 count table I've looked (7 of the possible 15) at has called more significantly changed loci (padj < 0.05) than I get when looking at the exact same comparison using a count table that includes all 30 of my bio-reps.

    The biggest 'jump' was from 76 to 372 loci for one comparison.

    Leave a comment:


  • feralBiologist
    replied
    Originally posted by rndouglas View Post
    This is what I had been doing all along (running everything in one count table), but then this morning I thought I'd try A vs. B in a separate count table. I was surprised to find almost double the loci with padj < 0.05 compared to when I ran everything in one count table (and hence my new-found concern).
    This is really strange and counter-intuitive. Do you find more loci for all the contrasts? I can imagine this being the case for a small subset of the contrasts where there happens to be less variation within the sample groups but I find it hard to believe that you would get in total a much higher number of differentially expressed features by splitting the count table. What sort of normalisation do you do? Can you give more details about your bioinformatics workflow?

    Leave a comment:


  • dpryan
    replied
    Have a look at the size factors. If one of them from the full dataset is very different than the others, that can cause this sort of result.

    Leave a comment:


  • rndouglas
    replied
    This is what I had been doing all along (running everything in one count table), but then this morning I thought I'd try A vs. B in a separate count table.

    I was surprised to find almost double the loci with padj < 0.05 compared to when I ran everything in one count table (and hence my new-found concern).

    Leave a comment:


  • feralBiologist
    replied
    It is better to have a single count table. This shall lead to increased statistical power. You are likely to get somewhat higher number of differentially expressed genes as DESeq would be able to tease out more signal from the noise.

    Leave a comment:


  • rndouglas
    started a topic What to include in my count table(s) for DESeq

    What to include in my count table(s) for DESeq

    I have five experimental groups (A, B, C, D, and E; 3-6 bioreps each) that I want to compare to each other and a control (Y; 3 bioreps) using DESeq.

    When I run DESeq is it better to use a count table that includes A, B, C, D, E, and Y, then run each comparison, or is it better to make a count table for each comparison (ie: a table for A and Y, another table for B and Y, another table for A and B, etc.) and end up with ~15 different count tables?

    I ask because I end up with different lists of significantly changed loci depending on how I run the analysis and I'm not sure which is more 'correct.'

Latest Articles

Collapse

  • SEQadmin2
    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
    by SEQadmin2


    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
    ...
    06-02-2026, 10:05 AM
  • SEQadmin2
    Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
    by SEQadmin2


    With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


    Introduction

    Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
    05-22-2026, 06:42 AM
  • SEQadmin2
    Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
    by SEQadmin2

    Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


    Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
    05-06-2026, 09:04 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by SEQadmin2, Today, 08:59 AM
0 responses
10 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-02-2026, 12:03 PM
0 responses
21 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-02-2026, 11:40 AM
0 responses
17 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 05-28-2026, 11:40 AM
0 responses
30 views
0 reactions
Last Post SEQadmin2  
Working...