Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • BS-Seq data analysis with more than two (2+) conditions

    Dear all,

    I could need some help/recommendations as I am about to analyse BS-Seq data for more than two conditions.

    We conducted BS-Seq on mice samples, which were held under four different feeding conditions. For each conditions the dataset contains three replicates. The experiment went well. (So the data is already generated!)

    Now my question: After conducting the alignment, how would you suggest to proceed in order to analyse differential methylation on ALL four conditions. I know there are enough tools and pipelines out there to conduct pairwise analyses but that does not help me in this case as multiplie pairwise comparisons are probably not really comparable to each other. (and if so, I could imagine that would be probably only on a qualitative but not statistically proper quantitative level).
    Also, the handling of the information of the replicates should not be considered.
    As I am only collecting ideas at the moment, I do not care too much whether the analysis would be on the scale of single Cs or whole regions.

    An ideal solution would probably be an equivalent to an ANOVA in Transcriptomics (which is also used by DESeq2 or edgeR) where one can conduct the test with multiple factor-levels and even multiple factors.

    Is there any magical tool out there being able to handle this kind of task and I have not heard about it? Or has anyone came across a paper/pipeline where they tackled that question? (I didnt)

    If not maybe someone of you has a good sugestion how such a test should be conducted? We are currently thinking about converting somehow the information from a BS-Seq mapper (like Bismark) into sth that could be read by an ANOVA, Deseq2 etc. OR adapting the statistical framework of an ANOVA to mapped BS-reads.

    Thank you very much for reading this long text and I am happy about any idea/suggestion you might have!

    Best,

    Oliver

  • #2
    It's surprising that no one has written something for this yet. I started to do something like this but never got around to completing it (e.g., most of the documentation is missing or wrong). If nothing else, the base code should already be there to fit models of arbitrary complexity with a beta-binomial using prior distributions (i.e., it's like DESeq2 and edgeR, but with a beta-binomial rather than a negative-binomial). I should note that we were specifically interested in looking at changes in variance, so none of that is profiled out.

    There's no direct way to convert BS-seq counts for use with DESeq2 or a linear model. If you can find a convenient linear model function (e.g. limma) that allows per-row weights then you should be able to use that after logit transforming ratios. That would be a reasonable approach, but everything I know of like limma only allows per-column/sample weights, which wouldn't be sufficient for you.

    Comment


    • #3
      Originally posted by dpryan View Post
      It's surprising that no one has written something for this yet.
      Indeed!!

      I started to do something like this but never got around to completing it (e.g., most of the documentation is missing or wrong). If nothing else, the base code should already be there to fit models of arbitrary complexity with a beta-binomial using prior distributions
      Interesting, thanks for sharing! I came up with a similar idea of using beta-binomial model using the glm functionality in the R package aod.

      If you can find a convenient linear model function (e.g. limma) that allows per-row weights then you should be able to use that after logit transforming ratios.
      An idea I wanted to try is to linearise the % methylation by means of e.g. logit, as you suggest. Then use the total counts per position as weight for limma/voom (i.e. total counts ~ counts per gene). Then combine the weights from voom with the logit data for lmFit. (Not sure it makes sense...)

      Comment


      • #4
        Regarding aod, I recall trying that and pretty much anything else that could fit betabinomials. As I recall, if you don't need to fit both mu and phi, possibly with different model matrices, at the same time then all of those work well enough (just keep in mind that beta binomials are a royal pain to fit well).

        I don't know about the limma/voom part of that, but the rest is going in the right direction. One would need to put a lot of thought into how voom works and converting that to be relevant for beta-binomials. I suspect that feeding a matrix of per-position/per-sample weights into lmFit along with the logit transformed methylation percentages would produce reasonable results. That'd also be vastly simpler than dealing with beta-binomials, though that should all be tested before anyone actually uses it.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Non-Coding RNA Research and Technologies
          by seqadmin




          Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

          Nobel Prize for MicroRNA Discovery
          This week,...
          10-07-2024, 08:07 AM
        • seqadmin
          Recent Developments in Metagenomics
          by seqadmin





          Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
          09-23-2024, 06:35 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 06:35 AM
        0 responses
        6 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 02:44 PM
        0 responses
        7 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 10-11-2024, 06:55 AM
        0 responses
        14 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 10-02-2024, 04:51 AM
        0 responses
        110 views
        0 likes
        Last Post seqadmin  
        Working...
        X