Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • how to do batch correction

    Hello everyone,

    I've got some raw counts for mRNA expression data. Before put these counts into EdgeR or DESeq, I was told I have to some batch correction and normalisation. Has anyone done the similiar work before? What shoul I do, with what package?

  • #2
    Depends on the nature of the batch effect. If all the samples in a batch seem to be affected similarly, then just add batch as a model in your design. Otherwise, have a look at the SVA package in bioconductor (pay special attention to the combat() function).

    Comment


    • #3
      Originally posted by dpryan View Post
      Depends on the nature of the batch effect. If all the samples in a batch seem to be affected similarly, then just add batch as a model in your design. Otherwise, have a look at the SVA package in bioconductor (pay special attention to the combat() function).
      So how to make a model? I also want to adjust some other covariats like age, race other than batch.
      What I want to do is to find differentially expressed genes between 2-3 treated groups. And what should I do next step?

      Many thanks!

      Comment


      • #4
        Just read the sections in the edgeR/DESeq2/etc. vignettes on more multifactor designs. The simplest route is to just add batches as a factor to the dataframe use to make the model matrix.

        Regarding what you do after DE analysis, that depends on your goals. Popular choices include (1) qPCR validation of some hits (2) GO analysis and (3) pathway analysis.

        Comment


        • #5
          [QUOTE=dpryan;157595]Just read the sections in the edgeR/DESeq2/etc. vignettes on more multifactor designs. The simplest route is to just add batches as a factor to the dataframe use to make the model matrix.

          is it this step?
          data.frame(Sample=colnames(y), FH, Seq_Batch, Age, RACE, BMI, MENSSTAT)
          design <- model.matrix(~FH+Seq_Batch+Age+RACE+BMI+MENSSTAT)

          FH is my group factor
          Seq_Batch=Sequence batch infromation, should I categorizise it into groups as "numeric" or just use the original information as "character"
          Age, RACE, BMI, MENSSTAT are my covariats

          Comment


          • #6
            Yeah, if you input a character it'll normally get converted to a factor anyway (this makes things convenient).

            Comment


            • #7
              Originally posted by dpryan View Post
              Yeah, if you input a character it'll normally get converted to a factor anyway (this makes things convenient).
              Can you have a closer look at my code, thanks?
              The last coecient "FH" is my group categorization. The others are adjusted factors in my model.

              > design <- model.matrix(~Seq_Batch+Age+RACE+BMI+MENSSTAT+FH)
              > rownames(design) <- colnames(y)
              > design
              (Intercept) Seq_Batch Age RACE BMI MENSSTAT FHYes
              FH0.B1.K100801 1 1 37 3 23.4 2 0
              FH1.B2.K100813 1 2 67 3 21.9 1 1
              FH0.B3.K100823 1 3 46 3 29.0 1 0
              FH1.B2.K100826 1 2 49 3 27.3 2 1
              FH1.B2.K100831 1 2 54 3 28.3 1 1
              FH0.B2.K101448 1 2 46 3 32.3 1 0
              FH1.B2.K102540 1 2 57 -1 28.0 1 1
              FH0.B2.K102654 1 2 59 3 41.6 1 0
              FH0.B5.K104200 1 5 63 3 26.5 1 0
              FH0.B5.K104238 1 5 55 3 22.1 1 0
              FH0.B2.K104239 1 2 48 -1 39.0 2 0
              FH0.B2.K104250 1 2 33 3 22.7 3 0
              FH0.B3.K104338 1 3 56 -1 34.3 1 0
              FH0.B2.K104343 1 2 44 3 33.4 1 0
              FH0.B3.K104403 1 3 46 3 34.2 1 0
              FH1.B4.K104416 1 4 37 3 32.5 1 1
              FH0.B5.K104443 1 5 38 3 20.7 2 0
              FH1.B5.K104506 1 5 59 3 25.9 1 1
              FH1.B3.K104557 1 3 41 3 33.1 2 1
              FH0.B2.K104603 1 2 63 3 31.0 1 0
              FH0.B2.K104638 1 2 78 3 22.6 1 0
              FH0.B4.K104662 1 4 64 3 29.9 1 0
              FH1.B2.K104824 1 2 57 -1 27.4 1 1
              attr(,"assign")
              [1] 0 1 2 3 4 5 6
              attr(,"contrasts")
              attr(,"contrasts")$FH
              [1] "contr.treatment"

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Strategies for Sequencing Challenging Samples
                by seqadmin


                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                03-22-2024, 06:39 AM
              • seqadmin
                Techniques and Challenges in Conservation Genomics
                by seqadmin



                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                Avian Conservation
                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                03-08-2024, 10:41 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Yesterday, 06:37 PM
              0 responses
              12 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, Yesterday, 06:07 PM
              0 responses
              10 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-22-2024, 10:03 AM
              0 responses
              52 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 03-21-2024, 07:32 AM
              0 responses
              68 views
              0 likes
              Last Post seqadmin  
              Working...
              X