  • Sol
    Member
    • Oct 2010
    • 13

    I would like to know what "NA" means in the DEGseq output.
    Another question: does log2 refer to a two-fold change or a four-fold change?
    Thanks


    • Xi Wang
      Senior Member
      • Oct 2009
      • 317

      Thanks for your question.

      NA: when the read counts for a gene in both samples are zero, or zero and a small number (say, <5), the program will not calculate the values (such as fold-change, p-value) for this gene. "NA"s appear in those places.

      log2 means base-2 logarithm. So
      if fold-change = 1, log2(fold-change) = 0;
      if fold-change = 2, log2(fold-change) = 1;
      if fold-change = 4, log2(fold-change) = 2;
      if fold-change = 0.5, log2(fold-change) = -1.
      Xi Wang
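As a quick check of the log2 values listed in the reply above, here is a minimal R sketch. The variable names are made up for illustration; only base R's `log2` is used.

```r
# Fold-change values taken from the list in the reply above
fold_change <- c(1, 2, 4, 0.5)

# Base-2 logarithm of each fold-change
log2_fc <- log2(fold_change)
print(data.frame(fold_change, log2_fc))

# Converting a log2 value back to a plain fold-change
recovered <- 2^log2_fc
```

So a log2 value of 1 corresponds to a two-fold change and a log2 value of 2 to a four-fold change; negative values mean the ratio is below 1.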


      • Sol
        Member
        • Oct 2010
        • 13

        How do you calculate the cutoff value for DEGseq, for the p-value? For example, cutoff = 2.
        Thanks


        • Xi Wang
          Senior Member
          • Oct 2009
          • 317

          The cutoffs are specified by the user. If you are asking how the p-values are calculated, please refer to our paper: http://bioinformatics.oxfordjournals.../full/26/1/136

          BTW, a p-value is always a real number between 0 and 1, so 2 cannot be a p-value cutoff.
          Xi Wang
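To illustrate what a user-chosen p-value cutoff does, here is a small R sketch. The table, gene names, and p-values are invented for illustration and are not DEGseq output; the column names are assumptions too.

```r
# Hypothetical results table; gene names and p-values are made up
results <- data.frame(
  gene = c("geneA", "geneB", "geneC", "geneD"),
  p_value = c(0.0004, 0.03, 0.2, NA),
  stringsAsFactors = FALSE
)

# The cutoff is chosen by the user and must lie between 0 and 1
p_cutoff <- 0.001
stopifnot(p_cutoff > 0, p_cutoff < 1)

# which() skips NA entries, so genes without a p-value are never selected
significant <- results[which(results$p_value < p_cutoff), ]
print(significant)
```

A smaller cutoff keeps fewer genes; a value such as 2 would fail the range check because no p-value can exceed 1.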


          • wdt
            Member
            • Oct 2009
            • 19

            Hi,
            Using the sam to bed Perl script, I got a file like this:

            chr1 435837 435913 U0 0 +
            chr1 435837 435913 U0 0 -
            chr1 435837 435913 U1 0 -
            chr1 435838 435914 U1 0 +
            chr1 435838 435914 U1 0 -
            chr1 435838 435914 U1 0 -
            chr1 435840 435916 U2 0 -
            chr1 435840 435916 U2 0 -
            chr1 435840 435916 U3 0 -
            chr1 435840 435916 U2 0 -
            chr1 435842 435918 U4 0 -
            chr1 435842 435918 U4 0 -
            chr1 435844 435920 U2 0 -
            chr1 435844 435920 U2 0 -
            chr1 437189 437265 U2 0 +

            Could someone explain how U0, U1, U2 are assigned and
            what they are?

            Thanks,


            • Xi Wang
              Senior Member
              • Oct 2009
              • 317

              U (unique) marks uniquely mapped reads. Maybe the script treats all reads as unique.

              The integer after it is the number of mismatches.
              Xi Wang
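Assuming the name column really is "U" followed by the mismatch count, as the reply above suggests, the counts can be pulled out in R roughly like this. The column names are my own labels for the six BED fields, and only three of the posted lines are used.

```r
# A few lines copied from the BED file posted above
bed_lines <- c(
  "chr1 435837 435913 U0 0 +",
  "chr1 435837 435913 U1 0 -",
  "chr1 435840 435916 U2 0 -"
)

# Column names are labels chosen for this sketch
bed <- read.table(
  text = bed_lines, stringsAsFactors = FALSE,
  col.names = c("chrom", "start", "end", "name", "score", "strand")
)

# Strip the leading "U" and read the rest as the mismatch count
bed$mismatches <- as.integer(sub("^U", "", bed$name))
print(table(bed$mismatches))
```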


              • wdt
                Member
                • Oct 2009
                • 19

                I have RNA-seq data analyzed using tophat, which generated BAM files for each sample.
                Each group (cases/controls) has 5 samples.
                Would the following be the correct way to use DEGseq?
                1. Convert BAMs to SAM to BED using samtools + sam2bed.pl
                2. Use DEGseq samWrapper to test 5 samples in one group with 5 samples in the other
                to identify differentially expressed genes

                Thanks a lot!
                Last edited by wdt; 11-23-2010, 09:20 PM.


                • Xi Wang
                  Senior Member
                  • Oct 2009
                  • 317

                  Originally posted by wdt
                  I have RNA-seq data analyzed using tophat, which generated BAM files for each sample.
                  Each group (cases/controls) has 5 samples.
                  Would the following be the correct way to use DEGseq?
                  1. Convert BAMs to BED using sam2bed.pl
                  2. Use DEGseq samWrapper to test 5 samples in one group with 5 samples in the other
                  to identify differentially expressed genes

                  Thanks a lot!
                  Agreed. But please note that you first need to convert BAM to SAM using samtools.
                  Xi Wang
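The BAM to SAM step mentioned above can be scripted from R. This is only a sketch: the file names are hypothetical, it assumes samtools is on the PATH, and it prints the commands rather than running them. The later SAM to BED step is left out because the usage of sam2bed.pl is not given in this thread.

```r
# Hypothetical BAM files, one per sample
bam_files <- c("case1.bam", "control1.bam")

# Name each SAM file after its BAM file
sam_files <- sub("\\.bam$", ".sam", bam_files)

# "samtools view -h" writes SAM with the header; -o sets the output file
commands <- sprintf("samtools view -h -o %s %s", sam_files, bam_files)
writeLines(commands)

# To actually run the conversion, uncomment the next line:
# for (cmd in commands) system(cmd)
```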


                  • wdt
                    Member
                    • Oct 2009
                    • 19

                    Many thanks for your quick replies about the DEGseq.

                    Once BED files are provided, does DEGseq internally compute "raw counts" that are used for differential exp analysis?

                    Is there a way to output those raw counts (or equivalent numbers) per sample?

                    Thanks a lot!


                    • Xi Wang
                      Senior Member
                      • Oct 2009
                      • 317

                      You can use the script below.

                      Code:
                      library(DEGseq)
                      refFlat <- "refFlat.txt"   # gene annotation file in refFlat format
                      mapResultBatch <- c("sample1", "sample2", "sample3", "...")   # replace the file names accordingly
                      geneExpr <- "geneExpr.txt"   # you may specify the file name to save the gene expression values
                      getGeneExp(mapResultBatch, refFlat=refFlat, output=geneExpr)
                      Xi Wang


                      • newbietonextgen
                        Member
                        • Nov 2010
                        • 56

                        Help with DEGseq

                        Hello all,

                        I have a 1.0 GB data file and was wondering how long it should take the program to load this data. All I get after giving the path to sample A is a spinning ball (Mac) that keeps going for half an hour. I don't get the R prompt back, so I just kill the process, thinking something is wrong. Do I have to be patient? The computer has 8 GB of RAM, if that helps. Please let me know.

                        Sample BED-format file produced by the samtobed script:

                        chr1 15562 15637 ILLUMINA-927B2F_0001:1:110:7901:1208#0/1 10 +
                        chr1 15564 15636 ILLUMINA-927B2F_0001:1:92:5422:11873#0/1 10 +
                        chr1 15564 15636 ILLUMINA-927B2F_0001:1:117:10103:16792#0/1 10 +
                        chr1 16084 16159 ILLUMINA-927B2F_0001:1:3:3987:6468#0/1 10 -

                        So please let me know if it's the format or if I just need patience.
                        Last edited by newbietonextgen; 12-06-2010, 08:07 AM.


                        • Xi Wang
                          Senior Member
                          • Oct 2009
                          • 317

                          Originally posted by newbietonextgen
                          Hello all,

                          I have a 1.0 GB data file and was wondering how long it should take the program to load this data. All I get after giving the path to sample A is a spinning ball (Mac) that keeps going for half an hour. I just kill the process, thinking something is wrong. Do I have to be patient? The computer has 8 GB of RAM, if that helps. Please let me know. Thanks
                          What kind of data file did you feed to DEGseq: BED or BAM? Usually, it should not take that long to load 1 GB of data.
                          Xi Wang


                          • newbietonextgen
                            Member
                            • Nov 2010
                            • 56

                            Thanks Xi for the quick reply. It was a BED-format file. I converted it using the samTobed tool.


                            • Xi Wang
                              Senior Member
                              • Oct 2009
                              • 317

                              I just saw that you updated the message.
                              Was there any output on the screen?
                              Xi Wang


                              • newbietonextgen
                                Member
                                • Nov 2010
                                • 56

                                No. I have tried both ways: giving the full path to the file, and setting the working directory and then naming the file. I am using 64-bit R and I am not sure if that is the problem.

                                This is how the console looks:
                                >library(DEGseq)
                                Loading required package: qvalue
                                Loading Tcl/Tk interface
                                > sample A <- "path to the file (bed.txt)"
                                |

                                So there was no screen message after I hit return...
