Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • RockChalkJayhawk
    Senior Member
    • Mar 2009
    • 192

    DEGseq

    Hey all,

    A new package has come out for RNA Seq Analysis:

    DEGseq: an R package for identifying differentially expressed genes from RNA-seq data.
    Wang L, Feng Z, Wang X, Wang X, Zhang X.
    Bioinformatics. 2009 Oct 24. [Epub ahead of print]
    PMID: 19855105 [PubMed - as supplied by publisher]

    However, from my understanding (which may not be correct) they only use tags that map to the genome, not splice juntions. Does anyone else see this and if so, what would be the work-around?
  • svl
    Member
    • Sep 2009
    • 43

    #2
    I seem to be unable to install the package...anyone had succes?
    ----
    source("http://bioconductor.org/biocLite.R")
    biocLite("DEGseq")
    ----

    Also their site is unavailable: http://bioinfo.au.tsinghua.edu.cn/software/degseq
    Here is the page at Bioc: http://www.bioconductor.org/packages...ml/DEGseq.html

    Comment

    • RockChalkJayhawk
      Senior Member
      • Mar 2009
      • 192

      #3
      Originally posted by svl View Post
      I seem to be unable to install the package...anyone had succes?
      ----
      source("http://bioconductor.org/biocLite.R")
      biocLite("DEGseq")
      ----

      Also their site is unavailable: http://bioinfo.au.tsinghua.edu.cn/software/degseq
      Here is the page at Bioc: http://www.bioconductor.org/packages...ml/DEGseq.html
      I had to try it a few times before I could get it loaded. Good luck.

      Comment

      • svl
        Member
        • Sep 2009
        • 43

        #4
        Thanks. I figured it out. Had to update R and Bioconductor before trying to install the package....should keep my software better updated I guess

        Comment

        • Xi Wang
          Senior Member
          • Oct 2009
          • 317

          #5
          hi svl,

          Thanks for using DEGseq.

          If you want to install DEGseq from bioconductor, you should update your R to v2.10.0. Or you can install DEGseq by typing the following.

          --
          source("http://bioinfo.au.tsinghua.edu.cn/software/degseq/DEGseqInstall.R")
          --

          Best regards,
          Xi

          Originally posted by svl View Post
          I seem to be unable to install the package...anyone had succes?
          ----
          source("http://bioconductor.org/biocLite.R")
          biocLite("DEGseq")
          ----

          Also their site is unavailable: http://bioinfo.au.tsinghua.edu.cn/software/degseq
          Here is the page at Bioc: http://www.bioconductor.org/packages...ml/DEGseq.html
          Xi Wang

          Comment

          • Xi Wang
            Senior Member
            • Oct 2009
            • 317

            #6
            Sorry that yesterday our building encountered a power cut, so our web server was not at work. Our site works well now.

            Thanks.
            Xi


            Originally posted by svl View Post
            Xi Wang

            Comment

            • Xi Wang
              Senior Member
              • Oct 2009
              • 317

              #7
              Hi RockChalkJayhawk,

              Thanks for using DEGseq.

              In the current version of DEGseq, we do not consider the splice junctions, and so do the reads with multiple matches to the reference genome. Actually, we take the reads mapped to the reference genome not transcriptiome as input, and count the reads in the annotateed gene regions as the gene expressing level. It works if the gene expression patterns (isoform expression percentage) are similar between the case and control samples, although otherwise it may cause a litter bias. We are now working on how to use the information provided by splice junction reads and multiple aligned reads to refine the work on differetially expressed gene identification.

              Xi

              Originally posted by RockChalkJayhawk View Post
              Hey all,

              A new package has come out for RNA Seq Analysis:

              DEGseq: an R package for identifying differentially expressed genes from RNA-seq data.
              Wang L, Feng Z, Wang X, Wang X, Zhang X.
              Bioinformatics. 2009 Oct 24. [Epub ahead of print]
              PMID: 19855105 [PubMed - as supplied by publisher]

              However, from my understanding (which may not be correct) they only use tags that map to the genome, not splice juntions. Does anyone else see this and if so, what would be the work-around?
              Xi Wang

              Comment

              • Xi Wang
                Senior Member
                • Oct 2009
                • 317

                #8
                Originally posted by svl View Post
                Thanks. I figured it out. Had to update R and Bioconductor before trying to install the package....should keep my software better updated I guess
                Sorry for the inconvenience.

                If you want to install DEGseq through bioconductor by the followsing script, you need to update your R to 2.10.0 and bioc to 2.5

                source("http://bioconductor.org/biocLite.R")
                biocLite("DEGseq")

                Or alternatively, you can install DEGseq through our site by:

                source("http://bioinfo.au.tsinghua.edu.cn/software/degseq/DEGseqInstall.R")

                Yesterday, our building encountered a power cut, so the server was down. The server is running well now.

                Thanks and Best wishes,
                Xi
                Xi Wang

                Comment

                • svl
                  Member
                  • Sep 2009
                  • 43

                  #9
                  Originally posted by Xi Wang View Post
                  Sorry for the inconvenience.
                  Thanks for the update/info!

                  Comment

                  • shibujohn
                    Junior Member
                    • Oct 2009
                    • 6

                    #10
                    Is it possible to analyze SOLiD rna-seq in DEGseq

                    Originally posted by Xi Wang View Post
                    Hi RockChalkJayhawk,

                    Thanks for using DEGseq.

                    In the current version of DEGseq, we do not consider the splice junctions, and so do the reads with multiple matches to the reference genome. Actually, we take the reads mapped to the reference genome not transcriptiome as input, and count the reads in the annotateed gene regions as the gene expressing level. It works if the gene expression patterns (isoform expression percentage) are similar between the case and control samples, although otherwise it may cause a litter bias. We are now working on how to use the information provided by splice junction reads and multiple aligned reads to refine the work on differetially expressed gene identification.

                    Xi
                    Hi,
                    Is it possible to analyze SOLiD rna-seq data in DEGseq?

                    Shibu

                    Comment

                    • Xi Wang
                      Senior Member
                      • Oct 2009
                      • 317

                      #11
                      Originally posted by shibujohn View Post
                      Hi,
                      Is it possible to analyze SOLiD rna-seq data in DEGseq?

                      Shibu
                      Thanks for your question.

                      We only used the assumption that reads are uniformly distributed along transcripts. We can upload your uniquely mapped reads (or profiles) to a browser (such as UCSC genome browser) to check whether your data satisfy this assumption. And less stringently, you can check whether the variation between technical replicates (if any) can be explained by the random sampling model. A feature of our DEGseq package can help check it: see Section 3 in Supplementary Material on line for details. May this information helps you.

                      Xi
                      Xi Wang

                      Comment

                      • shibujohn
                        Junior Member
                        • Oct 2009
                        • 6

                        #12
                        SOLiD WT-data

                        Originally posted by Xi Wang View Post
                        Thanks for your question.

                        We only used the assumption that reads are uniformly distributed along transcripts. We can upload your uniquely mapped reads (or profiles) to a browser (such as UCSC genome browser) to check whether your data satisfy this assumption. And less stringently, you can check whether the variation between technical replicates (if any) can be explained by the random sampling model. A feature of our DEGseq package can help check it: see Section 3 in Supplementary Material on line for details. May this information helps you.

                        Xi
                        Yes, We have the SOLiD data which is uniquely mapped into the refseq (two groups BN & SHR).. And I don't have the replicates.. We had performed the Whole transcriptome in SOLiD, so is it possible to analyze the SOLiD uniquely mapped reads (.ma)?
                        Thanks,
                        Shibu

                        Comment

                        • Xi Wang
                          Senior Member
                          • Oct 2009
                          • 317

                          #13
                          Originally posted by shibujohn View Post
                          Yes, We have the SOLiD data which is uniquely mapped into the refseq (two groups BN & SHR).. And I don't have the replicates.. We had performed the Whole transcriptome in SOLiD, so is it possible to analyze the SOLiD uniquely mapped reads (.ma)?
                          Thanks,
                          Shibu
                          It looks possible. You'd better also try to valid the results given by DEGseq. Good luck!

                          Xi
                          Xi Wang

                          Comment

                          • adamreid
                            Junior Member
                            • Nov 2009
                            • 6

                            #14
                            Hi there,

                            When using DEGexp with read counts for each gene, how are the read counts normalised? I've put dummy values in for gene lengths. Does it use these, expecting
                            no exons?

                            Adam

                            Comment

                            • Xi Wang
                              Senior Member
                              • Oct 2009
                              • 317

                              #15
                              Originally posted by adamreid View Post
                              Hi there,

                              When using DEGexp with read counts for each gene, how are the read counts normalised? I've put dummy values in for gene lengths. Does it use these, expecting
                              no exons?

                              Adam
                              We don't recommend to normalize the read counts, although you can use RPKM as the gene expression level. DEGexp considers the number of total reads that map to the gene exon regions, when the sequence depth between samples are not the same. Why we do like this is to make sure the computation under the assumption of the random sampling model.

                              I am not sure I understand quite well what you meant by asking the second question. When using DEGexp, the gene expression levels are provided by your data. You can define the "gene expression" yourself. However, we recommend that you use the raw read counts as the gene expression levels.

                              Thanks for your question.
                              Xi
                              Xi Wang

                              Comment

                              Latest Articles

                              Collapse

                              • SEQadmin2
                                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                by SEQadmin2


                                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                ...
                                06-02-2026, 10:05 AM
                              • SEQadmin2
                                Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                                by SEQadmin2


                                With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                                Introduction

                                Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                                05-22-2026, 06:42 AM
                              • SEQadmin2
                                Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                                by SEQadmin2

                                Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                                Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                                05-06-2026, 09:04 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by SEQadmin2, Today, 08:59 AM
                              0 responses
                              8 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-02-2026, 12:03 PM
                              0 responses
                              21 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-02-2026, 11:40 AM
                              0 responses
                              15 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 05-28-2026, 11:40 AM
                              0 responses
                              29 views
                              0 reactions
                              Last Post SEQadmin2  
                              Working...