Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • joro
    Member
    • Feb 2010
    • 28

    #46
    Hi Simon,

    When using the command "cds = estimateVarianceFunctions(cds)" I get the following error:

    Error in lfproc(x, y, weights = weights, cens = cens, base = base, geth = geth, :
    newsplit: out of vertex space
    In addition: Warning messages:
    1: In lfproc(x, y, weights = weights, cens = cens, base = base, geth = geth, :
    procv: no points with non-zero weight
    2: In lfproc(x, y, weights = weights, cens = cens, base = base, geth = geth, :
    procv: no points with non-zero weight
    3: In lfproc(x, y, weights = weights, cens = cens, base = base, geth = geth, :
    procv: no points with non-zero weight
    4: In lfproc(x, y, weights = weights, cens = cens, base = base, geth = geth, :
    procv: no points with non-zero weight
    5: In lfproc(x, y, weights = weights, cens = cens, base = base, geth = geth, :
    procv: no points with non-zero weight
    6: In lfproc(x, y, weights = weights, cens = cens, base = base, geth = geth, :
    procv: no points with non-zero weight
    7: In lfproc(x, y, weights = weights, cens = cens, base = base, geth = geth, :
    procv: no points with non-zero weight
    8: In lfproc(x, y, weights = weights, cens = cens, base = base, geth = geth, :
    procv: no points with non-zero weight
    9: In lfproc(x, y, weights = weights, cens = cens, base = base, geth = geth, :
    procv: no points with non-zero weight
    10: In lfproc(x, y, weights = weights, cens = cens, base = base, geth = geth, :
    procv: no points with non-zero weight

    I'm not sure what's causing the error and was wondering if anyone has seen it before?

    Thank you.

    Comment

    • Simon Anders
      Senior Member
      • Feb 2010
      • 995

      #47
      Hi Joro,

      "out of vertex space" is an error from locfit, which DESeq uses internally. I've seen the error occuring onl,y once so far, in a quite unusual data set.

      You can try to increase the number of vertices with:
      Code:
      cds <- estimateVarianceFunctions( cds, locfit_extra_args=list(maxk=300) )
      If this does not help, could you send me your CountDataSet object (save it with save( cds, file="cds.rda" )) so that I can try around with it?

      Cheers
      Simon

      Comment

      • agent99
        Member
        • Jul 2010
        • 10

        #48
        recovering counts from cufflinks

        Originally posted by Boel View Post
        Is there a way to output the actual counts?
        Did anyone respond to this post?

        From the cufflinks output, I tried using the coverage value * (exon length/read length) to get counts, but it doesn't seem to be the right value. My reasoning was that if this is the counts (or fragments) in an exon, I should be able to recover the total number of mapped fragments and that value should be the same over all exons.

        fragments / (exon length/1000) = fragments/kb, then

        FPKM * fragments/kb = X million reads

        However, I'm getting a 2-fold range for how many million reads were sequenced. Could this be because the FPKM is a MLE?

        Thanks for any input/advice.

        Alisha

        Comment

        • RockChalkJayhawk
          Senior Member
          • Mar 2009
          • 192

          #49
          Originally posted by agent99 View Post
          Did anyone respond to this post?

          From the cufflinks output, I tried using the coverage value * (exon length/read length) to get counts, but it doesn't seem to be the right value. My reasoning was that if this is the counts (or fragments) in an exon, I should be able to recover the total number of mapped fragments and that value should be the same over all exons.

          fragments / (exon length/1000) = fragments/kb, then

          FPKM * fragments/kb = X million reads

          However, I'm getting a 2-fold range for how many million reads were sequenced. Could this be because the FPKM is a MLE?

          Thanks for any input/advice.

          Alisha
          I have had some incosistencies with cufflinks with gene-level expression. I would sugest moving on to using a simple program such as HT-Seq or BEDtools to get the actual tag counts within the gene region. Then, you could use DESeq for the normalization.

          Comment

          • agent99
            Member
            • Jul 2010
            • 10

            #50
            Originally posted by RockChalkJayhawk View Post
            I have had some incosistencies with cufflinks with gene-level expression. I would sugest moving on to using a simple program such as HT-Seq or BEDtools to get the actual tag counts within the gene region. Then, you could use DESeq for the normalization.
            Thanks for the advice. Apologies for being a little unclear in this post. I had hoped to get transcript level counts, so I was using cufflinks to determine abundance of different isoforms.

            Comment

            • Roman Bruno
              Junior Member
              • Mar 2010
              • 4

              #51
              Hi again Simon, I wonder if you had since then any suggestion about a possible method in order to compare multiple library condition.

              Thanks

              Comment

              • Gordon Smyth
                Member
                • Apr 2011
                • 91

                #52
                Multiple DGE libraries comparison. (EdgeR baySeq DESeq)

                Dear Roman,

                This is a very late reply to your original post of a year ago in which you described an experiment with four RNA-Seq libraries, Male-Old, Male-Young, Female-Old and Female-Young. You wanted to find transcripts differentially expressed between Male and Female.

                This type of experiment is typically treated in a way that is analogous to a paired t-test, in which you test for Male vs Female differences, adjusting for any differences between Young and Old.

                When you originally posted your question, edgeR did not have facilities for this type of analysis, but it has since the Bioconductor release of Oct 2010. The edgeR Vignette gives in Section 11 a worked example of an analysis with paired comparisons (tumour vs normal for the same patients) that is analogous to your problem. The analysis goes something like this:

                design <- model.matrix(~Age+Sex)
                y <- estimateGLMCommonDisp(y,design)
                disp <- y$common.dispersion
                fit <- glmFit(y,design,dispersion=disp)
                lrt <- glmLRT(y,fit)
                topTags(lrt)

                edgeR not require you to have replicates of any of the four
                conditions, because the linear model that is fitted has only 3 coefficients.

                Try out the paired comparison analysis and let us know if you have any problems with it.

                Best wishes
                Gordon

                Comment

                • cjp
                  Member
                  • Jun 2011
                  • 58

                  #53
                  Originally posted by Simon Anders View Post
                  Wow, this sounds like an embarrassing bug!

                  I've had a look immediately -- and, guess what, the variance adjustment does exactly what it should, but what I have written about it in the vignette (and further up in this thread) was completely the wrong way round.

                  ...

                  I'll correct the vignette right away.

                  Thanks for pointing this out
                  Simon
                  Hi Simon,

                  Thanks for the helpful comments in this thread!

                  But, I was reading the whole thread from the beginning - and maybe you should edit the post - number 31 - to say you noticed later it is the wrong way round for others doing the same and reading the whole thread! I've only just noticed on post 45 when I could have saved myself some confusion earlier in the thread.

                  Chris

                  Comment

                  Latest Articles

                  Collapse

                  • SEQadmin2
                    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                    by SEQadmin2


                    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                    ...
                    06-02-2026, 10:05 AM
                  • SEQadmin2
                    Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                    by SEQadmin2


                    With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                    Introduction

                    Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                    05-22-2026, 06:42 AM
                  • SEQadmin2
                    Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                    by SEQadmin2

                    Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                    Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                    05-06-2026, 09:04 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, Yesterday, 08:59 AM
                  0 responses
                  14 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-02-2026, 12:03 PM
                  0 responses
                  22 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-02-2026, 11:40 AM
                  0 responses
                  19 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 05-28-2026, 11:40 AM
                  0 responses
                  32 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...