Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • westerman
    Rick Westerman
    • Jun 2008
    • 1104

    #16
    Originally posted by westerman View Post
    @danwiththeplan: Your question doesn't make much sense since in this forum thread N50 is also known as L50. Since they are one and the same then there is no 'Whaaa?' involved -- they both represent a length in bases. Now if you have a paper you are reading that uses L50 in a difference sense then give us a reference to the paper so we can see what they mean. ....
    Having had my attention brought back to this thread and having started using stats.sh from BBtools a lot more, I do note that stats.sh uses N/L50 to designate the Number of fragments at or above the Length50 cutoff.

    ABySS uses the terms N50 and n:N50 for the same information. I suspect that most programs use 'N50' for the length information and something else for the number information. However Brian's nomenclature in BBtools makes good sense as well.

    Comment

    • Brian Bushnell
      Super Moderator
      • Jan 2014
      • 2709

      #17
      I maintain that using "L" to denote a number and "N" to denote a length are just wrong, and I don't know how that got started. So I always define L50 as a length and N50 as a number. But Quast, for example, does use "L50" to indicate the number of contigs and "N50" to denote a length.

      Comment

      • kmcarr
        Senior Member
        • May 2008
        • 1181

        #18
        Originally posted by Brian Bushnell View Post
        I maintain that using "L" to denote a number and "N" to denote a length are just wrong, and I don't know how that got started...
        Really wish SEQanswers had a mechanism to "like" comments.

        Comment

        • westerman
          Rick Westerman
          • Jun 2008
          • 1104

          #19
          Originally posted by Brian Bushnell View Post
          I maintain that using "L" to denote a number and "N" to denote a length are just wrong, and I don't know how that got started. So I always define L50 as a length and N50 as a number. But Quast, for example, does use "L50" to indicate the number of contigs and "N50" to denote a length.

          You'll just love to know that the latest version of abyss-fac has ditched 'n:N50' (for the number) and 'N50' (for the length) in favor of 'L50' (for the number) and keeping 'N50' (length). Yes opposite of what you maintain and what makes logical sense.

          Comment

          • kbradnam
            Member
            • May 2011
            • 54

            #20
            Originally posted by Brian Bushnell View Post
            I maintain that using "L" to denote a number and "N" to denote a length are just wrong, and I don't know how that got started.
            I believe the precedent for this is in the 2001 publication of the human genome paper (in Nature): http://www.nature.com/nature/journal.../409860a0.html

            "We used a statistic called the ‘N50 length’, defined as the largest length L such that 50% of all nucleotides are contained in contigs of size at least L."

            Many others started using N50 after this publication, but I'm assuming that L50 for length came afterwards. So we now have a confusing situation, but unless someone can come up with a pre-2001 citation for use of L50, I suggest people stick with N50 for length and L50 for number of sequences (confusing though it is).

            Comment

            • kbradnam
              Member
              • May 2011
              • 54

              #21
              I've updated my blog post about this to include new evidence which also supports that N50 came first, and was first used in 2001.

              Comment

              • renekat
                Junior Member
                • May 2019
                • 1

                #22
                Clarification is stats.sh BBTools

                Originally posted by Brian Bushnell View Post
                I maintain that using "L" to denote a number and "N" to denote a length are just wrong, and I don't know how that got started. So I always define L50 as a length and N50 as a number. But Quast, for example, does use "L50" to indicate the number of contigs and "N50" to denote a length.
                Hello,
                I think that makes sense and just want to verify that in the BBTools stats.sh and statswrapper.sh N50 is for counts and L50 is for length. Is that correct?
                Thanks so much,
                René

                Comment

                Latest Articles

                Collapse

                • SEQadmin2
                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                  by SEQadmin2


                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                  ...
                  06-02-2026, 10:05 AM
                • SEQadmin2
                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                  by SEQadmin2


                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                  Introduction

                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                  05-22-2026, 06:42 AM
                • SEQadmin2
                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                  by SEQadmin2

                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                  05-06-2026, 09:04 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, 06-02-2026, 12:03 PM
                0 responses
                21 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 11:40 AM
                0 responses
                14 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 05-28-2026, 11:40 AM
                0 responses
                29 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 05-26-2026, 10:12 AM
                0 responses
                31 views
                0 reactions
                Last Post SEQadmin2  
                Working...