Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • What are the appropriate terms and defintions in sequencing?

    It seems that there is no agreement on the usage of some terms in massively parallel sequencing. For example, the SAM specifications define as "segment" what many would call "read", namely "A contiguous (sub)sequence on a template which is sequenced or assembled." And I understand that their definition of "read" would include the two paired ends of a template sequenced by Illumina, e.g. So, according to SAM specifications, we could say "discordant read", instead of "discordant pair"; and it would not be appropriate to refer to a single end with the term "read". Convenient as it seems to standardize the use of these terms, the truth is that I don't see the terms used in the literature in the way suggested by SAM.

    Another problematic term is "insert", when applied to the fragment of DNA between the two sequenced ends of a pair, which may have never been "inserted".

    Then, the terms "clonal coverage", "span coverage", and also "physical coverage" have been used with apparently the same meaning.

    Lastly, I'm bothered by the flashy qualifier of the "next generation" sequencing. Doesn't it make more sense to say something less ephemeral, like "massively parallel" sequencing?

    I welcome any recommendation, reference, guidance or comment on the appropriate usage of these terms.

  • #2
    You are correct, it is a bit of a mess!

    The 'insert' is legacy terminology from BAC libraries etc. We should probably say 'fragment size' as it makes it clear that the length INCLUDES the original read(s).

    I've never heard those three coverage terms. Usually just here 'coverage', 'depth' or 'depth of coverage'.

    "Next-Gen" was ok originally, but many people (including myself) try to use terms like "high-throughput sequencing" (HTS).

    Comment


    • #3
      I think most people are standardizing (or at least they should) on calling Illumina/454/Solid/Ion "second generation sequencing". Pacbio/Nanopore stuff should be "third generation sequencing". We can argue about Helicos but it's a moot point right now.

      I too think there needs to be some consensus as to what a "read" is as Illumina counts forward and reverse reads separately for paired-end sequencing in their product literature, while I think they both together should be called a "read".

      Comment


      • #4
        I think the terms "second gen" and "third gen" are hopelessly pointless. PacBio and ONT are 'third' because they're single molecule? What about new nanopore technologies that require sample prep (possibly with some amplification)? What would you call GnuBIO? As the number of available platforms and technologies increases, it becomes more and more difficult to put them into (meaningless) categories. I just call them all 'next gen', but HTS probably makes more sense.

        Comment


        • #5
          I contend "high throughput" is meaningless as well.
          An ABI 3730 is "high throughput" as compared to an ABI 377, no?
          I'd say anything that detects a clonally amplified molecule is 'second' gen.
          If it sequences the single molecule itself it is 'third' gen.

          But go ahead and call it all 'next gen', because it is easier and it's best to not get bogged down in semantics.

          Comment


          • #6
            Originally posted by GW_OK View Post
            I contend "high throughput" is meaningless as well.
            An ABI 3730 is "high throughput" as compared to an ABI 377, no?
            I'd say anything that detects a clonally amplified molecule is 'second' gen.
            If it sequences the single molecule itself it is 'third' gen.

            But go ahead and call it all 'next gen', because it is easier and it's best to not get bogged down in semantics.
            Yeah, that's a really good point. Plus, I think some of the new technologies might focus more on quick, inexpensive sequencing (but not so much on the massive outputs).

            Comment


            • #7
              Do you think it will get to the point soon where "next gen" is implied and then it would be either clonally amplified (CA) sequencing or single molecule (SM) sequencing?

              Comment


              • #8
                In a draft of a recent paper I tried to coin the term Amplified Cyclic (AC) sequencing for flow cycle sequencing methods (Illumina and IonTorrent), and for single molecule (SM) sequencing, to roll the sequencing as SMS. I got some pushback against coining a new term though, and resorted to Next Generation for AC reads.

                cheers,
                -mark

                Originally posted by Joann View Post
                Do you think it will get to the point soon where "next gen" is implied and then it would be either clonally amplified (CA) sequencing or single molecule (SM) sequencing?

                Comment


                • #9
                  Thanks for the feedback. I will take this and other related forum-posted topics to the FGED meeting in Toronto next week on standards development.

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Essential Discoveries and Tools in Epitranscriptomics
                    by seqadmin




                    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                    04-22-2024, 07:01 AM
                  • seqadmin
                    Current Approaches to Protein Sequencing
                    by seqadmin


                    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                    04-04-2024, 04:25 PM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, Yesterday, 08:47 AM
                  0 responses
                  12 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-11-2024, 12:08 PM
                  0 responses
                  60 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 10:19 PM
                  0 responses
                  59 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 09:21 AM
                  0 responses
                  54 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X