Header Leaderboard Ad

Collapse

Do we still need to assemble a genome?

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Do we still need to assemble a genome?

    Hi,

    This may sound like a naive question, but I have been trying to come up with answers for a couple days and haven't yet been able to. Thank you in advance for any input.

    Now that we can detect human sequence variations (SNPs, indels, Structural Variants, etc) based on the set of paired-end reads, I wonder if there is still a need to assemble the original sequence. Wasn't the point to detect the variations?

    And we don't need to know the assembled sequence for the new sequence anymore to gain its gene positions because its paired-end reads can be mapped back to the human reference genome, so we can learn the gene positions from there.

    So aside from saving space (100 something GB vs 3GB) and time to analyze the data, do we really need to assemble any new human genome that has been resequenced?

    Thank you!

  • #2
    We know Beijing, New York, Paris, London, Tokyo... , but we still need a World map.

    Comment


    • #3
      No no, please don't get me wrong here. Let me clarify a bit.

      I understand we still need to sequence personalized genomes. However, the question is once we get the reads, do we need to assemble them?

      My thinking is that the need to have a fully assembled sequence arose from the fact that we need to know how this particular sequence vary from the human reference genome (hg18, for example). But based on just the reads, we can use programs like the SOAP package to locate variations already. For everything else, it is supposed to be identical to the reference genome we use.

      So why bother assembling, once we have the reads? Cannot we get all information we need from the reads alone?

      Comment


      • #4
        I would venture to say that most of the interest in assembly is in de novo assembly of novel organisms.

        I believe the number of organisms that have been fully sequenced is still in the low hundreds.
        --
        Jeremy Leipzig
        Bioinformatics Programmer
        --
        My blog
        Twitter

        Comment


        • #5
          Take a look at the recent pan genome paper (not to be confused with the Pan genome paper :-). There may be significant portions of human genome which are not yet represented in any genome database because they are structural variants restricted to populations not yet sampled.

          Full scale de novo sequencing may not always be necessary -- some sort of intelligent local reassembly / reassembly of everything that doesn't map followed by integration with that which does.

          Comment


          • #6
            For reliable detection of variations you need fairly high coverage (at least 10X and more is better), thus you need to assemble the multiple reads to determine the coverage. Regions with low coverage give less certainty in whether a variation is real and high coverage gives more confidence (obviously).

            Comment


            • #7
              I guess what you guys are trying to say here is that, to detect the variations specific to the individual whose genome is being sequenced, we have to assemble the reads anyway. Ok that I agree.

              But do we have a need for the finished personalized human genome sequence? (Assuming that all the variations would have already been detected during the process of genome assembling.)

              Thanks for any input.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                How RNA-Seq is Transforming Cancer Studies
                by seqadmin



                Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
                09-07-2023, 11:15 PM
              • seqadmin
                Methods for Investigating the Transcriptome
                by seqadmin




                Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

                Whole Transcriptome RNA-seq
                Whole transcriptome sequencing...
                08-31-2023, 11:07 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Yesterday, 09:05 AM
              0 responses
              14 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 09-21-2023, 06:18 AM
              0 responses
              11 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 09-20-2023, 09:17 AM
              0 responses
              13 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 09-19-2023, 09:23 AM
              0 responses
              28 views
              0 likes
              Last Post seqadmin  
              Working...
              X