Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • A real beginner's question on metagenomics! Plz help!

    Hi, I'm very much new to the field of metagenomics and sequencing. I have been trying to find out answers for this but I haven't been too successful probably because the answer is too obvious or my understanding is completely plainly wrong... Anyway, it would be great if some of you could enlighten me with an answer!

    In metagenomics, say a soil sample is taken for sequencing. Now, fragmented genomes from this mixture of species are sequenced say by 454 sequencing. My question is, will there always be some overlapping regions to guide de novo assembly? If I am understanding this correctly (probably not!), there will be abundant species in the sample and their genomes will be fairly easy to assemble as there are lots of them, but for the less abundant species, there won't be enough of them to have enough overlapping regions of the genomes for de novo assembly. I know this is a bit extreme, but what if there happens to be just one member of a novel species in the sample? is there just no way for the whole genome of this species to be sequenced?

  • #2
    Maybe try assembling a couple polyploids with samples from various different strains of the same species, heck try assembling a single species from a single sample accurately, then figure whatever is in the meta soup is pretty close to nonsense alpha=.0001, especially after you take into account 454 is only accurate in high coverage regions and you probably only have at most a couple of 454 runs. So you might as well give up, or at least make up some results.

    Comment


    • #3
      Originally posted by rskr View Post
      Maybe try assembling a couple polyploids with samples from various different strains of the same species, heck try assembling a single species from a single sample accurately, then figure whatever is in the meta soup is pretty close to nonsense alpha=.0001, especially after you take into account 454 is only accurate in high coverage regions and you probably only have at most a couple of 454 runs. So you might as well give up, or at least make up some results.
      While this could've been said nicer (), I mostly agree. New species discovery by soil shotgun de novo sequencing is a hard problem.

      Maybe with a few billion long reads....but i doubt with current technologies/throughputs. Either that or single cell sort + WGA + WGS.

      Comment


      • #4
        Thank you, but I'm sort of more confused...

        My question was, will there always be some overlapping regions to guide de novo assembly even for those species which are less abundant in the sample?

        Really sorry if I failed to understand your previous answers.

        Comment


        • #5
          There will never always be anything in biology...

          Your question is not really answerable with real facts, unless you already know how many species are present in your sample and at what abundances. It's a pretty easy calculation to estimate: how many species do you have, what are their relative abundances, how big are their genomes, how big is the insert size of your library, and how many reads will you generate. Of course there are more difficult questions of species similarity, repeat content, but the above is probably a good exercise for you to work through.

          The conservative answer is (especially with a few million reads from a few 454 runs)...that you will not have enough coverage to de novo assembly an entire RARE organism out of a metagenomic pool, let alone be confident that any apparent overlap is biologically real.

          Comment


          • #6
            Thank you ever so much! I think all the confusion started when someone told me it's possible to conduct whole genome sequencing from a metagenomic soup! I suppose de novo sequencing in metagenomics is a dream that this stage! Thank you for your answers!

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin


              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
              Yesterday, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            39 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            41 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            35 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            55 views
            0 likes
            Last Post seqadmin  
            Working...
            X