Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Metagenome assembly

    How many contigs one can get after metagenome assembly?

  • #2
    there are too many variables to answer your question can you be more specific?

    Comment


    • #3
      Assembly

      Many contigs can be assembled in a metagenome.

      Comment


      • #4
        i want to assemble 25 million reads. i am getting varying results with different assemblers.

        Comment


        • #5
          Originally posted by BIOin View Post
          i want to assemble 25 million reads. i am getting varying results with different assemblers.
          With any dataset you will get different results with different assemblers, and even different results with different parameter settings of the same assembler, and different results with the same assembler and same parameters but different pre-processing steps.

          For a metagenome, the complexity can vary depending on your sample. If you had a very complex sample, 25M reads (platform? paired end? read length?) is probably barely scratching the surface -- 25M 2x100 Illumina reads is only 5Gb, which isn't gigantic if you have a diverse sample.

          Comment


          • #6
            thanks for the reply.
            yes my data is complex(animal rumen), my data set Illumina 25M HiSeq 2000 2x100,

            I just started using meta-velvet to assemble high quality metagenome data. I tried running meta-velvet with a k-mer of 45, after the assembly is finished and I look at the output file "meta-velvetg.contigs.fa" got 1128469 contigs with max contig length 31758 bp and N50 190.
            Should i have to consider this assembly or need to run more Kmers...
            Please give me suggestions on assemblers to be use

            Comment


            • #7
              I tried to assembly a metagenome (plant endophyte, the plant genome is not avaiable now) uing ILLUMINA hiseq 2000 2*100 reads too, my data has 69 M paired end reads, 9.9 Billion bases. I assemblied these reads using CLC genomic workbench, and got 770 thousands contigs. I am working on these contigs now. How do you deal with your so many contigs? Could we share our idears>
              Originally posted by BIOin View Post
              thanks for the reply.
              yes my data is complex(animal rumen), my data set Illumina 25M HiSeq 2000 2x100,

              I just started using meta-velvet to assemble high quality metagenome data. I tried running meta-velvet with a k-mer of 45, after the assembly is finished and I look at the output file "meta-velvetg.contigs.fa" got 1128469 contigs with max contig length 31758 bp and N50 190.
              Should i have to consider this assembly or need to run more Kmers...
              Please give me suggestions on assemblers to be use

              Originally posted by BIOin View Post
              thanks for the reply.
              yes my data is complex(animal rumen), my data set Illumina 25M HiSeq 2000 2x100,

              I just started using meta-velvet to assemble high quality metagenome data. I tried running meta-velvet with a k-mer of 45, after the assembly is finished and I look at the output file "meta-velvetg.contigs.fa" got 1128469 contigs with max contig length 31758 bp and N50 190.
              Should i have to consider this assembly or need to run more Kmers...
              Please give me suggestions on assemblers to be use

              Comment


              • #8
                Currently we are finalizing our Assembly. It will be a great help you share your Ideas..
                Last edited by BIOin; 02-10-2013, 10:41 PM.

                Comment


                • #9
                  For the assembly of paired-end only Illumina data, I like to use ABySS assembler. But if the metagenome is too complicated, I agree with the previous post that both 25 M and 69 M reads are just to scratch the surface. Using different assemblers won't make signficant difference in terms of the number of contigs or n50.

                  If the purpose is just to recover genes from the metagenome, paired-end only Illumina data is useful to uncover genes except for those that suffer from strain variations. But to increase the integraty of the assembly dramatically (increase n50), mate-pair data with long inserts can significantly increase scaffolding performance. With some programs to resolve some gaps within scaffolds, the assembly can be improved further.

                  Comment


                  • #10
                    Metamos (with SOAPdenovo) appears to be a rather decent assembler although I have no personal experience with it..

                    savetherhino.org

                    Comment


                    • #11
                      Originally posted by Shuiquan View Post
                      For the assembly of paired-end only Illumina data, I like to use ABySS assembler. But if the metagenome is too complicated, I agree with the previous post that both 25 M and 69 M reads are just to scratch the surface. Using different assemblers won't make signficant difference in terms of the number of contigs or n50.

                      If the purpose is just to recover genes from the metagenome, paired-end only Illumina data is useful to uncover genes except for those that suffer from strain variations. But to increase the integraty of the assembly dramatically (increase n50), mate-pair data with long inserts can significantly increase scaffolding performance. With some programs to resolve some gaps within scaffolds, the assembly can be improved further.
                      Hi, what are there tools for closing gaps in metagenomes? such as, Bambus 2, and what?Thank you very much.

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Non-Coding RNA Research and Technologies
                        by seqadmin


                        Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

                        [Article Coming Soon!]...
                        Today, 08:07 AM
                      • seqadmin
                        Recent Developments in Metagenomics
                        by seqadmin





                        Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
                        09-23-2024, 06:35 AM
                      • seqadmin
                        Understanding Genetic Influence on Infectious Disease
                        by seqadmin




                        During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

                        Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
                        09-09-2024, 10:59 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 10-02-2024, 04:51 AM
                      0 responses
                      13 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 10-01-2024, 07:10 AM
                      0 responses
                      23 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 09-30-2024, 08:33 AM
                      1 response
                      29 views
                      0 likes
                      Last Post EmiTom
                      by EmiTom
                       
                      Started by seqadmin, 09-26-2024, 12:57 PM
                      0 responses
                      19 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X