Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Bad assembly or bad sequence data?

    I'm trying to assemble a metagenome containing one cyanobacterial genome along with a whole bunch of commensal bacteria with the intention of separating out the cyanobacterial sequences. I have done an assembly using Abyss and then run the contigs through PhyloPythiaS. Very few contigs are coming out affiliated with cyanobacteria which is troublesome since that was the dominant organism in the culture. It looks as though the commensal bacteria (e.g. Pseudomonas) have assembled very well.

    My question is, how likely is it that the assembler will generate chimeric sequences that obscure the organism of interest? Is there anyway of altering assembly parameters to compensate for this? Or is it more likely that my dna extractions have simply failed to work on the cyanobacteria?

  • #2
    So BBSplit analysis did not work that well?

    Comment


    • #3
      Not quite... It seemed to work, and mapped a portion of reads to the references, but when I assembled from the mapped reads there were still lots of contigs that were clearly from contaminants. I must say, some samples have worked better than others (I am working on several different strains) which is why I'm beginning to doubt my extractions...

      Comment


      • #4
        So that is certainly one possible explanation. You could do another round of BBSplit with assembled contaminants to see if you can weed some additional sequences out. Then go back and do the assembly again.

        Comment


        • #5
          Give SPAdes a try to see if it does a better job of assemblies. At least it may put the bacterial contaminants together better so you can remove them.

          Comment


          • #6
            Thanks, I'll have a go with SPADES. Currently, my workflow is something like this:

            1) Trim adapters + poor quality reads - Trimmomatic

            2) map reads to multiple cyanbacteria genomes - BBsplit

            3) assemble mapped reads - ABYSS/SPADES

            4) Separate contigs based on taxonomic affiliation - PhyloPythia

            I was then thinking about using the initial reads to try and extend the cyanobacterial contigs. I've tried this using IMAGE (which didn't work) and PRICE (which ran out of memory on my 32 GB desktop). Currently the assemblies are generating huge numbers of contigs, many of which are short and obviously want to get this number down without throwing away useful data...

            Does all this sound like a reasonable way of going about things? There is such a huge amount of available software out there it's hard to see the wood for the trees sometimes...

            Comment


            • #7
              You may need to try and iterate between 2 and 3 to see if you can improve things. If the cynobacterial DNA is underrepresented in the current library then you may need to do another prep.

              Comment


              • #8
                When you say there are contigs from contaminants... what kind of contaminants are you talking about? The wrong strain, or the wrong phylum entirely, or synthetic lab molecules?

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Genetic Variation in Immunogenetics and Antibody Diversity
                  by seqadmin



                  The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
                  11-06-2024, 07:24 PM
                • seqadmin
                  Choosing Between NGS and qPCR
                  by seqadmin



                  Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
                  10-18-2024, 07:11 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, Today, 11:09 AM
                0 responses
                24 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, Today, 06:13 AM
                0 responses
                20 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 11-01-2024, 06:09 AM
                0 responses
                30 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 10-30-2024, 05:31 AM
                0 responses
                21 views
                0 likes
                Last Post seqadmin  
                Working...
                X