Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • ronaldrcutler
    Member
    • May 2016
    • 80

    SPAdes plasmids output

    Hello all. I have done a plasmidSPAdes run on my bacterial reads to look for any plasmids. In the final output directory, I have the contigs.fasta file which contains the plasmids with the suffix componenet_X denoting each respective plasmid.

    When looking at the different kmer value folders the K21-K99 folders seem to contain assemblies of the bacterial contigs in the final_contigs.fasta file, as a normal SPAdes run would. However, the K127 folder final_contigs.fasta is instead similar to the contigs.fasta file mentioned above - it seems to only contain plasmids and not the bacterial contigs.

    After comparing the K127 final_contigs.fasta and contigs.fasta files with diff, they have differences. My question is why are these two files different? Is the contigs.fasta file different because SPAdes ran the mismatch corrector? And also why are plasmids only output in the K127 folder final_contigs.fasta and not the other kmer value folders?

    Thanks
    Last edited by ronaldrcutler; 07-26-2016, 07:51 PM.
  • ronaldrcutler
    Member
    • May 2016
    • 80

    #2
    Update:

    In the K127 folder there is also before_chromosome_removal.fasta file. When I try to see if contig in the final_contigs.fasta is in the aformentioned file, I do not find them. So are the sequences that are supposedly plasmid sequences from the final_contigs.fasta in the K127 folder and contigs.fasta in the main SPAdes output directory in the final_contigs.fasta files in the other folders? What exactly is the difference between them all?

    Comment

    • HESmith
      Senior Member
      • Oct 2009
      • 512

      #3
      And also why are plasmids only output in the K127 folder final_contigs.fasta and not the other kmer value folders?
      Increasing the kmer length decreases the depth of coverage for each kmer. At some point, the depth becomes insufficient to assemble the de Bruijn graph accurately. Since plasmid sequences are present at much higher read depth, assembly of those reads can tolerate a higher kmer than the host cell sequences. It appears that setting k=127 is a threshold for assembling plasmid but not host cell reads.

      Also, if you're only interested in the plasmid sequences, you may want to use plasmidSPAdes (described here).
      Last edited by HESmith; 07-28-2016, 08:30 AM.

      Comment

      • ronaldrcutler
        Member
        • May 2016
        • 80

        #4
        I was using plasmidSPAdes. If I understand correctly, the plasmid contigs are able to assemble at higher kmer values because they are present at a higher read depth. Why are they more present at higher read depths than the host cell? What about coverage?

        Comment

        • HESmith
          Senior Member
          • Oct 2009
          • 512

          #5
          Plasmids are present in multiple copies per cell, whereas the host genome is (typically) a single copy. More copies = higher read depth.

          Comment

          • ronaldrcutler
            Member
            • May 2016
            • 80

            #6
            Okay, thanks for the clarification. So since there are no plasmids specified in the other final_contigs.fasta files from using kmer values less than 127, I can confidently say there are no plasmids in these and were only found when using a kmer value of 127?

            Comment

            Latest Articles

            Collapse

            • SEQadmin2
              From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
              by SEQadmin2


              Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


              The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
              ...
              Yesterday, 10:05 AM
            • SEQadmin2
              Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
              by SEQadmin2


              With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


              Introduction

              Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
              05-22-2026, 06:42 AM
            • SEQadmin2
              Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
              by SEQadmin2

              Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


              Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
              05-06-2026, 09:04 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, Yesterday, 12:03 PM
            0 responses
            19 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, Yesterday, 11:40 AM
            0 responses
            14 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 05-28-2026, 11:40 AM
            0 responses
            29 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 05-26-2026, 10:12 AM
            0 responses
            31 views
            0 reactions
            Last Post SEQadmin2  
            Working...