Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Successive rounds of gapfilling and scaffolding

    Hi,

    we've been playing around with successive rounds of gapfilling and scaffolding and we see a constant improvement of our genome stats. Basically we run several iterations of gapfilling which generates new ends with potential mapping sites for our mate-pair libraries and re-scaffold the genome. However, besides not being able to really assess the quality of these so generated new scaffolds, we wondered if re-scaffolding after gapfilling should be performed on contig level i.e. gapfilling, breaking down the scaffolds to contigs (now longer thanks to the gapfilling) and rescaffolding, or if it would be rather advisable to re-scaffold the gapfilled scaffolds. Theoretically, breaking down the scaffolds to contigs after gapfilling would allow to intercalate contigs that could not be mapped to the previously existing gaps due to the missing sequence at the end of the contigs before gap filling.

    Has anyone played around with this and can share his experience?

    Thanks,
    Zapp

  • #2
    Gapfilled regions of your genome will be of lower quality than the parts created by kmer extension in the original assembly, so too many rounds of gapfilling will start to introduce more error. Because of this, I would lean towards keeping your scaffolds intact for further rounds of scaffolding. This way any new mismapping mate pairs in gap filled regions don’t invalidate and screw up what should be higher quality scaffolding links already made.

    Comment


    • #3
      That sound reasonable, thank you very much.

      Regards,
      Zapp

      Comment


      • #4
        Stand-alone scaffolding will never be as good a built-in scaffolder that can take in all information used for the assembly...

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 04-25-2024, 11:49 AM
        0 responses
        19 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-24-2024, 08:47 AM
        0 responses
        18 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        62 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Working...
        X