Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Fernando Seixas
    Junior Member
    • Oct 2013
    • 8

    Reference Based de Novo Assembly

    Hi,

    I'm trying to de novo assembly the genome of a non-model mammalian species with a relatively large genome (~3 Gb) for which we have a relatively close reference - average 5% divergent (~12 My). What I was wondering is if there is any way I can take advantage of this relatively close reference to assemble my genome and in that case what would be the best strategy.

    One of things I thougth was to first do the de novo assembly of my genome (in allpaths-lg, for which the data was specially tailored) and then to use the reference to help scaffolding the contigs/scaffolds resulting from the de novo assembly - i'm aware that, for instance, AlignGraph is designed for this task. Could this be a good solution or is there any other tool available (not that I don't want to use AlignGraph but I want to be aware of the possible tools)?

    Thanks,
    Fernando
    Fernando
  • illinu
    Member
    • Jul 2013
    • 55

    #2
    You can still use ALLPATHS-LG with the reference based assembly option

    Comment

    • syfo
      Just a member
      • Nov 2012
      • 103

      #3
      I'm curious about this one: IDBA-Hybrid: an iterative De Bruijn Graph De Novo Assembler for hybrid sequencing

      Comment

      • illinu
        Member
        • Jul 2013
        • 55

        #4
        Let us know how it goes. I guess by hybrid it means short reads and PacBio??
        I used a very close reference (2 My) to aid a de novo assembly with ALLPATHS-LG and I saw no improvement over the de novo itself. Plus I am not sure it is a good idea to force DNA to go where your close reference has it, the little difference between the two species might be due to gene copy, larger genes or in different places of the genome. I am not too keen to use a reference if it's not the same species. For scaffolding it is a good idea. Have you tried SSPACE? you can also use a transcriptome to scaffold.

        Comment

        • EricSchon
          Member
          • Nov 2013
          • 10

          #5
          Hi I am considering using AlignGraph as well to improve my de novo assembly (only PE illumine reads) of a filamentous fungus (30 Mb genome). What was your experience with the tool?

          Comment

          • ymc
            Senior Member
            • Mar 2010
            • 496

            #6
            I want to assemble a bacteria strain from only PE150 reads. I have a finished reference that is the same lineage of the strain. I already mapped my reads to the reference and generated a bam.

            Which reference-based assembler do you recommend for my case? Thanks a lot!

            Comment

            • illinu
              Member
              • Jul 2013
              • 55

              #7
              You have a lot of options but for bacteria look at MIRA, Ray, Abyss, Velvet... I find that depending on the species one software might perform better than other so you have to try several and see which one works best for you.

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM
              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              38 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-09-2026, 11:58 AM
              0 responses
              101 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-05-2026, 10:09 AM
              0 responses
              123 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-04-2026, 08:59 AM
              0 responses
              114 views
              0 reactions
              Last Post SEQadmin2  
              Working...