  • Partial Order Alignment Step

    I'm running through Jared and Nick's Nature Methods de novo assembly approach on my Lambda burn-in FAST5 data, just to get the pipeline up and running and gain some familiarity using a focused data set.

    I've successfully converted FAST5 to FASTA using poretools. Using the nanocorrect pipeline, I've run the DALIGNER steps and am now running the Partial Order Alignment step with poaV2.

    It is working and the corrected.fasta file is growing... slowly. I've been tailing the file, and the output, when BLASTed, gives me 95% accuracy against the NCBI RefSeq Lambda phage sequence. It's been chugging along for a day. I have a 16-core 3.7 GHz setup with 64 GB of RAM and plenty of SSD space to spare. It's only using a single thread (based on my system process utilization), and it has only used about 150 MB of working RAM.

    Wondering what others have done to parallelize this step, or what can be done to speed it up?
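One generic way to use more than one core for a single-threaded, per-read step is to split the read indices into contiguous ranges and run one process per range. The sketch below assumes the correction of each read is independent of the others and that the tool can be told which reads to process. Neither is confirmed in this thread, and `build_cmd` is a hypothetical callback you would write yourself to return the real command line for a range. It is worth checking the nanocorrect README first, since the pipeline may already offer its own way to do this.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def split_ranges(n_reads, n_chunks):
    """Split read indices 0..n_reads-1 into contiguous half-open (start, end) ranges."""
    if n_reads <= 0:
        return []
    # Never create more chunks than there are reads.
    n_chunks = max(1, min(n_chunks, n_reads))
    base, extra = divmod(n_reads, n_chunks)
    ranges = []
    start = 0
    for i in range(n_chunks):
        # The first `extra` chunks get one additional read each.
        end = start + base + (1 if i < extra else 0)
        ranges.append((start, end))
        start = end
    return ranges


def run_chunks(ranges, build_cmd, workers):
    """Run one external process per range, at most `workers` at a time.

    build_cmd(start, end) is a HYPOTHETICAL user-supplied function that
    returns the argv list for correcting reads start..end-1. Each process
    writes its stdout to its own file, so outputs never interleave.
    """
    def run_one(rng):
        start, end = rng
        out_path = f"corrected.{start}-{end}.fasta"
        with open(out_path, "w") as out:
            subprocess.run(build_cmd(start, end), stdout=out, check=True)
        return out_path

    # Threads are enough here: the heavy work happens in the child processes.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_one, ranges))
```

On a 16-core machine, `split_ranges(n_reads, 16)` followed by `run_chunks(..., workers=16)` would keep every core busy, and the per-range FASTA files can be concatenated afterwards.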

  • #2
    I used PBcR and then nanopolish with 2D reads only and I got good results. Is there a much better pipeline than PBcR+nanopolish?


    • #3
      Nanocorrect (DALIGNER + POA) is the step preceding the Celera assembly and nanopolish. That is to say, PBcR and nanopolish come next once the POA is done... when it gets done.


      • #4
        Thanks. Let me give it a try.


        • #5
          Ah. Nanocorrect outputs FASTA but PBcR requires FASTQ input. How do you deal with that?
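One common workaround when a tool insists on FASTQ but only FASTA is available is to attach a constant placeholder quality to every base. The sketch below is an illustration of that idea, not something anyone in this thread reports using. The helper names and the choice of `5` (Phred 20 in Phred+33 encoding) are assumptions, and whether PBcR behaves sensibly with uniform qualities would need to be checked.

```python
def read_fasta(lines):
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = line[1:], []
        elif name is not None:
            # Sequence may be wrapped over several lines.
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def fasta_to_fastq(lines, qual_char="5"):
    """Return FASTQ text with a constant PLACEHOLDER quality per base.

    qual_char="5" is Phred 20 in Phred+33; it carries no real information
    about base accuracy and is an assumption made for illustration.
    """
    records = []
    for name, seq in read_fasta(lines):
        records.append(f"@{name}\n{seq}\n+\n{qual_char * len(seq)}\n")
    return "".join(records)
```

Usage would look like opening the corrected FASTA, passing the file handle to `fasta_to_fastq`, and writing the returned text to a `.fastq` file.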


          • #6
            Interesting. I tried the combined PBcR MHAP pipeline with the oxford.spec and arrived at an assembly in 20 minutes with a 98% match to the NCBI RefSeq for Lambda.

            DALIGNER, POA, and runCA with the oxford.spec arrived at the assembly after 1.5 days with a >99% match to the NCBI RefSeq for Lambda.

            The major difference seems to be that the latter is more accurate in the homopolymer runs.

            Still, for rapid identification and other purposes, the PBcR MHAP pipeline is more than adequate.

            -Tom


            • #7
              How did you obtain the frg file needed for runCA? I suppose you only had one FASTA file from the nanocorrect pipeline, without any qual file, right?
