Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • ekkogecko
    Junior Member
    • Jun 2015
    • 4

    MetaVelvet caught in an infinite loop?

    Hello,

    I've been trying to run MetaVelvet assembly of an environmental DNA sample (short, paired end, fastq, illumina reads) on a supercomputer cluster, and have been unable to generate output. I am using Velvet 1.2.10 and MetaVelvet -1.1.01. The program will run to completion (in a fraction of a second) on very small files (containing 20,000 sequences each) for both single and paired end reads, and produce a functional meta-velvet contigs file. Vevleth and Velvetg both run successfully on files containing 40,000 sequences each. However, when running MetaVelvetg on these results, the program will run until reaching the wall timer.

    I am using the following sequence of commands:
    • mpirun -np $PBS_NP ~/bin/velvet-master/velveth $fol 31 -fastq -shortPaired $sequence1 $sequence2
    • mpirun -np $PBS_NP ~/bin/velvet-master/velvetg $fol -read_trkg yes -scaffolding no
    • mpirun -np $PBS_NP ~/MetaVelvet-1.1.01/meta-velvetg $fol -scaffolding no


    Where $sequence1 and $sequence2 point to their respective ends of each sequence, and $fol points to the output directory. I have used a different output directory with each run. I typically run the process with 6 nodes, each with 8 2.66 GHz processors and access to some allocation of 2.62 TB shared memory. I have tried up to 24 of those nodes, and MetaVelvet has never run to completion, so it seems like it's not a matter of processing power.

    When I've killed the process, the error file contained the following:
    MPI: could not run executable (case #3)
    MPI: No details available, no log files found
    /opt/torque/4.2.9/spool/mom_priv/jobs/77068.hokieone.SC: line 34: 369997 Killed
    mpirun -np $PBS_NP /home/slvt16/bin/velvet-master/velveth $fol 31 -fastq $sequence1
    MPI: could not run executable (case #3)
    MPI: No details available, no log files found
    /opt/torque/4.2.9/spool/mom_priv/jobs/77068.hokieone.SC: line 35: 374736 Killed
    mpirun -np $PBS_NP /home/slvt16/bin/velvet-master/velvetg $fol -read_trkg yes -scaffolding no

    The output log shows that the program is hanging on the ------Scafolding------ step, even when run with the '-scaffolding no' flag.

    I would like to run MetaVelvet on paired end files of ~400,000 reads each, eventually. I'm running out of ideas for troubleshooting, so any help would be appreciated!

    EDIT: Cross-Posted to https://www.biostars.org/p/150101/
    Last edited by ekkogecko; 07-08-2015, 11:50 AM. Reason: Sourced Xpost
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    Also on Biostars: https://www.biostars.org/p/150101/

    Comment

    • ekkogecko
      Junior Member
      • Jun 2015
      • 4

      #3
      Originally posted by GenoMax View Post
      I've cross posted to increase visibility, is that not allowed?

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        You can cross-post. If an answer is posted in the other forum many a times OP does not come back to this forum to indicate that a solution has been found. Link in post #2 is there as a reference for your biostars post.

        Comment

        • GenoMax
          Senior Member
          • Feb 2008
          • 7142

          #5
          Do you know what "MPI: could not run executable (case #3)" is referring to? Are you running into any other limits (storage, /tmp space)? Is "369997" a torque PID? If it is, can you ask the admins to see why that process was killed?

          Comment

          • ekkogecko
            Junior Member
            • Jun 2015
            • 4

            #6
            I'm not sure what "MPI: could not run executable (case #3)" means, I can send a message to the system admins for some clarification there.
            As far as storage limits, there's 100s of free gigabytes open. I've been able to assemble much larger genomes on this system using Ray, so I expect that this smaller dataset would be able to run with MetaVelvet (unless the requirements are wildly different).
            I'm not sure if "369997" is a torque PID, could you explain how to check that?

            Comment

            • RamakrishnanRS
              Junior Member
              • Oct 2012
              • 9

              #7
              It does look like a PID. I have not seen numeric identifiers for any other exposed entity in a cluster.
              Ram

              Comment

              • GenoMax
                Senior Member
                • Feb 2008
                • 7142

                #8
                @ekkogecko: Are you sure metavelvet is MPI compatible? Have you tried running it without MPI?

                Comment

                • RamakrishnanRS
                  Junior Member
                  • Oct 2012
                  • 9

                  #9
                  Plus, even if it were MPI compatible, the PBS script/command may need to be modified with an option or directive to pick an MPI-enabled node.
                  Ram

                  Comment

                  • ekkogecko
                    Junior Member
                    • Jun 2015
                    • 4

                    #10
                    Originally posted by RamakrishnanRS View Post
                    Plus, even if it were MPI compatible, the PBS script/command may need to be modified with an option or directive to pick an MPI-enabled node.
                    Thanks for the recommendation. As of yet, I've been unable to determine if it is truly MPI compatible. Running without MPI still fails on files >20,000 sequences. I ran a couple of test trials without MPI overnight to no avail.

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Pathogen Surveillance with Advanced Genomic Tools
                      by seqadmin




                      The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
                      03-24-2025, 11:48 AM
                    • seqadmin
                      New Genomics Tools and Methods Shared at AGBT 2025
                      by seqadmin


                      This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                      The Headliner
                      The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                      03-03-2025, 01:39 PM
                    • seqadmin
                      Investigating the Gut Microbiome Through Diet and Spatial Biology
                      by seqadmin




                      The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
                      02-24-2025, 06:31 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 03-20-2025, 05:03 AM
                    0 responses
                    41 views
                    0 reactions
                    Last Post seqadmin  
                    Started by seqadmin, 03-19-2025, 07:27 AM
                    0 responses
                    46 views
                    0 reactions
                    Last Post seqadmin  
                    Started by seqadmin, 03-18-2025, 12:50 PM
                    0 responses
                    36 views
                    0 reactions
                    Last Post seqadmin  
                    Started by seqadmin, 03-03-2025, 01:15 PM
                    0 responses
                    191 views
                    0 reactions
                    Last Post seqadmin  
                    Working...