Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Haumich
    Junior Member
    • Jul 2015
    • 5

    ABYSS never completes a run

    Hello Everyone,

    I am currently trying to assemble mammalian paired-end DNASeq files from Illumina using ABYSS. The problem is that ABYSS gets stuck during the Assembly and never finishes (or at least it didn't during seven days).
    The data consists of two 80GB fastq files that I fed to ABYSS 1.5.2. After days, the terminal still shows
    0: Reading `/data/reads_1.fq'...
    1: Reading `/data/reads_2.fq'...
    It always stays at this prompt if I try to read multiple files. I can see that no data is actually read into the RAM.
    When I merge those files to a single one, the program reads data to the RAM, makes it to "Finding adjacenct k-mer..." and remains there.
    I can see that the processes are still running, but no files are written to the disk. I am working on a single node with 32 Processors and 720GB RAM. The server uses SLURM and openMPI version 1.8.5.
    I have read that sometimes the eager limit of MPI is to small, so I set it to a higher value with a given formula, but that didn't solve the problem.
    The command I use:
    Code:
    sbatch --job-name=abyss --partition=hive --nodes=1 --ntasks-per-node=32 --wrap "
    abyss-pe k=51 n=10 name=test np=32 in='/data/reads_1.fq /data/reads_2.fq'
    "
    If someone had a similar problem and would like to share some thoughts, any help is aprreciated.

    EDIT
    Thanks pmiguel for the remark.
    With the "v=-vv" option, I got additional output. It seems as if ABYSS really takes that long to load the reads. In the beginning, it takes about 35 seconds to read 100.000 reads. That increases to about 1 minute and so on. I have the feeling, its the server that is so slow.
    Last edited by Haumich; 08-03-2015, 06:16 AM. Reason: Additional information
  • pmiguel
    Senior Member
    • Aug 2008
    • 2328

    #2
    I suggest adding "v=-vv" (verbose mode) to your abyss-pe parameters to get a better idea of what is hanging it up. As long as you stdout and stderr are being captured to a log file, you will be able to see what the program is up to.

    Also, maybe you should update ABySS? The most recent release is 1.9.0
    --
    Phillip

    Comment

    • westerman
      Rick Westerman
      • Jun 2008
      • 1104

      #3
      Originally posted by Haumich View Post
      With the "v=-vv" option, I got additional output. It seems as if ABYSS really takes that long to load the reads. In the beginning, it takes about 35 seconds to read 100.000 reads. That increases to about 1 minute and so on. I have the feeling, its the server that is so slow.
      I think, but do not know, that it is ABySS that is slow. The more reads it has to work with the slower it goes. Much worse than linear speed. Could be a hashing issue (in which case normalized reads or sub-sampled reads might help) or a memory space issue. Phillip and I are currently working on a mammalian genome and might get some insights into ABySS's performance along the way.

      BTW: The ABySS forum is now hosted on Biostars.



      In particular you could look at:

      Comment

      • Haumich
        Junior Member
        • Jul 2015
        • 5

        #4
        Hello,

        thanks everyone for their suggestions. It seems that the problem was openMPI. At least, once I changed to mpich 3.1.3, the speed of the assembly increased greatly and the program finished in about a day.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          New Genomics Tools and Methods Shared at AGBT 2025
          by seqadmin


          This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

          The Headliner
          The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
          03-03-2025, 01:39 PM
        • seqadmin
          Investigating the Gut Microbiome Through Diet and Spatial Biology
          by seqadmin




          The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
          02-24-2025, 06:31 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 03-20-2025, 05:03 AM
        0 responses
        17 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-19-2025, 07:27 AM
        0 responses
        18 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-18-2025, 12:50 PM
        0 responses
        19 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-03-2025, 01:15 PM
        0 responses
        186 views
        0 reactions
        Last Post seqadmin  
        Working...