Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Subread segmentation faults on Scientific Linux

    Hi

    I am running Subread on scientific linux on a HPC cluster with a GPFS file system. With big FASTQ (>35M reads) subread falls over half way through with a segmentation fault.

    It seems to be memory related as smaller files work okay. Increasing the amount of available memory to the process to 31GB seems to make no difference.

    Has anyone seen this before?

  • #2
    Memory does not sound like the culprit here. Are you running into some other limit, say storage (quota) or tmp space or time assigned for job?

    Comment


    • #3
      Unfortunately not, the disk space assigned to the node is 10TB, the walltime is 5 days (when typically the job takes 2-3 hours) and the temp space is "unlimited" although practically is about 4TB.

      Comment


      • #4
        Dr. Shi (author of Subread) participates here and we may hear something enlightening from him. But I would have thought that once the genome index is read into memory that requirement should be more or less satisfied. Unless subread works differently (I don't use subread) than other aligners.

        Comment


        • #5
          Originally posted by abeggs View Post
          Hi

          I am running Subread on scientific linux on a HPC cluster with a GPFS file system. With big FASTQ (>35M reads) subread falls over half way through with a segmentation fault.

          It seems to be memory related as smaller files work okay. Increasing the amount of available memory to the process to 31GB seems to make no difference.

          Has anyone seen this before?
          Could you please provide the screen output and also your commands? Subread has no problems to process more than 35 million reads.

          Comment


          • #6
            Hi,

            The command I use for subjunc is:
            subjunc -i $TMPDIR/hg19 -r $TMPDIR/$2_R1.fastq.gz -R $TMPDIR/$2_R2.fastq.gz -o $1/$2.bam -T 8 --gzFASTQinput --allJunctions
            And the output is:

            ========== _____ _ _ ____ _____ ______ _____
            ===== / ____| | | | _ \| __ \| ____| /\ | __ \
            ===== | (___ | | | | |_) | |__) | |__ / \ | | | |
            ==== \___ \| | | | _ <| _ /| __| / /\ \ | | | |
            ==== ____) | |__| | |_) | | \ \| |____ / ____ \| |__| |
            ========== |_____/ \____/|____/|_| \_\______/_/ \_\_____/
            v1.5.0-p1

            //============================= subjunc setting ==============================\\
            || ||
            || Function : Read alignment + Junction/Fusion detection (RNA-Seq) ||
            || Threads : 8 ||
            || Input file 1 : /scratch/beggsa_909907.bb2torque.bb2.cluster/s018 ... ||
            || Input file 2 : /scratch/beggsa_909907.bb2torque.bb2.cluster/s018 ... ||
            || Output file : /gpfs/projects/s-beggsa01/P141115-N-DW-28-2673271 ... ||
            || Index name : /scratch/beggsa_909907.bb2torque.bb2.cluster/hg19 ||
            || Phred offset : 33 ||
            || ||
            || All subreads : 14 ||
            || Min read1 votes : 1 ||
            || Min read2 votes : 1 ||
            || Max fragment size : 600 ||
            || Min fragment size : 50 ||
            || ||
            || Allowed mismatch : 3 bases ||
            || Max indels : 5 ||
            || # of Best mapping : 1 ||
            || Unique mapping : no ||
            || Hamming distance : no ||
            || Quality scores : no ||
            || ||
            \\===================== http://subread.sourceforge.net/ ======================//

            //====================== Running (31-Mar-2016 15:30:00) ======================\\
            || ||
            || The input file contains base space reads. ||
            || The range of Phred scores observed in the data is [2,36] ||
            || Load the 1-th index block... ||
            || Map fragments... ||
            || 0% completed, 0.3 mins elapsed, rate=3.7k fragments per second ||
            || Finish the 3,495,253 fragments... ||
            || 5% completed, 11 mins elapsed, rate=3.7k fragments per second ||
            || 5% completed, 11 mins elapsed, rate=3.8k fragments per second ||
            || 6% completed, 11 mins elapsed, rate=3.9k fragments per second ||
            || 6% completed, 12 mins elapsed, rate=4.0k fragments per second ||
            || 6% completed, 12 mins elapsed, rate=4.1k fragments per second ||
            || 7% completed, 12 mins elapsed, rate=4.2k fragments per second ||
            || 7% completed, 13 mins elapsed, rate=4.3k fragments per second ||
            || Map fragments... ||
            || 7% completed, 13 mins elapsed, rate=4.4k fragments per second ||
            || Finish the 3,495,253 fragments... ||
            || 13% completed, 24 mins elapsed, rate=4.1k fragments per second ||
            || 13% completed, 24 mins elapsed, rate=4.1k fragments per second ||
            || 13% completed, 25 mins elapsed, rate=4.1k fragments per second ||
            || 14% completed, 25 mins elapsed, rate=4.2k fragments per second ||
            || 14% completed, 26 mins elapsed, rate=4.2k fragments per second ||
            || 14% completed, 26 mins elapsed, rate=4.2k fragments per second ||
            || 15% completed, 26 mins elapsed, rate=4.3k fragments per second ||
            || Map fragments... ||
            || 15% completed, 27 mins elapsed, rate=4.3k fragments per second ||
            || Finish the 3,495,253 fragments... ||
            || 20% completed, 38 mins elapsed, rate=4.1k fragments per second ||
            || 21% completed, 38 mins elapsed, rate=4.1k fragments per second ||
            || 21% completed, 39 mins elapsed, rate=4.2k fragments per second ||
            || 21% completed, 39 mins elapsed, rate=4.2k fragments per second ||
            || 22% completed, 39 mins elapsed, rate=4.2k fragments per second ||
            || 22% completed, 40 mins elapsed, rate=4.2k fragments per second ||
            || 22% completed, 40 mins elapsed, rate=4.2k fragments per second ||
            || 23% completed, 41 mins elapsed, rate=4.2k fragments per second ||
            || Map fragments... ||
            || 23% completed, 41 mins elapsed, rate=4.2k fragments per second ||
            || Finish the 3,495,253 fragments... ||
            ./SubReadRNAPipeline-highmem: line 43: 19380 Segmentation fault subjunc -i $TMPDIR/hg19 -r $TMPDIR/$2_R1.fastq.gz -R $TMPDIR/$2_R2.fastq.gz -o $1/$2.bam -T 8 --gzFASTQinput --allJunctions

            Comment


            • #7
              Thanks for providing the info. Can you also send us the fastq files so that we can reproduce the problem and find out what went wrong?
              Best,
              Wei

              Comment


              • #8
                I have solved it!

                I recompiled from source instead of using the binaries and it worked fine.

                We have a scientific linux HPC and it seems there was something about that which was causing problems if you ran the precompiled binaries.

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  New Genomics Tools and Methods Shared at AGBT 2025
                  by seqadmin


                  This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                  The Headliner
                  The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                  03-03-2025, 01:39 PM
                • seqadmin
                  Investigating the Gut Microbiome Through Diet and Spatial Biology
                  by seqadmin




                  The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
                  02-24-2025, 06:31 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, 03-03-2025, 01:15 PM
                0 responses
                180 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 02-28-2025, 12:58 PM
                0 responses
                275 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 02-24-2025, 02:48 PM
                0 responses
                662 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 02-21-2025, 02:46 PM
                0 responses
                268 views
                0 likes
                Last Post seqadmin  
                Working...
                X