Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • papori
    Senior Member
    • Dec 2010
    • 181

    bwa sampe segmentation fault

    Hi all,
    i am using bwa-0.5.9 for illumina output.

    i have reads from 2 samples of same organism.
    So i have 2 databases.

    when i run BWA on the first, everything went well. the SAM file is 3.3GB.

    On the second, i got segmentation fault in the last part, when converting to *.sam.
    it succeed to convert 2.2GB , and then seg` fault.

    My steps are:
    bwa index -a is database.fasta

    bwa aln database.fasta read_1.fastq > database_aln_sa_1.sai
    bwa aln database.fasta read_2.fastq > database_aln_sa_2.sai

    bwa sampe database.fasta database_aln_sa_1.sai database_aln_sa_2.sai read_1.fastq read_2.fastq > database_aln.sam

    When i run bwa with samse on each end separately , i got 2 sam file , each in size 1.6GB , as expected.


    My questions are:
    How can i solve the Segmentation fault?
    Or, Can i merge the 2 sam files that have been resulted separately to one sam?without loosing any data?

    Thanks in advance..
  • giorgifm
    Member
    • Aug 2011
    • 35

    #2
    Dear papori,

    my best solution at BWA segmentation fault has always been to change cpu and increase RAM. Otherwise, you may try splitting your original paired fastq files in half and try again.

    And then merge the resulting (paired) BAM files with Picard Tools MergeSamFiles

    Good luck!

    Comment

    • CompBio
      Member
      • Aug 2009
      • 26

      #3
      I'm running into a similar problem. I've got about 1 million paired-end reads (~500,000 pairs) that I'm aligning to about 6,000 transcript sequences. bwa aln works fine for both of them, but bwa sampe exits immediately with a segmentation fault, both with and without the -P option.

      I'm using the following format:
      bwa sampe -f part_1_part_2.sam transcripts.fa part_1.aln part_2.aln part_1.fq part_2.fq
      The only feedback I get is:
      Segmentation fault (core dumped)
      I'm working on a machine with 128GB memory, so I really can't imagine it's a memory issue. I've used bwa sampe successfully with similar data (500,000 pairs and 28,000 transcripts), but not with this particular data set.

      It's frustrating, to say the least.

      Comment

      • stylz2k
        Junior Member
        • Nov 2012
        • 8

        #4
        I'm having the same trouble with bwa 0.6.2. Its really frustrating.

        I have a machine with Intel Core i5 and 8 GB RAM.

        Comment

        • fongchun
          Member
          • May 2011
          • 55

          #5
          Did you guys ever find solutions to your problems of segmentation fault?

          Comment

          • CompBio
            Member
            • Aug 2009
            • 26

            #6
            In our case the problem went away when we reduced the size of the headers in the FASTA file. We were storing meta-data in the headers that sometimes grew pretty long (100's of characters). By storing the meta-data in a table and using only the indexes as FASTA headers, the problem went away.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Pathogen Surveillance with Advanced Genomic Tools
              by seqadmin




              The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
              Yesterday, 11:48 AM
            • seqadmin
              New Genomics Tools and Methods Shared at AGBT 2025
              by seqadmin


              This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

              The Headliner
              The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
              03-03-2025, 01:39 PM
            • seqadmin
              Investigating the Gut Microbiome Through Diet and Spatial Biology
              by seqadmin




              The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
              02-24-2025, 06:31 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 03-20-2025, 05:03 AM
            0 responses
            26 views
            0 reactions
            Last Post seqadmin  
            Started by seqadmin, 03-19-2025, 07:27 AM
            0 responses
            33 views
            0 reactions
            Last Post seqadmin  
            Started by seqadmin, 03-18-2025, 12:50 PM
            0 responses
            25 views
            0 reactions
            Last Post seqadmin  
            Started by seqadmin, 03-03-2025, 01:15 PM
            0 responses
            190 views
            0 reactions
            Last Post seqadmin  
            Working...