Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • BFAST match error

    Hi,

    I'm getting the following error from BFAST with my colorspace SOLiD reads in FASTQ format. Couldn't figure out what it is...

    All my reads are >20bp and I'm using a subset of all reads to test the program here.
    $ bfast match -f Mus_musculus.GRCm38.68.dna_rm.toplevel.fa -r ../test.fastq -A 1
    ************************************************************
    Checking input parameters supplied by the user ...
    Validating fastaFileName Mus_musculus.GRCm38.68.dna_rm.toplevel.fa.
    Validating readsFileName ../test.fastq.
    Validating tmpDir path ./.
    **** Input arguments look good!
    ************************************************************
    ************************************************************
    Printing Program Parameters:
    programMode: [ExecuteProgram]
    fastaFileName: Mus_musculus.GRCm38.68.dna_rm.toplevel.f
    a
    mainIndexes [Auto-recognizing]
    secondaryIndexes [Not Using]
    readsFileName: ../test.fastq
    offsets: [Using All]
    loadAllIndexes: [Not Using]
    compression: [Not Using]
    space: [Color Space]
    startReadNum: 1
    endReadNum: 2147483647
    keySize: [Not Using]
    maxKeyMatches: 8
    keyMissFraction: 1.000000
    maxNumMatches: 384
    whichStrand: [Both Strands]
    numThreads: 1
    queueLength: 250000
    tmpDir: ./
    timing: [Not Using]
    ************************************************************
    Searching for main indexes...
    Found 1 index (4 total files).
    Not using secondary indexes.
    ************************************************************
    Reading in reference genome from Mus_musculus.GRCm38.68.dna_rm.toplevel.fa.cs.br
    g.
    In total read 66 contigs for a total of 2730871774 bases
    ************************************************************
    Reading ../test.fastq into a temp file.
    Will process 250 reads.
    ************************************************************
    Searching index file 1/4 (index #1, bin #1)...
    Reading index from Mus_musculus.GRCm38.68.dna_rm.toplevel.fa.cs.1.1.bif.
    bfast: ../bfast/RGIndex.c:2015: RGIndexReadHeader: Assertion `index->length > 0'
    failed.
    ▒ ♥Aborted
    Thought this might be due to me giving BFAST an incomplete dataset. However, if I use the entire dataset (all FASTQ SOLiD reads), I get the following error
    *** glibc detected *** bfast: malloc():> memory corruption: 0x000000000220bcd0 **
    which seems like BFAST is running out of usable memory, when, in fact, I'm specifying 20gb of memory for a one lane of SOLiD FASTQ reads.

    Any ideas on how to solve this problem?

    Thanks

  • #2
    It looks like your index is corrupt, try rebuilding your indexes.

    Comment


    • #3
      Originally posted by nilshomer View Post
      It looks like your index is corrupt, try rebuilding your indexes.
      Hi nilshomer,

      Thanks for the reply. I'm making the index using the bfast fasta2brg function while specifying -A 1. The genome is in base space, however I want to align colorspace reads to it. Am I using -A correctly? My rationale was that you want to align colorspace reads to colorspace genome, so -A 1 is the way to go. But maybe I'm mistaken...

      Also, whats the correct way to specify the -m option for index? I have specified the following based on this post I saw on this forum
      Code:
      bfast fasta2brg -f Mus_musculus.GRCm38.68.dna_rm.toplevel.fa -A 1 -t out.tab
      
      bfast index -f Mus_musculus.GRCm38.68.dna_rm.toplevel.fa -A 1 -d 1 -R -T indexTMP -t bfastindex_out.txt -m 10111111011001100011111000111111 -w 14
      Thanks for your help

      EDIT: added bfast index command
      Last edited by flobpf; 10-01-2012, 11:08 AM.

      Comment


      • #4
        A few things, the "-t" option doesn't require an argument, and you have forgotten to create a base space version as well. See the bfast manual that comes with the distribution for examples (Chapter 7), as well as the command line options.
        It's something like this:
        bfast fasta2brg -f Mus_musculus.GRCm38.68.dna_rm.toplevel.fa -A 0 -t
        bfast fasta2brg -f Mus_musculus.GRCm38.68.dna_rm.toplevel.fa -A 1 -t
        bfast index -f Mus_musculus.GRCm38.68.dna_rm.toplevel.fa -A 1 -T indexTMP -t -m 10111111011001100011111000111111 -w 14

        Comment


        • #5
          Hi,
          I realize this thread is older but I get a similar error message while using bfast for colorspace alignment.

          My data consists of colorreads of length 75, dynamically trimmed down to >30 in case of bad sequencing quality. But the majority of reads still has length 75.

          When I use the 10 masks from the bfast manual to build 10 (primary) indexes, the alignment works fine. But many of the shorter trimmed reads are not aligned. So I used 10 more masks to build indexes for shorter reads, which I want to use as secondary indexes. They should only be used for unaligned, (= mostly trimmed) reads. Right?
          Anyway, when I try to run bfast as follows:
          bfast match -f $reference -i 1,2,3,4,5,6,7,8,9,10 -I 11,12,13,14,15,16,17,18,19,20 -r $infile -w 0 -n $nc -A 1 -z -t
          I receive the error:
          Copying unmatched reads for secondary index search.
          Splitting unmatched reads into temp files.
          *** glibc detected *** bfast: double free or corruption (!prev): 0x000000000065f290 ***
          I rebuild all indexfiles, but it didn't have any effect.

          Is it not possible to use this many indexes? Everything works fine if I only use 10 primary ones... Or doesn't it make sense to use this combination of indexes since bfast is not designed to align reads with variable lengths?

          Does anybody have suggestions what I did wrong? Thanks a lot in advance...

          Comment


          • #6
            I don't recollect how my problem got solved, but probably the solution was changing the version of BFAST. My other glibc problems have certainly been solved by changing the version of the program in question.

            Comment


            • #7
              Thank you for the reply!
              I am running the current version of bfast. Are you using an older one, if so, which one works for you?

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Genetic Variation in Immunogenetics and Antibody Diversity
                by seqadmin



                The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
                Yesterday, 07:24 PM
              • seqadmin
                Choosing Between NGS and qPCR
                by seqadmin



                Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
                10-18-2024, 07:11 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 11-01-2024, 06:09 AM
              0 responses
              28 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 10-30-2024, 05:31 AM
              0 responses
              21 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 10-24-2024, 06:58 AM
              0 responses
              25 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 10-23-2024, 08:43 AM
              0 responses
              57 views
              0 likes
              Last Post seqadmin  
              Working...
              X