  • epigen
    Senior Member
    • May 2010
    • 101

    BFAST bfast.submit.pl configuration

    Could anyone please provide a working config file for bfast.submit.pl to use as an example? We tried to create one, but it is sometimes hard to guess which parameter name in the schema corresponds to which option in the BFAST programs. Also, Eclipse reports that the provided XML schema is incorrect in some places. Despite our efforts, bfast.submit.pl exited without producing output. (We're using the data from the latest BFAST version.) bfast.submit.pl looks like a valuable tool, and it would be very useful to get it running on our cluster.

    In that context, I'd like to know what the most efficient way of running BFAST is. I can use a node with 16 CPUs and up to 128 GB RAM. The 10 indexes for the human genome are 12 GB each, so it's probably impossible to load them all into memory and keep enough space for the rest, especially when using pipes. From what I have observed, reading the indexes (one at a time, as done by default) is the most time-consuming part in our case. Instead of splitting the reads into many chunks and calling multiple instances of bfast match with all indexes, I think it would be better to process all reads with one index at a time, in parallel.
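The trade-off described above can be put into numbers with a small Python sketch. The figures 128 GB, 12 GB and 10 indexes come from the post. The 16 GB reserve for everything else and the 20 read chunks are purely illustrative assumptions, and the helper names are hypothetical. The sketch assumes that each bfast match instance loads every index it uses exactly once; it does not call BFAST itself.

```python
def indexes_resident(total_ram_gb, index_gb, reserve_gb):
    """Number of indexes that fit in RAM after reserving reserve_gb for other data."""
    return max(0, int((total_ram_gb - reserve_gb) // index_gb))


def index_loads(num_indexes, num_read_chunks, per_chunk):
    """Count how often an index is read from disk.

    per_chunk=True:  one instance per read chunk, each loading all indexes.
    per_chunk=False: each index is loaded once and all reads are run against it.
    """
    if per_chunk:
        return num_indexes * num_read_chunks
    return num_indexes


if __name__ == "__main__":
    # 16 GB reserve and 20 chunks are assumptions, not values from the post.
    print("indexes that fit in RAM:", indexes_resident(128, 12, 16))
    print("loads, all indexes per chunk:", index_loads(10, 20, per_chunk=True))
    print("loads, one index for all reads:", index_loads(10, 20, per_chunk=False))
```

Under these assumptions, loading each index once for all reads cuts the number of index loads by a factor equal to the number of read chunks, which matters when index loading dominates the run time.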

    Thanks in advance for the help

    Barbara
  • Vincenzo
    Junior Member
    • Feb 2011
    • 4

    #2
    Dear all,
    this is my config file.
    I don't get a SAM file in the output, only some shell scripts: match.sh, localign.sh, postprocess.sh, sam.sh.
    I don't know what the next step is because, as you wrote, it's sometimes hard to guess which parameter name in the schema stands for which in the BFAST programs.
    A suggestion:
    if you get validation errors in Eclipse, don't worry; they do not prevent bfast from running.
    I am a beginner, so I don't know if this can help you. I hope so.
    Anyway, this is the BASIC XML version to run it:
    <?xml version="1.0" encoding="UTF-8"?>
    <bfastConfig xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="BfastConfig_5.xsd">
      <globalOptions>
        <bfastBin>/cbbl/users/vincenzo/bin/</bfastBin>
        <samtoolsBin>/cbbl/users/vincenzo/bin/</samtoolsBin>
        <qsubBin>/cbbl/users/vincenzo/bin/run_parallel/</qsubBin>
        <fastaFileName>/cbbl/users/vincenzo/bin/run_parallel/DH10B.fa</fastaFileName>
        <runDirectory>/cbbl/users/vincenzo/bin/run_parallel/</runDirectory>
        <readsDirectory>/cbbl/users/vincenzo/bin/run_parallel/</readsDirectory>
        <outputDirectory>/cbbl/users/vincenzo/bin/run_parallel/</outputDirectory>
        <tmpDirectory>/cbbl/users/vincenzo/bin/run_parallel/</tmpDirectory>
        <outputID>result</outputID>
        <cleanUsedIntermediateFiles>0</cleanUsedIntermediateFiles>
        <numReadsPerFASTQ localalignSplit="20" matchSplit="20">20</numReadsPerFASTQ>
        <timing>ON</timing>
        <queueType>SGE</queueType>
        <space>NT</space>
      </globalOptions>
      <matchOptions>
        <readCompression>gz</readCompression>
        <threads>1</threads>
      </matchOptions>
      <localalignOptions>
        <threads>1</threads>
      </localalignOptions>
      <postprocessOptions>
        <threads>1</threads>
      </postprocessOptions>
      <samOptions>
        <samtools>1</samtools>
      </samOptions>
    </bfastConfig>
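
Before handing a config like this to bfast.submit.pl, it can help to confirm that the file is at least well-formed XML and to list what each section sets. The Python sketch below does only that, using the standard library; it does not validate against BfastConfig_5.xsd and does not know which options bfast.submit.pl actually requires. The embedded `CONFIG` string is an abbreviated copy of the file above, and `summarize` is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET

# Abbreviated copy of the config above, for illustration only.
CONFIG = """<bfastConfig xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:noNamespaceSchemaLocation="BfastConfig_5.xsd">
  <globalOptions>
    <outputID>result</outputID>
    <numReadsPerFASTQ localalignSplit="20" matchSplit="20">20</numReadsPerFASTQ>
    <queueType>SGE</queueType>
    <space>NT</space>
  </globalOptions>
  <matchOptions>
    <readCompression>gz</readCompression>
    <threads>1</threads>
  </matchOptions>
  <samOptions>
    <samtools>1</samtools>
  </samOptions>
</bfastConfig>"""


def summarize(config_text):
    """Parse the config and return {section: {option: value}}.

    ET.fromstring raises xml.etree.ElementTree.ParseError if the text
    is not well-formed XML.
    """
    root = ET.fromstring(config_text)
    return {
        section.tag: {opt.tag: (opt.text or "").strip() for opt in section}
        for section in root
    }


if __name__ == "__main__":
    for section, options in summarize(CONFIG).items():
        print(section)
        for name, value in options.items():
            print(f"  {name} = {value}")
```

To check a real file, read it in binary mode and pass the bytes to `summarize`, for example `summarize(open("config.xml", "rb").read())`, where `config.xml` stands in for your own file name.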

    To add any parameter, you can take inspiration from the XSD and from bfast.submit.pl.
    If you solve it, please share your code.
    Thanks a lot.
    Vincenzo
