Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Vincenzo
    Junior Member
    • Feb 2011
    • 4

    Bfast parallel running

    Dear,
    i am a beginner.
    I have a question about the running of bfast in parallel way.
    _i downloaded the XML schema.
    _ i have all files for the running: .brg, .bif, .fai.,reads.fastq
    My cluster is SGE.
    The input is in the NT space.(from Illumina).
    When i launch ./bfast.submit.pl -config schema.xml the result is:

    "Unable to recognise encoding of this document at /soft/lib/perl5/site_perl/5.8.8/XML/SAX/PurePerl/EncodingDetect.pm line 100.
    The global options were not found."

    The schema.xml is(i modified it for my job):


    <?xml version="1.0"?>
    <xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
    <xs:element name="bfastConfig">
    <xs:complexType>
    <xs:sequence>
    <xs:element name="globalOptions">
    <xs:complexType>
    <xs:sequence>
    <xs:element name="bfastBin" type="directoryPath" value="bfast.submit.pl"/>
    <xs:element name="samtoolsBin" type="directoryPath" value="samtools"/>
    <xs:element name="qsubBin" type="directoryPath"/>
    <xs:element name="FastaFileName" value="phi.x.174.fa" type="filePath" use="required"/>
    <xs:element name="runDirectory" type="directoryPath" use="required"/>
    <xs:element name="readsDirectory" type="directoryPath" use="required"/>
    <xs:element name="outputDirectory" type="directoryPath" use="required"/>
    <xs:element name="tmpDirectory" type="directoryPath" use="required"/>
    <xs:element name="outputID" type="xs:string" use="required"/>
    <xs:element name="cleanUsedIntermediateFiles" type="xs:integer" use="required"/>
    <xs:element name="numReadsPerFASTQ" type="integer" value="1">
    <xs:complexType>
    <xs:attribute name="matchSplit" type="positiveInteger" use="required"/>
    <xs:attribute name="localalignSplit" type="positiveInteger" use="required"/>
    </xs:complexType>
    </xs:element>
    <xs:element name="timing">
    <xs:simpleType>
    <xs:restriction base="xs:string">
    <xs:enumeration value="ON"/>
    </xs:restriction>
    </xs:simpleType>
    </xs:element>
    <xs:element name="queueType" use="required">
    <xs:simpleType>
    <xs:restriction base="xs:string">
    <xs:enumeration value="SGE"/>
    </xs:restriction>
    </xs:simpleType>
    </xs:element>
    <xs:element name="space" use="required">
    <xs:simpleType>
    <xs:restriction base="xs:string">
    <xs:enumeration value="NT"/>
    </xs:restriction>
    </xs:simpleType>
    </xs:element>
    </xs:sequence>
    </xs:complexType>
    </xs:element>
    <xs:element name="matchOptions">
    <xs:complexType>
    <xs:sequence>
    <xs:element name="mainIndexes" type="xs:string"/>
    <xs:element name="secondaryIndexes" type="xs:string"/>
    <xs:element name="offsets" type="xs:string"/>
    <xs:element name="loadAllIndexes" type="xs:integer"/>
    <xs:element name="readCompression">
    <xs:simpleType>
    <xs:restriction base="xs:string">
    <xs:enumeration value="none"/>
    </xs:restriction>
    </xs:simpleType>
    </xs:element>
    <xs:element name="keySize" type="positiveInteger"/>
    <xs:element name="maxKeyMatches" type="positiveInteger"/>
    <xs:element name="maxNumMatches" type="positiveInteger"/>
    <xs:element name="strand">
    <xs:simpleType>
    <xs:restriction base="xs:string">
    <xs:enumeration value="BOTH"/>
    <xs:enumeration value="FORWARD"/>
    <xs:enumeration value="REVERSE"/>
    </xs:restriction>
    </xs:simpleType>
    </xs:element>
    <xs:element name="threads" type="positiveInteger" value="2"/>
    <xs:element name="mergeSeparate" type="xs:integer"/>
    <xs:element name="qsubQueue" type="xs:string"/>
    <xs:element name="qsubArgs" type="xs:string"/>
    </xs:sequence>
    </xs:complexType>
    </xs:element>
    <xs:element name="localalignOptions">
    <xs:complexType>
    <xs:sequence>
    <xs:element name="offset" type="nonNegativeInteger"/>
    <xs:element name="threads" type="integer" value="2"/>
    <xs:element name="pairedEndLength" type="xs:integer"/>
    <xs:element name="mirrorType" type="xs:integer"/>
    <xs:element name="forceMirror">
    <xs:simpleType>
    <xs:restriction base="xs:integer">
    <xs:minInclusive value="0"/>
    <xs:maxInclusive value="3"/>
    </xs:restriction>
    </xs:simpleType>
    </xs:element>
    <xs:element name="qsubQueue" type="xs:string"/>
    <xs:element name="qsubArgs" type="xs:string"/>
    </xs:sequence>
    </xs:complexType>
    </xs:element>
    <xs:element name="postprocessOptions">
    <xs:complexType>
    <xs:sequence>
    <xs:element name="algorithm">
    <xs:simpleType>
    <xs:restriction base="xs:integer">
    <xs:minInclusive value="0"/>
    <xs:maxInclusive value="3"/>
    </xs:restriction>
    </xs:simpleType>
    </xs:element>
    <xs:element name="outputFormat" type="xs:integer"/>
    <xs:element name="threads" type="positiveInteger" value="2"/>
    <xs:element name="qsubQueue" type="xs:string"/>
    <xs:element name="qsubArgs" type="xs:string"/>
    </xs:sequence>
    </xs:complexType>
    </xs:element>
    <xs:element name="samOptions">
    <xs:complexType>
    <xs:sequence>
    <xs:element name="samtools" type="integer" use="required"/>
    <xs:element name="qsubQueue" type="xs:string"/>
    <xs:element name="qsubArgs" type="xs:string"/>
    </xs:sequence>
    </xs:complexType>
    </xs:element>
    </xs:sequence>
    </xs:complexType>
    </xs:element>
    <xs:simpleType name="filePath">
    <xs:restriction base="xs:string">
    <xsattern value="/home/vincenzo/run_parallel"/>
    </xs:restriction>
    </xs:simpleType>
    <xs:simpleType name="directoryPath">
    <xs:restriction base="xs:string">
    <xsattern value="/home/vincenzo/run_parallel"/>
    </xs:restriction>
    </xs:simpleType>
    <xs:simpleType name="nonNegativeInteger">
    <xs:restriction base="xs:integer">
    <xs:minInclusive value="0"/>
    </xs:restriction>
    </xs:simpleType>
    <xs:simpleType name="positiveInteger">
    <xs:restriction base="xs:integer">
    <xs:minInclusive value="1"/>
    </xs:restriction>
    </xs:simpleType>
    </xs:schema>

    Thanks a lot.
    Please tell me if there is a clear tutorial about this phase of running.
    This is my first post, i hope is clear.
    Bye
  • nilshomer
    Nils Homer
    • Nov 2008
    • 1283

    #2
    There is no tutorial about this script and is meant for advanced users. You have not converted the XML schema into a valid XML file. You should first read a tutorial about XML and creating an XML file from an XML schema. For example,
    Code:
    <xs:element name="qsubQueue" type="xs:string"/>
    should be
    Code:
    <qsubQueue>SGE</qsubQueue>
    .

    Comment

    • Vincenzo
      Junior Member
      • Feb 2011
      • 4

      #3
      Can you share a complete file.xml to inspire and help me in the develop of this config file?
      For example a file.xml that you used to run your jobs.
      Thanks a lot

      Comment

      • nilshomer
        Nils Homer
        • Nov 2008
        • 1283

        #4
        Please email the mailing list for further questions: [email protected]. There may be someone on the list who can share their xml configuration file with you.

        Comment

        Latest Articles

        Collapse

        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM
        • SEQadmin2
          From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
          by SEQadmin2


          Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


          The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
          ...
          06-02-2026, 10:05 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        24 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-09-2026, 11:58 AM
        0 responses
        42 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-05-2026, 10:09 AM
        0 responses
        48 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-04-2026, 08:59 AM
        0 responses
        49 views
        0 reactions
        Last Post SEQadmin2  
        Working...