Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • NicoBxl
    not just another member
    • Aug 2010
    • 264

    scripture with paired-end data - problem

    Hi,

    I've problem with scripture.

    I performed alignments with tophat 2 (paired-end, 76bp) and now want to use scripture to build a transcriptome. When I check the scripture help, I don't understand how to use it.

    Here's the help :


    Code:
    Parameters 
     -alignment <Alignment file in BAM, SAM or Alignemnt format> 
     -maskFileDir <Mask File directory> 
     -out <Output file name>
     -windows <Comma separated list of windows to evaluate>
     -sizeFile <Chromosome size file>  
     Optional arguments: 
     -trim <Include this flag if trimming of the ends of windows based on read coverage  is desired this is expensive> 
     -alpha <Desired FDR>
     one of -chr <Chromsomosome to segment> or -chrSequence <Necessary to filter spliced reads by splice site information. Notice that this is only compatible with region files that contain regions of only one chromosome> -dontFilterCanonicalSplice
     -start <To segment only a subregion of the chromosome include its start> -end <To segment only a subregion of the chromosome include its end>
     -minSpliceSupport <Minimum count to support splice reads, default is 1> 
     -pairedEnd <Paired end alignment files> -strandSpecificReads <Strand specific alignment file> -scoreRegions <Full BED to score> -upWeightSplices -lambda <If a prior background expectation for number of reads per base exists> -exons <BED file of exons> -introns <Introns and counts>
    
    Task: AddPairs -  Uses a paired end alignment to tune graph 
    	-in <Graph in .dot format. Standard input is assumed> 
    	-pairedEnd <Paired end information (as in previous task), in single line BED format>
    	 -maskFileDir <Directory containing mask files for the genome> 
    	-chr <Chromosome (only a chromosome at a time is supported at this point)> 
    	-sizeFile <Chromosome size file> 
    	-out <Output file name>
    
    Task: score -  Computes several expression related scores for a set of annotations -in <Full BED file with annotations to score> 
    	-alignment <Alignment file in BAM, SAM or Alignemnt format> 
    	-sizeFile <Chromosome size file> 
    	-out <Output file name> 
    	 -maskFileDir <Mask File directory>

    My questions :

    The accepted_hits.bam from tophat, should I input it with -alignment or -pairedEnd.
    What is -maskFileDir and -windows param ?


    Thanks a lot,

    N.
  • rboettcher
    Member
    • Oct 2010
    • 71

    #2
    Hi Nico,

    since both cufflinks and scriptute are provided by the Broad institute, my guess is that both have similar input parameters. That in mind, the maskFileDir should contain a gtf file with known regions that should be excluded ("masked") from the assembly process to focus on novel transcripts only.

    Looking at the walktrough provided on the Scripture website
    (http://www.broadinstitute.org/softwa...hrough_example),
    they suggest to concatenate your paired end alignment files and then use the -alignment flag.

    I'm interested in the outcome of your analysis in general (especially compared to cufflinks), so please keep me posted.

    Best regards,
    René

    Comment

    • NicoBxl
      not just another member
      • Aug 2010
      • 264

      #3
      Thanks.

      what about the -windows param ?

      Comment

      • rboettcher
        Member
        • Oct 2010
        • 71

        #4
        Originally posted by Eric Fournier View Post
        To anyone coming across this thread because they can't figure out how to set Scripture's -windows flag (Which is listed as optional in the documentation, but seems mandatory when running scripture.jar):

        this problem can also be solved by using the latest beta-2 build, which is available from http://www.broadinstitute.org/softwa...re/?q=download
        hope this helps

        Comment

        • bruce01
          Senior Member
          • Mar 2011
          • 160

          #5
          I have been trying to run Scripture using Samtools merged .bam files for my conditions. The original .bam files are from Tophat2. When I run Scripture I get "Has pairs: false" when data is from paired-end sequence. Do you have to give the original .fq files? What comes after -pairedData flag? In short: how do I get "Has pairs: true"?

          Help much appreciated.

          Comment

          • rboettcher
            Member
            • Oct 2010
            • 71

            #6
            So I also tried scripture in beta version 2, after alignment with tophat2 (as recommended on the homepage), however I keep getting the following warning message:

            [SamAlignment.java:243] [main] Warning: invalid quality string length. Read length = 101 quality string length = 0

            The resulting files contain quite a lot of transcript estimates, still they all score 0.0.

            Any suggestions?

            Comment

            • baibaijingjing
              Junior Member
              • Jul 2013
              • 1

              #7
              I have got the same problem when I used the example data downloaded from ftp://ftp.ncbi.nih.gov/pub/geo/DATA/...aligned.sam.gz. but Has pairs is false and all the scores are 0. I hope someone would solve it .

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM
              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              38 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-09-2026, 11:58 AM
              0 responses
              100 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-05-2026, 10:09 AM
              0 responses
              121 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-04-2026, 08:59 AM
              0 responses
              114 views
              0 reactions
              Last Post SEQadmin2  
              Working...