Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • mrfox
    Senior Member
    • Aug 2010
    • 103

    Tophat2.03: error

    Hi all,

    I just tested Tophat2.03 released on May 26th using a small data set (paired-end data, 5000 reads, read len = 50). Unfortunately, I got the error message below. I am not sure if I made something wrong or it is a bug...



    [ao@xxxx tmp]$ tophat2 -r 150 -p 20 -g 1 -o ./ ../../bowtie2_index_hg19/hg19 tmp1.fq tmp2.fq

    [2012-05-28 23:28:16] Beginning TopHat run (v2.0.3)
    -----------------------------------------------
    [2012-05-28 23:28:53] Checking for Bowtie
    Bowtie version: 2.0.0.6
    [2012-05-28 23:28:54] Checking for Samtools
    Samtools version: 0.1.18.0
    [2012-05-28 23:28:54] Checking for Bowtie index files
    [2012-05-28 23:28:54] Checking for reference FASTA file
    [2012-05-28 23:28:54] Generating SAM header for ../../bowtie2_index_hg19/hg19
    format: fastq
    quality scale: phred33 (default)
    [2012-05-28 23:30:36] Preparing reads
    left reads: min. length=50, max. length=50, 2498 kept reads (2 discarded)
    right reads: min. length=50, max. length=50, 2494 kept reads (6 discarded)
    [2012-05-28 23:35:13] Mapping left_kept_reads to genome hg19 with Bowtie2
    [FAILED]
    Error running:
    /../tools/tophat-2.0.3/bam2fastx --all --fastq .//tmp/left_kept_reads.bam|/RIS/home/hzhao1/tools/bowtie2-2.0.0-beta6/bowtie2-align -q -k 1 -D 15 -R 2 -N 0 -L 20 -i S,1,1.25 --gbar 4 --mp 6,2 --np 1 --rdg 5,3 --rfg 5,3 --score-min L,-0.6,-0.6 -p 20 --sam-no-hd -x ../../bowtie2_index_hg19/hg19 -|/../tools/tophat-2.0.3/fix_map_ordering --bowtie2-min-score 10 --sam-header .//tmp/hg19_genome.bwt.samheader.sam - .//tmp/left_kept_reads.mapped.bam .//tmp/left_kept_reads_unmapped.bam --index-outfile .//tmp/left_kept_reads.mapped.bam.index
  • TrueSEQr
    Junior Member
    • Aug 2008
    • 7

    #2
    I have seen this error as well. However, I may have something to add that a potential solver will find relevant.

    When I see the error above in the stdout log, I see the following in the "tophat.log" under the log directory.

    [2013-07-30 00:03:55] Mapping left_kept_reads_unmapped to genome segment_juncs with Bowtie2 (1/1)
    open: No such file or directory
    Error: bam2fastx failed to open BAM file ./tophat_out/tmp/left_kept_reads_unmapped.bam
    [FAILED]

    Further, I see this in the bowtie.left_kept_reads_unmapped.log

    0 reads
    0.00% overall alignment rate

    This seems to be telling me that there is nothing to be captured in the "left_kept_reads_unmapped" file. However, I would expect Tophat to be able to anticipate something like that--maybe make a dummy file. Or, I would expect it somehow to be impossible.

    I use version 2.0.8, BTW.

    Comment

    • jp.
      Senior Member
      • Jul 2013
      • 142

      #3
      Hi
      I have have problem with Tophat2.. I have successfully analysed using tophat and cufflinks with insert size 250.
      do you know how to give commands for read length 101 x 2 (insert size 150) ?


      Originally posted by TrueSEQr View Post
      I have seen this error as well. However, I may have something to add that a potential solver will find relevant.

      When I see the error above in the stdout log, I see the following in the "tophat.log" under the log directory.

      [2013-07-30 00:03:55] Mapping left_kept_reads_unmapped to genome segment_juncs with Bowtie2 (1/1)
      open: No such file or directory
      Error: bam2fastx failed to open BAM file ./tophat_out/tmp/left_kept_reads_unmapped.bam
      [FAILED]

      Further, I see this in the bowtie.left_kept_reads_unmapped.log

      0 reads
      0.00% overall alignment rate

      This seems to be telling me that there is nothing to be captured in the "left_kept_reads_unmapped" file. However, I would expect Tophat to be able to anticipate something like that--maybe make a dummy file. Or, I would expect it somehow to be impossible.

      I use version 2.0.8, BTW.

      Comment

      • TrueSEQr
        Junior Member
        • Aug 2008
        • 7

        #4
        Here's the closest thing to a solution to the "unmapped" problem I've found so far: use Bowtie1. There is a --bowtie1 option on tophat v2. Use that, and make sure that bowtie is in your path. Oddly, when I first tried this, I found that tophat complained (and failed to run because of) that the bowtie version was not correct. It stated it had to be at least 0.12 something. I had to compensate by borrowing and copying the tophat script which is writtin in python, and commenting out that check. It seemed to run clear to the end for the first time.

        So that's two separate problems: 1--why is it failing to find this file if I run bowtie2 underneath. 2--why doesn't it like bowtie 1.0.0?

        Comment

        • TrueSEQr
          Junior Member
          • Aug 2008
          • 7

          #5
          BTW, jp, Sorry to say I am no expert at tophat. I am just trying to learn it now. Still, I would encourage you to start your own thread on this topic, because that question is quite different from the one I started here. It might get more attention that way, too.

          Comment

          • TrueSEQr
            Junior Member
            • Aug 2008
            • 7

            #6
            Bowtie1 is not universally solving this. I may have a bad install, here. I have downloaded data that was used for testing in writing the original TopHat paper, but that failed. Plus the strange workarounds I have had to do lead me to believe something in the software stack is not right. The "tophat test data" I downloaded, seemed to work just fine.

            So, bottom line: this was not strictly a bowtie2 / tophat 2 issue. Could also be the index I used, which I had built myself (I did make sure to build one such index per bowtie version, and point the tophat at the relevant version). But, then again, I have not yet found the "explicit instructions" for building such indices, and could have left a parameter out.

            Comment

            • TrueSEQr
              Junior Member
              • Aug 2008
              • 7

              #7
              I finally resorted to going through the tophat2.py code. I am not a Python programmer, but was able to get a hint. When I used the command-line option "--keep-tmp", this problem with the disappearing left_kept_reads_unmapped.bam did not occur.

              Instead, I see this:

              "Error while running bowtie:
              Error while flushing and closing output"

              and that last line is repeated seven times (BTW, I ran tophat with eight threads).

              So this raises the question: why does the left_kept_reads_unmapped.bam file get deleted if you are not clearing away temporary files, even though there are still downstream dependencies on that file? When I ran this against the test data, I did not have to use --keep-tmp to get it to run.

              Still a deeper problem.

              Comment

              Latest Articles

              Collapse

              • GATTACAT
                Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by GATTACAT
                Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                07-01-2026, 11:43 AM
              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, Yesterday, 11:08 AM
              0 responses
              6 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-30-2026, 05:37 AM
              0 responses
              11 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-26-2026, 11:10 AM
              0 responses
              19 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              53 views
              0 reactions
              Last Post SEQadmin2  
              Working...