Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • TopHat(1.3.2) reports error!help!

    Mapping RNA-Seq reads to reference with TopHat(v-0.1.3.2) results in error in that
    Code:
    Mapping right_kept_reads against genome_o_6 with bowtie
    gzip:stdout:Broken pipe
    ...
    10Error:[Error 2]No such file or directory:'./tophat_out/temp/right_kept_reads_missing.fq'
    what's happened to gzip with right_kept_reads?The left_kept_reads was processed properly.
    Need your help,thanks!

  • #2
    holywoool, have you found the answer to your question yet? I am getting the same error when I try to run Tophat. Does anybody know what is going on? Here is my entire message:

    [Wed Nov 16 16:17:54 2011] Beginning TopHat run (v1.3.3)
    -----------------------------------------------
    [Wed Nov 16 16:17:54 2011] Preparing output location /u/home/mcdb/arturj/B1/
    [Wed Nov 16 16:17:54 2011] Checking for Bowtie index files
    [Wed Nov 16 16:17:54 2011] Checking for reference FASTA file
    [Wed Nov 16 16:17:54 2011] Checking for Bowtie
    Bowtie version: 0.12.5.0
    [Wed Nov 16 16:17:54 2011] Checking for Samtools
    Samtools Version: 0.1.18
    [Wed Nov 16 16:17:54 2011] Generating SAM header for hg19_c
    [Wed Nov 16 16:17:54 2011] Preparing reads
    format: fastq
    quality scale: phred64 (reads generated with GA pipeline version >= 1.3)
    [Wed Nov 16 16:17:54 2011] Reading known junctions from GTF file
    Left reads: min. length=49, count=164672067
    [Wed Nov 16 17:01:35 2011] Mapping left_kept_reads against hg19_c with Bowtie
    [Wed Nov 16 17:01:35 2011] Processing bowtie hits

    gzip: stdout: Broken pipe
    Traceback (most recent call last):
    File "/u/home/mcdb/arturj/tophat-1.3.3.Linux_x86_64/tophat", line 2604, in ?
    sys.exit(main())
    File "/u/home/mcdb/arturj/tophat-1.3.3.Linux_x86_64/tophat", line 2563, in main
    user_supplied_deletions)
    File "/u/home/mcdb/arturj/tophat-1.3.3.Linux_x86_64/tophat", line 2218, in spliced_alignment
    segment_len)
    File "/u/home/mcdb/arturj/tophat-1.3.3.Linux_x86_64/tophat", line 1820, in split_reads
    zreads = ZReader(reads_filename, params.system_params, False)
    File "/u/home/mcdb/arturj/tophat-1.3.3.Linux_x86_64/tophat", line 1190, in __init__
    self.file=open(filename)
    IOError: [Errno 2] No such file or directory: '/u/home/mcdb/arturj/B1/tmp/left_kept_reads_missing.fq'

    And the bash script calling it..

    PATH=$PATH:/u/home/mcdb/arturj/bowtie-0.12.5/:/u/home/mcdb/arturj/tophat-1.3.3.Linux_x86_64/:/u/home/mcdb/arturj/samtools-0.1.18/
    export PATH
    export BOWTIE_INDEXES=/u/home/mcdb/arturj/bowtie-0.12.5/indexes/

    tophat --solexa1.3-quals -g 1 -G /u/home/mcdb/arturj/hg19_refflat.gtf -p 8 -o /u/home/mcdb/arturj/B1 hg19_c /u/home/mcdb/arturj/B1/B1.fastq


    Please help!

    Comment


    • #3
      Indeed,there needs a "-Q" option in the TopHat's command.Mine was done after adding "-Q".You may have a try.

      Comment


      • #4
        That definitely does not help me. I tried it, despite the manual saying it's for separate quality files. Has anyone else had this problem?
        Artur

        Comment


        • #5
          And thank you for your response, holywoool!

          Comment


          • #6
            I guess tat means that there are no unaligned reads..
            Why dont you simulate some reads and cat them to your read files and then try??

            Comment


            • #7
              Where do you see the release of version 1.3.1 and 1.3.3? They are downloadable from the site, but nothing about them is mentioned on the TopHat site.

              Comment


              • #8
                Setting -g means you won't see any reads that align more than once as it sets both the -m and -k flags in Bowtie to 1:

                -k <int> report up to <int> good alignments per read (default: 1)
                -m <int> suppress all alignments if > <int> exist (def: no limit)

                Chris

                Comment


                • #9
                  Originally posted by Thomas Doktor View Post
                  Where do you see the release of version 1.3.1 and 1.3.3? They are downloadable from the site, but nothing about them is mentioned on the TopHat site.
                  I see it only on the downloads site... http://tophat.cbcb.umd.edu/downloads/
                  It might be just a bug in this version, but I did try running 4 different datasets at a different time, and it worked fine. I've kept all the same options. The only thing that has changed is the data. The ONLY thing I can think of is that maybe it's an OOM error, just cleverly disguised. However, I'm running my data on 8 processors with 8 Gb each. Should be enough for a 30 Gb fastq file, no? After this try, I'll try going back to 1.3.1 or earlier.

                  cjp, I've been doing the -g option on all my data, and it's never not worked. I'm trying it without the -g argument, however. I'll let you know in a little bit if it works.

                  Comment


                  • #10
                    Still nothing.. Tried changing the annotation file from refflat to ensembl, tried removing the '-g' option, -Q (even though I don't see how that would work at all), and it's impossible for there to not be any unaligned transcripts with 164672067 49-bp reads. From my experience, only about half of the reads align anyway, so the probability of nothing aligning would be 2^(-8236033). Maybe I'll try using an earlier version or something, unless someone has any other suggestions?

                    Comment


                    • #11
                      -Q is usually for SOLiD colour-space.

                      When debugging failed TopHat runs, you can also try to run the individual commands from the command line yourself.

                      In the logs/ sub-directory of your output directory there should be a file called run.log which shows the commands that TopHat runs. There are also other log files in there - look to see if you can find errors (especially in the newest one or two files). Otherwise try running each command by itself from the directory you started TopHat in and see if and where empty files are made. The tmp stuff needed to do this is usually kept if a run fails.

                      Chris

                      Comment


                      • #12
                        Originally posted by cjp View Post
                        -Q is usually for SOLiD colour-space.

                        When debugging failed TopHat runs, you can also try to run the individual commands from the command line yourself.

                        In the logs/ sub-directory of your output directory there should be a file called run.log which shows the commands that TopHat runs. There are also other log files in there - look to see if you can find errors (especially in the newest one or two files). Otherwise try running each command by itself from the directory you started TopHat in and see if and where empty files are made. The tmp stuff needed to do this is usually kept if a run fails.

                        Chris
                        Excellent suggestion. I will try it once I get out of my Tryptophan coma

                        Comment


                        • #13
                          And once our computing cluster is up. Boo, maintenance!

                          Comment


                          • #14
                            Wowee. So I evidently was using the wrong index. I thought hg19_c was complete (it's prebuilt from the Bowtie website). I was supposed to use hg19.

                            Originally posted by cjp View Post
                            -Q is usually for SOLiD colour-space.

                            When debugging failed TopHat runs, you can also try to run the individual commands from the command line yourself.

                            In the logs/ sub-directory of your output directory there should be a file called run.log which shows the commands that TopHat runs. There are also other log files in there - look to see if you can find errors (especially in the newest one or two files). Otherwise try running each command by itself from the directory you started TopHat in and see if and where empty files are made. The tmp stuff needed to do this is usually kept if a run fails.

                            Chris
                            However, Chris, I learned a lot about debugging trying to find this error. Thank you greatly for the suggestion!

                            Comment

                            Latest Articles

                            Collapse

                            • seqadmin
                              Non-Coding RNA Research and Technologies
                              by seqadmin




                              Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

                              Nobel Prize for MicroRNA Discovery
                              This week,...
                              10-07-2024, 08:07 AM
                            • seqadmin
                              Recent Developments in Metagenomics
                              by seqadmin





                              Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
                              09-23-2024, 06:35 AM

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by seqadmin, 10-02-2024, 04:51 AM
                            0 responses
                            104 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 10-01-2024, 07:10 AM
                            0 responses
                            112 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 09-30-2024, 08:33 AM
                            1 response
                            116 views
                            0 likes
                            Last Post EmiTom
                            by EmiTom
                             
                            Started by seqadmin, 09-26-2024, 12:57 PM
                            0 responses
                            23 views
                            0 likes
                            Last Post seqadmin  
                            Working...
                            X