  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #16
    I did not pay close attention to your original question (i.e. you are getting the error while running tophat itself and not the actual script). My fault.

    Let us start over.

    Is caref_ncbiall available in the same directory that you are running tophat from? If not, can you provide the full path to that file when you execute tophat? The same applies to cp04.fastq and caref_seq.gff. Also make certain that your gff file matches the format specification noted here: http://cufflinks.cbcb.umd.edu/gff.html
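As a rough first pass at the format check mentioned above, the sketch below flags gff lines that do not have the nine tab-separated columns the format expects. The function name check_gff_columns is made up for illustration and is not part of TopHat or Cufflinks; it only counts columns and does not validate what is in them.

```shell
# Hypothetical helper (not a TopHat/Cufflinks tool): report data lines in a
# GFF file that do not have exactly 9 tab-separated columns.
# Exit status is 0 when every data line has 9 columns, 1 otherwise.
check_gff_columns() {
    awk -F '\t' '
        /^##FASTA/ { exit }               # stop at an embedded FASTA section, if any
        /^#/ || NF == 0 { next }          # skip comments, directives and empty lines
        NF != 9 { printf "line %d: %d columns\n", NR, NF; bad = 1 }
        END { exit bad + 0 }
    ' "$1"
}
```

Usage, with the file name from this thread: check_gff_columns caref_seq.gff && echo "column count looks OK"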
    Last edited by GenoMax; 04-30-2014, 04:00 PM.

    Comment

    • varshacp
      Member
      • Aug 2013
      • 13

      #17
      Hi GenoMax

      All the files are in the same directory. I also made sure that the names are the same in the gff and genome files.


      Thank you

      Varsha

      Comment

      • varshacp
        Member
        • Aug 2013
        • 13

        #18
        Hi

        I forgot to mention that the fastq file is also in the same directory. The gff file and genome sequences were downloaded from NCBI (NCBI has a separate fasta file for each chromosome, and all the unplaced scaffolds are in one fasta file). I concatenated these files to make the genome file and renamed it as per the gff file. The same gff file works with another genome sequence file that does not have the unplaced sequences.
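Because the genome file here was made by concatenating several NCBI fasta files, one thing that may be worth ruling out is a sequence name in column 1 of the gff that has no matching fasta header. The sketch below is only a sketch under stated assumptions: missing_seqids is a hypothetical helper, and it assumes a fasta sequence name is the header text up to the first whitespace.

```shell
# Hypothetical helper: print sequence names that appear in column 1 of the
# GFF but have no matching header in the FASTA file.
# Assumption: the FASTA name is the header text up to the first whitespace.
missing_seqids() {
    tmpdir=$(mktemp -d) || return 2
    # Names from FASTA headers, with the leading ">" removed.
    awk '/^>/ { sub(/^>/, "", $1); print $1 }' "$1" \
        | LC_ALL=C sort -u > "$tmpdir/fasta.ids"
    # Names from column 1 of the GFF data lines (comment lines skipped).
    awk -F '\t' '!/^#/ && NF > 1 { print $1 }' "$2" \
        | LC_ALL=C sort -u > "$tmpdir/gff.ids"
    # Lines present only in the GFF list.
    LC_ALL=C comm -23 "$tmpdir/gff.ids" "$tmpdir/fasta.ids"
    rm -rf "$tmpdir"
}
```

Usage, with the file names from this thread: missing_seqids caref_ncbiall.fa caref_seq.gff. No output means every name used in the gff was found among the fasta headers.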


        Thank you

        Varsha

        Comment

        • GenoMax
          Senior Member
          • Feb 2008
          • 7142

          #19
          TopHat is picky about the order of options on the command line. Can you try the following:

          Code:
          $ tophat -o cp04_thout5 -p 2 -G caref_seq.gff caref_ncbiall cp04.fastq
          Let me also verify that the basename for your genome index files is "caref_ncbiall", that is, there are several files (that make up the index) with that prefix?
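One way to check the basename is to test for the six files that a small Bowtie2 index normally consists of. check_bt2_index below is a hypothetical helper, not a Bowtie2 or TopHat command, and it does not cover large indexes, which use the .bt2l extension instead.

```shell
# Hypothetical helper: verify that the six files of a small Bowtie2 index
# exist and are non-empty for the given basename.
# Large indexes (.bt2l) are not handled by this sketch.
check_bt2_index() {
    base=$1
    status=0
    for suffix in 1.bt2 2.bt2 3.bt2 4.bt2 rev.1.bt2 rev.2.bt2; do
        if [ ! -s "$base.$suffix" ]; then
            echo "missing or empty: $base.$suffix"
            status=1
        fi
    done
    return $status
}
```

Usage, with the basename from this thread: check_bt2_index caref_ncbiall && echo "index files found"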

          Comment

          • GenoMax
            Senior Member
            • Feb 2008
            • 7142

            #20
            Originally posted by varshacp
            Hi

            I forgot to mention that the fastq file is also in the same directory. The gff file and genome sequences were downloaded from NCBI (NCBI has a separate fasta file for each chromosome, and all the unplaced scaffolds are in one fasta file). I concatenated these files to make the genome file and renamed it as per the gff file. The same gff file works with another genome sequence file that does not have the unplaced sequences.


            Thank you

            Varsha
            Does this mean that you have not created the "index" files for this combined fasta reference file? You will need to index the reference in order to use tophat. You can build the reference index by following the directions here: http://tophat.cbcb.umd.edu/tutorial.shtml#ref
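For a Bowtie2-based TopHat run, building the index comes down to one bowtie2-build call that takes the fasta file and the desired basename. The sketch below wraps that call in a hypothetical helper, build_index_if_missing, which skips the build when an index file with that basename already exists. It assumes bowtie2-build is on the PATH.

```shell
# Hypothetical helper: build a Bowtie2 index for a reference FASTA unless
# an index with the given basename is already present.
# Assumes bowtie2-build is installed and on the PATH.
build_index_if_missing() {
    ref=$1     # reference FASTA file
    base=$2    # basename (prefix) for the index files
    if [ -s "$base.1.bt2" ] || [ -s "$base.1.bt2l" ]; then
        echo "index already present for $base"
        return 0
    fi
    bowtie2-build "$ref" "$base"
}
```

Usage, with the file names from this thread: build_index_if_missing caref_ncbiall.fa caref_ncbiall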

            Comment

            • varshacp
              Member
              • Aug 2013
              • 13

              #21
              Hi GenoMax

              The genome index was created using the same fasta file and is in the same directory.

              Thanks

              Varsha

              Comment

              • GenoMax
                Senior Member
                • Feb 2008
                • 7142

                #22
                Can you post a listing of the files in this directory?

                Also see my previous post about the order of options. If the genome index is correctly created then give that command line a try.

                Comment

                • varshacp
                  Member
                  • Aug 2013
                  • 13

                  #23
                  Originally posted by GenoMax
                  TopHat is picky about the order of options on the command line. Can you try the following:

                  Code:
                  $ tophat -o cp04_thout5 -p 2 -G caref_seq.gff caref_ncbiall cp04.fastq
                  Let me also verify that the basename for your genome index files is "caref_ncbiall", that is, there are several files (that make up the index) with that prefix?

                  Hi

                  The basename for the index files is caref_ncbiall.

                  Comment

                  • GenoMax
                    Senior Member
                    • Feb 2008
                    • 7142

                    #24
                    Were you able to get tophat working?

                    Comment

                    • varshacp
                      Member
                      • Aug 2013
                      • 13

                      #25
                      Hi GenoMax

                      The index was also built using the same genome sequence file, in the same directory.

                      Comment

                      • GenoMax
                        Senior Member
                        • Feb 2008
                        • 7142

                        #26
                        Are things working now? Or are you still seeing an error?

                        Comment

                        • varshacp
                          Member
                          • Aug 2013
                          • 13

                          #27
                          Hi GenoMax

                          I am still getting the same error.


                          Thank you

                          Comment

                          • GenoMax
                            Senior Member
                            • Feb 2008
                            • 7142

                            #28
                            Varsha: Without seeing a listing of the files (related to this error, e.g. caref_ncbiall) in the directory you are running this from, there is not much further help I can offer.

                            Comment

                            • varshacp
                              Member
                              • Aug 2013
                              • 13

                              #29
                              Hi GenoMax

                              The following is the list of files in the directory from which I am running the tophat command:

                              caref_ncbiall.fa (genome sequence file)
                              caref_ncbiall.1.bt2 (bowtie index files)
                              caref_ncbiall.2.bt2
                              caref_ncbiall.3.bt2
                              caref_ncbiall.4.bt2
                              caref_ncbiall.rev.1.bt2
                              caref_ncbiall.rev.2.bt2
                              cp04.fastq (reads files)
                              caref_seq.gff (genome annotation file)

                              Thank you

                              Kind regards


                              Varsha

                              Comment

                              • varshacp
                                Member
                                • Aug 2013
                                • 13

                                #30
                                Hi

                                I checked the log files and, besides the run.log which I posted earlier, I get the following error in the g2f.log file:

                                terminate called after throwing an instance of 'std::out_of_range'
                                what(): basic_string::substr


                                Please help me understand this.

                                Thank you
                                Varsha

                                Comment
