Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • TopHat protocol error at cuffdiff

    Hi all - I've been trying to work through the TopHat protocol from the March issue of Nature Protocols but ran into an issue at step #5 (Running Cuffdiff). The protocol describes only two labels but lists six merged files. Any ideas of how to run the protocol with only two labels on all six datasets?

  • #2
    The 6 bam files are broken up into two lists of three (each list is comma separated). So the label is for the list of files. The command they list in the paper looks correct. If running it, or something like it, produces different results for you then it'd be helpful for you to provide the command and an example of the output.

    Comment


    • #3
      Thanks for the quick response, here is the command and output:
      [herndon@usda02 rnaseqexp]$ cuffdiff -o diff_out -b genome.fa -p 8 -L C1,C2 -u merged_asm/merged.gtf ./C1_R1_thout/accepted_hits.bam, ./C1_R2_thout/accepted_hits.bam, ./C1_R1_thout/accepted_hits.bam, ./C1_R3_thout/accepted_hits.bam, C2_R1_thout/accepted_hits.bam, ./C2_R2_thout/accepted_hits.bam, ./C2_R3_thout/accepted_hits.bam
      cuffdiff: /lib64/libz.so.1: no version information available (required by cuffdiff)
      You are using Cufflinks v1.3.0, which is the most recent release.
      Error: number of labels must match number of conditions

      Comment


      • #4
        You have C1_R1_thout twice and spaces after the commas, as well as only one list once you fix all of that.

        What you want is probably something like this:
        Code:
        cuffdiff -o diff_out -b genome.fa -p 8 -L C1,C2 -u merged_asm/merged.gtf C1_R1_thout/accepted_hits.bam,C1_R2_thout/accepted_hits.bam,C1_R3_thout/accepted_hits.bam C2_R1_thout/accepted_hits.bam,C2_R2_thout/accepted_hits.bam,C2_R3_thout/accepted_hits.bam

        Comment


        • #5
          Yep - that worked. Aside from the misplaced commas, I was running the program from the directory where the tophout ouptut directories were located.

          Thanks for the help!

          Comment


          • #6
            Cool, glad I could help. Best of luck with the analyses.

            Comment


            • #7
              not reading BAM file

              I'm using the same Nature Protocol on my samples and am running into a different problem at the cuffdiff step. The program is not reading my accepted_hits.bam files. I've tried converted to .sam and still no luck. .sam file looks ok to my eyes. And .bam file worked for cufflinks step. Any ideas?

              $ cuffdiff -o ./results-NCBIref082011_cuffdiff/cuffdiff/ -b /Users/Fontana/Genomes/Drosophila_melanogaster/NCBI/build5.3/Sequence/Bowtie2Index/genome.fa -p 2 -L Mid,nonMid -u ./results-NCBIref082011_cuffdiff/merged_asm/merged.gtf \ ./results-NCBIref082011_JF001g1/accepted_hits.bam,./results-NCBIref082011_JF003g1/accepted_hits.bam \ ./results-NCBIref082011_JF002g1/accepted_hits.bam
              You are using Cufflinks v2.0.0, which is the most recent release.
              open: No such file or directory
              File ./results-NCBIref082011_JF001g1/accepted_hits.bam doesn't appear to be a valid BAM file, trying SAM...
              Error: cannot open alignment file ./results-NCBIref082011_JF001g1/accepted_hits.bam for reading
              -Note : also tried running with some of my older BAM files that I know are good. Still spitting out same error
              Last edited by jfofly; 06-01-2012, 05:57 AM.

              Comment


              • #8
                Perhaps it's because I'm seeing this on my phone, but you appear to have a number of "\ " instances in your command. I imagine that you don't want to escape those spaces.

                Comment


                • #9
                  Originally posted by dpryan View Post
                  Perhaps it's because I'm seeing this on my phone, but you appear to have a number of "\ " instances in your command. I imagine that you don't want to escape those spaces.
                  Thanks for the reply. Those "\ " characters are written in the protocol before the samples for each condition used. Am I interpreting it wrongly? I'm unfamiliar with what that character would be doing. Should I be leaving it out?

                  Comment


                  • #10
                    Yeah, maybe they should have mentioned this detail in the paper since I lot of the readers following it won't be familiar with the command line. The backslashes are used to allow you to split a command over multiple lines. So, you need to hit return/enter after each one. Alternatively, you can leave them out and just put everything on one line. The version with the backslashes increases readability, but will lead to problems like the one you ran into for those unaccustomed. Try either of the following and see if that fixes your problems:

                    All on one line:
                    Code:
                    cuffdiff -o ./results-NCBIref082011_cuffdiff/cuffdiff/ -b /Users/Fontana/Genomes/Drosophila_melanogaster/NCBI/build5.3/Sequence/Bowtie2Index/genome.fa -p 2 -L Mid,nonMid -u ./results-NCBIref082011_cuffdiff/merged_asm/merged.gtf ./results-NCBIref082011_JF001g1/accepted_hits.bam,./results-NCBIref082011_JF003g1/accepted_hits.bam ./results-NCBIref082011_JF002g1/accepted_hits.bam
                    Split across lines:
                    Code:
                    cuffdiff -o ./results-NCBIref082011_cuffdiff/cuffdiff/ -b /Users/Fontana/Genomes/Drosophila_melanogaster/NCBI/build5.3/Sequence/Bowtie2Index/genome.fa -p 2 -L Mid,nonMid -u ./results-NCBIref082011_cuffdiff/merged_asm/merged.gtf \
                    ./results-NCBIref082011_JF001g1/accepted_hits.bam,./results-NCBIref082011_JF003g1/accepted_hits.bam \
                    ./results-NCBIref082011_JF002g1/accepted_hits.bam

                    Comment


                    • #11
                      That did it. I appreciate the help!

                      Comment


                      • #12
                        why are there '.' in front of '/' in bam files?

                        Comment


                        • #13
                          You don't need the "./" at all. If you just used "/file.bam", though, then "file.bam" would need to be in the root directory, which should never occur. "./" just mean "the current working directory".

                          Comment


                          • #14
                            Originally posted by AndrewMartins
                            You don't need the "./" at all. If you just used "/file.bam", though, then "file.bam" would need to be in the root directory, which should never occur. "./" just mean "the current working directory".
                            This has to be the most inept attempt at a spam bot I've ever seen.

                            Comment

                            Latest Articles

                            Collapse

                            • seqadmin
                              Non-Coding RNA Research and Technologies
                              by seqadmin




                              Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

                              Nobel Prize for MicroRNA Discovery
                              This week,...
                              10-07-2024, 08:07 AM
                            • seqadmin
                              Recent Developments in Metagenomics
                              by seqadmin





                              Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
                              09-23-2024, 06:35 AM

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by seqadmin, Today, 06:35 AM
                            0 responses
                            6 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, Yesterday, 02:44 PM
                            0 responses
                            7 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 10-11-2024, 06:55 AM
                            0 responses
                            14 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 10-02-2024, 04:51 AM
                            0 responses
                            110 views
                            0 likes
                            Last Post seqadmin  
                            Working...
                            X