Hi all - I've been trying to work through the TopHat protocol from the March issue of Nature Protocols but ran into an issue at step #5 (Running Cuffdiff). The protocol describes only two labels but lists six merged files. Any ideas of how to run the protocol with only two labels on all six datasets?
Unconfigured Ad
Collapse
X
-
The 6 bam files are broken up into two lists of three (each list is comma separated). So the label is for the list of files. The command they list in the paper looks correct. If running it, or something like it, produces different results for you then it'd be helpful for you to provide the command and an example of the output.
-
-
Thanks for the quick response, here is the command and output:
[herndon@usda02 rnaseqexp]$ cuffdiff -o diff_out -b genome.fa -p 8 -L C1,C2 -u merged_asm/merged.gtf ./C1_R1_thout/accepted_hits.bam, ./C1_R2_thout/accepted_hits.bam, ./C1_R1_thout/accepted_hits.bam, ./C1_R3_thout/accepted_hits.bam, C2_R1_thout/accepted_hits.bam, ./C2_R2_thout/accepted_hits.bam, ./C2_R3_thout/accepted_hits.bam
cuffdiff: /lib64/libz.so.1: no version information available (required by cuffdiff)
You are using Cufflinks v1.3.0, which is the most recent release.
Error: number of labels must match number of conditions
Comment
-
-
You have C1_R1_thout twice and spaces after the commas, as well as only one list once you fix all of that.
What you want is probably something like this:
Code:cuffdiff -o diff_out -b genome.fa -p 8 -L C1,C2 -u merged_asm/merged.gtf C1_R1_thout/accepted_hits.bam,C1_R2_thout/accepted_hits.bam,C1_R3_thout/accepted_hits.bam C2_R1_thout/accepted_hits.bam,C2_R2_thout/accepted_hits.bam,C2_R3_thout/accepted_hits.bam
Comment
-
-
not reading BAM file
I'm using the same Nature Protocol on my samples and am running into a different problem at the cuffdiff step. The program is not reading my accepted_hits.bam files. I've tried converted to .sam and still no luck. .sam file looks ok to my eyes. And .bam file worked for cufflinks step. Any ideas?
-Note : also tried running with some of my older BAM files that I know are good. Still spitting out same error$ cuffdiff -o ./results-NCBIref082011_cuffdiff/cuffdiff/ -b /Users/Fontana/Genomes/Drosophila_melanogaster/NCBI/build5.3/Sequence/Bowtie2Index/genome.fa -p 2 -L Mid,nonMid -u ./results-NCBIref082011_cuffdiff/merged_asm/merged.gtf \ ./results-NCBIref082011_JF001g1/accepted_hits.bam,./results-NCBIref082011_JF003g1/accepted_hits.bam \ ./results-NCBIref082011_JF002g1/accepted_hits.bam
You are using Cufflinks v2.0.0, which is the most recent release.
open: No such file or directory
File ./results-NCBIref082011_JF001g1/accepted_hits.bam doesn't appear to be a valid BAM file, trying SAM...
Error: cannot open alignment file ./results-NCBIref082011_JF001g1/accepted_hits.bam for readingLast edited by jfofly; 06-01-2012, 05:57 AM.
Comment
-
-
Thanks for the reply. Those "\ " characters are written in the protocol before the samples for each condition used. Am I interpreting it wrongly? I'm unfamiliar with what that character would be doing. Should I be leaving it out?Originally posted by dpryan View PostPerhaps it's because I'm seeing this on my phone, but you appear to have a number of "\ " instances in your command. I imagine that you don't want to escape those spaces.
Comment
-
-
Yeah, maybe they should have mentioned this detail in the paper since I lot of the readers following it won't be familiar with the command line. The backslashes are used to allow you to split a command over multiple lines. So, you need to hit return/enter after each one. Alternatively, you can leave them out and just put everything on one line. The version with the backslashes increases readability, but will lead to problems like the one you ran into for those unaccustomed. Try either of the following and see if that fixes your problems:
All on one line:
Split across lines:Code:cuffdiff -o ./results-NCBIref082011_cuffdiff/cuffdiff/ -b /Users/Fontana/Genomes/Drosophila_melanogaster/NCBI/build5.3/Sequence/Bowtie2Index/genome.fa -p 2 -L Mid,nonMid -u ./results-NCBIref082011_cuffdiff/merged_asm/merged.gtf ./results-NCBIref082011_JF001g1/accepted_hits.bam,./results-NCBIref082011_JF003g1/accepted_hits.bam ./results-NCBIref082011_JF002g1/accepted_hits.bam
Code:cuffdiff -o ./results-NCBIref082011_cuffdiff/cuffdiff/ -b /Users/Fontana/Genomes/Drosophila_melanogaster/NCBI/build5.3/Sequence/Bowtie2Index/genome.fa -p 2 -L Mid,nonMid -u ./results-NCBIref082011_cuffdiff/merged_asm/merged.gtf \ ./results-NCBIref082011_JF001g1/accepted_hits.bam,./results-NCBIref082011_JF003g1/accepted_hits.bam \ ./results-NCBIref082011_JF002g1/accepted_hits.bam
Comment
-
-
This has to be the most inept attempt at a spam bot I've ever seen.Originally posted by AndrewMartinsYou don't need the "./" at all. If you just used "/file.bam", though, then "file.bam" would need to be in the root directory, which should never occur. "./" just mean "the current working directory".
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.
Here are nine questions we think about, in roughly the order they matter, before...-
Channel: Articles
06-18-2026, 07:11 AM -
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, Yesterday, 11:10 AM
|
0 responses
7 views
0 reactions
|
Last Post
by SEQadmin2
Yesterday, 11:10 AM
|
||
|
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population
by SEQadmin2
Started by SEQadmin2, 06-17-2026, 06:09 AM
|
0 responses
42 views
0 reactions
|
Last Post
by SEQadmin2
06-17-2026, 06:09 AM
|
||
|
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism
by SEQadmin2
Started by SEQadmin2, 06-09-2026, 11:58 AM
|
0 responses
104 views
0 reactions
|
Last Post
by SEQadmin2
06-09-2026, 11:58 AM
|
||
|
Started by SEQadmin2, 06-05-2026, 10:09 AM
|
0 responses
125 views
0 reactions
|
Last Post
by SEQadmin2
06-05-2026, 10:09 AM
|
Comment