Seqanswers Leaderboard Ad



No announcement yet.
  • Filter
  • Time
  • Show
Clear All
new posts

  • CuffDiff cannot opn BAM file made in TopHat

    I am running a number of Tuxedo pipelines, starting with TopHat and ending in CuffDiff. Most of these pipelines work fine. However, I encountered the following error in CuffDiff for one of my pipelines:

    open: No such file or directory
    File /gpfs/group1/f/flyinv/Outputs_TopHat/transcriptomeSequence_exonCDS/AR_DM1005_Female/accepted_hits.bam doesn't appear to be a valid BAM file, trying SAM...
    Error: cannot open alignment file /gpfs/group1/f/flyinv/Outputs_TopHat/transcriptomeSequence_exonCDS/AR_DM1005_Female/accepted_hits.bam for reading

    I have checked over the input file carefully, and cannot find any errors in the file paths.

    The accepted_hits.bam file was made in TopHat. I have tried remaking it in TopHat and get the same result. I have looked over it in SamTools and can see no obvious errors.

    The only thing that separates this pipelines from my others is that I am using a gff file (made in TopHat from a Flybase GFF3 file) that contains exon and CDS data. Other GFF files with different data combinations seem to work (e.g. exons only, CDS + UTR). Has anyone encountered a similar problem? Is this a bug in CuffDiff?

    My full CuffDiff and CuffMerge pipeline is below.

    module load tophat/2.0.9
    module load bowtie/2.1.0
    module load cufflinks/2.1.1
    cuffmerge \
    -o /gpfs/group1/f/flyinv/Outputs_CuffMerge/exonCDS/Test_ARDM1005 \
    -g /gpfs/group1/f/flyinv/working_index/ExonCDS_.gff \
    -s /gpfs/group1/f/flyinv/RNASeq/Dpse3_0.fasta \
    cuffdiff \
    -o "/gpfs/group1/f/flyinv/Outputs_CuffDiff/exonCDS/Test_ARDM1005" \
    -L AR_DM1005_Male,AR_DM1005_Female \
    --total-hits-norm \
    --frag-bias-correct /gpfs/group1/f/flyinv/working_index/Dpse3_0_1.fa \
    --multi-read-correct \
    --library-norm-method classic-fpkm \
    /gpfs/group1/f/flyinv/Outputs_CuffMerge/exonCDS/Test_ARDM1005/merged.gtf \
    /gpfs/group1/f/flyinv/Outputs_TopHat/transcriptomeSequence_ExonCDS/AR_DM1005_Male/accepted_hits.bam \

  • #2
    Help!! Cuffdiff cannot open alignment file for reading

    Hi gwilymh!

    I am also getting the exact same error message that you've gotten when trying to run cuffdiff

    open: No such file or directory
    File /Volumes/Data/2013-08-20_Gp_Cell_Cycle_Transcriptome/Olson_Samples_Run2/SampleXp1/accepted_hits.bam doesn't appear to be a valid BAM file, trying SAM...
    Error: cannot open alignment file /Volumes/Data/2013-08-20_Gp_Cell_Cycle_Transcriptome/Olson_Samples_Run2/SampleXp1/accepted_hits.bam for reading

    My accepted_hits.bam file was generated in Tophat v2.0.8. I used Cufflinks v2.1.1 but got an error when I tried to use cuffmerge, so I used a slightly older version of Cuffmerge (v2.0.2) to merge my transcript.gtf files generated by Cufflinks (I can provide this error message later if anyone is interested).

    I have successfully used previous versions of the Tuxedo software to analyze my RNA-seq data (Tophat v2.0.8, Bowtie, and Cufflinks v2.0.2), so maybe it is a bug with the new version of Cufflinks?

    I did notice that the version of samtools has changed to, where the analyses that I've done with the previous Tuxedo software used samtools Could it also be a samtools issue?

    If anyone else is getting this error message, please help us!!!


    • #3
      The first thing to check is if you can "samtools view" the accepted_hits.bam file. If not, it was likely corrupted at some point. gwilymh's problem might be a bug in cuffdiff, it'd be good to make stripped down BAM and annotation files and see if the error persists.

      I should note that the initial "open: No such file or directory" error would suggest that the path for the accepted_hits.bam file is simply being misspecified.
      Last edited by dpryan; 09-12-2013, 03:05 AM. Reason: I should really proof read before posting!


      Latest Articles


      • seqadmin
        Exploring the Dynamics of the Tumor Microenvironment
        by seqadmin

        The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
        07-08-2024, 03:19 PM
      • seqadmin
        Exploring Human Diversity Through Large-Scale Omics
        by seqadmin

        In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
        06-25-2024, 06:43 AM





      Topics Statistics Last Post
      Started by seqadmin, Today, 06:53 AM
      0 responses
      Last Post seqadmin  
      Started by seqadmin, 07-10-2024, 07:30 AM
      0 responses
      Last Post seqadmin  
      Started by seqadmin, 07-03-2024, 09:45 AM
      0 responses
      Last Post seqadmin  
      Started by seqadmin, 07-03-2024, 08:54 AM
      0 responses
      Last Post seqadmin