Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • gwilymh
    Member
    • Dec 2011
    • 72

    CuffDiff cannot opn BAM file made in TopHat

    I am running a number of Tuxedo pipelines, starting with TopHat and ending in CuffDiff. Most of these pipelines work fine. However, I encountered the following error in CuffDiff for one of my pipelines:

    open: No such file or directory
    File /gpfs/group1/f/flyinv/Outputs_TopHat/transcriptomeSequence_exonCDS/AR_DM1005_Female/accepted_hits.bam doesn't appear to be a valid BAM file, trying SAM...
    Error: cannot open alignment file /gpfs/group1/f/flyinv/Outputs_TopHat/transcriptomeSequence_exonCDS/AR_DM1005_Female/accepted_hits.bam for reading

    I have checked over the input file carefully, and cannot find any errors in the file paths.

    The accepted_hits.bam file was made in TopHat. I have tried remaking it in TopHat and get the same result. I have looked over it in SamTools and can see no obvious errors.

    The only thing that separates this pipelines from my others is that I am using a gff file (made in TopHat from a Flybase GFF3 file) that contains exon and CDS data. Other GFF files with different data combinations seem to work (e.g. exons only, CDS + UTR). Has anyone encountered a similar problem? Is this a bug in CuffDiff?

    My full CuffDiff and CuffMerge pipeline is below.



    module load tophat/2.0.9
    module load bowtie/2.1.0
    module load cufflinks/2.1.1
    cuffmerge \
    -o /gpfs/group1/f/flyinv/Outputs_CuffMerge/exonCDS/Test_ARDM1005 \
    -g /gpfs/group1/f/flyinv/working_index/ExonCDS_.gff \
    -s /gpfs/group1/f/flyinv/RNASeq/Dpse3_0.fasta \
    /gpfs/group1/f/flyinv/Outputs_CuffMerge/exonCDS/CuffLinks_Output_GTF_List_ARDM1005.txt
    cuffdiff \
    -o "/gpfs/group1/f/flyinv/Outputs_CuffDiff/exonCDS/Test_ARDM1005" \
    -L AR_DM1005_Male,AR_DM1005_Female \
    --total-hits-norm \
    --frag-bias-correct /gpfs/group1/f/flyinv/working_index/Dpse3_0_1.fa \
    --multi-read-correct \
    --library-norm-method classic-fpkm \
    /gpfs/group1/f/flyinv/Outputs_CuffMerge/exonCDS/Test_ARDM1005/merged.gtf \
    /gpfs/group1/f/flyinv/Outputs_TopHat/transcriptomeSequence_ExonCDS/AR_DM1005_Male/accepted_hits.bam \
    /gpfs/group1/f/flyinv/Outputs_TopHat/transcriptomeSequence_exonCDS/AR_DM1005_Female/accepted_hits.bam
  • tmarria
    Junior Member
    • Jun 2012
    • 1

    #2
    Help!! Cuffdiff cannot open alignment file for reading

    Hi gwilymh!

    I am also getting the exact same error message that you've gotten when trying to run cuffdiff

    open: No such file or directory
    File /Volumes/Data/2013-08-20_Gp_Cell_Cycle_Transcriptome/Olson_Samples_Run2/SampleXp1/accepted_hits.bam doesn't appear to be a valid BAM file, trying SAM...
    Error: cannot open alignment file /Volumes/Data/2013-08-20_Gp_Cell_Cycle_Transcriptome/Olson_Samples_Run2/SampleXp1/accepted_hits.bam for reading

    My accepted_hits.bam file was generated in Tophat v2.0.8. I used Cufflinks v2.1.1 but got an error when I tried to use cuffmerge, so I used a slightly older version of Cuffmerge (v2.0.2) to merge my transcript.gtf files generated by Cufflinks (I can provide this error message later if anyone is interested).

    I have successfully used previous versions of the Tuxedo software to analyze my RNA-seq data (Tophat v2.0.8, Bowtie 2.1.0.0, and Cufflinks v2.0.2), so maybe it is a bug with the new version of Cufflinks?

    I did notice that the version of samtools has changed to 0.1.19.0, where the analyses that I've done with the previous Tuxedo software used samtools 0.1.18.0. Could it also be a samtools issue?


    If anyone else is getting this error message, please help us!!!

    Comment

    • dpryan
      Devon Ryan
      • Jul 2011
      • 3478

      #3
      The first thing to check is if you can "samtools view" the accepted_hits.bam file. If not, it was likely corrupted at some point. gwilymh's problem might be a bug in cuffdiff, it'd be good to make stripped down BAM and annotation files and see if the error persists.

      I should note that the initial "open: No such file or directory" error would suggest that the path for the accepted_hits.bam file is simply being misspecified.
      Last edited by dpryan; 09-12-2013, 03:05 AM. Reason: I should really proof read before posting!

      Comment

      Latest Articles

      Collapse

      • SEQadmin2
        From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
        by SEQadmin2


        Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


        The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
        ...
        Yesterday, 10:05 AM
      • SEQadmin2
        Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
        by SEQadmin2


        With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


        Introduction

        Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
        05-22-2026, 06:42 AM
      • SEQadmin2
        Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
        by SEQadmin2

        Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


        Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
        05-06-2026, 09:04 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by SEQadmin2, Yesterday, 12:03 PM
      0 responses
      19 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, Yesterday, 11:40 AM
      0 responses
      14 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 05-28-2026, 11:40 AM
      0 responses
      29 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 05-26-2026, 10:12 AM
      0 responses
      31 views
      0 reactions
      Last Post SEQadmin2  
      Working...