Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Baoqing
    Member
    • Jan 2013
    • 24

    cuffmerge error

    Hi, All

    I was trying to run the cuffmerge for my two samples. However, there is warning message point to the mitochondrial genome, however, i could not figure out the exact problem, could you please give some hints? I directly download the Saccharomyces cerevisiae/Ensembl/EF4, ref genome.

    Thank you very much!

    bq@bq-VirtualBox:~/Desktop/rnaseq/trimmed$ cuffmerge -g genes.gtf -s genome.fa -p 4 assemblies.txt

    [Wed Jun 5 22:46:37 2013] Beginning transcriptome assembly merge
    -------------------------------------------

    [Wed Jun 5 22:46:37 2013] Preparing output location ./merged_asm/
    [Wed Jun 5 22:46:37 2013] Converting GTF files to SAM
    [22:46:37] Loading reference annotation.
    [22:46:37] Loading reference annotation.
    [Wed Jun 5 22:46:37 2013] Quantitating transcripts
    You are using Cufflinks v2.1.1, which is the most recent release.
    Command line:
    cufflinks -o ./merged_asm/ -F 0.05 -g genes.gtf -q --overhang-tolerance 200 --library-type=transfrags -A 0.0 --min-frags-per-transfrag 0 --no-5-extend -p 4 ./merged_asm/tmp/mergeSam_fileDalcVi
    [bam_header_read] EOF marker is absent.
    [bam_header_read] invalid BAM binary header (this is not a BAM file).
    File ./merged_asm/tmp/mergeSam_fileDalcVi doesn't appear to be a valid BAM file, trying SAM...
    [22:46:37] Loading reference annotation.
    [22:46:38] Inspecting reads and determining fragment length distribution.
    Processed 2290 loci.
    > Map Properties:
    > Normalized Map Mass: 6966.00
    > Raw Map Mass: 6966.00
    > Fragment Length Distribution: Truncated Gaussian (default)
    > Default Mean: 200
    > Default Std Dev: 80
    [22:46:38] Assembling transcripts and estimating abundances.
    Processed 2290 loci.
    [Wed Jun 5 22:46:43 2013] Comparing against reference file genes.gtf
    You are using Cufflinks v2.1.1, which is the most recent release.
    Warning: couldn't find fasta record for 'Mito'!
    [Wed Jun 5 22:46:44 2013] Comparing against reference file genes.gtf
    You are using Cufflinks v2.1.1, which is the most recent release.
    Warning: couldn't find fasta record for 'Mito'!
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    The fasta sequence for "Mito" is missing from the genome.fa file.

    Comment

    • Baoqing
      Member
      • Jan 2013
      • 24

      #3
      Hi, GenoMax

      I should have mentioned that i did some initial troubleshooting, you are right that "Mito" is missing in the genome.fa file. It was named as "MT" instead. So I renamed this file, and reran it, the same error occurred again. Maybe something else has to be done besides renaming, or it is a totally different thing?

      Best,

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        So the chromosome names in your genes.gtf are not matching what you had in the original reference file (your alignments inherited those names).

        Compare these two commands:
        Code:
        awk -F "\t" '{print $1}' genes.gtf | uniq
        and

        Code:
        cat genome.fa | grep ">"

        If that is true then you could rename the "Mito" from the genes.gtf to "MT" and then try a re-run.
        Last edited by GenoMax; 06-06-2013, 08:53 AM.

        Comment

        • Baoqing
          Member
          • Jan 2013
          • 24

          #5
          Thank you so much! The error does go away when I rename the "mito" to "MT". It just never came to me that the two files' name are inconsistent. Really appreciate your help!

          Baoqing

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM
          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          36 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-09-2026, 11:58 AM
          0 responses
          99 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-05-2026, 10:09 AM
          0 responses
          120 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-04-2026, 08:59 AM
          0 responses
          113 views
          0 reactions
          Last Post SEQadmin2  
          Working...