Unconfigured Ad

**sindrle** · 04-24-2014, 03:49 AM

I have the same problem, did you find an answer?

**N00bSeq** · 04-24-2014, 06:14 AM

I am also curious about this. Running cuffcompare on my cuffmerge output results in these numbers:

Code:

#     Query mRNAs :  865356 in  787440 loci  (97791 multi-exon transcripts)
#            (16955 multi-transcript loci, ~1.1 transcripts per locus)
# Reference mRNAs :   95598 in   36914 loci  (82214 multi-exon)
# Super-loci w/ reference transcripts:    33985
#--------------------|   Sn   |  Sp   |  fSn |  fSp
        Base level:      99.6     8.5     -       -
        Exon level:     110.6    35.2   100.0    36.2
      Intron level:      99.2    96.9   100.0    98.8
Intron chain level:      80.3    67.5   100.0   100.0
  Transcript level:      74.7     8.3    70.2     7.8
       Locus level:      99.1     4.6    99.6     4.6

     Matching intron chains:   66045
              Matching loci:   36587

          Missed exons:    1293/351192  (  0.4%)
           Novel exons:  755700/1102464 ( 68.5%)
        Missed introns:    1755/243253  (  0.7%)
         Novel introns:    1588/249173  (  0.6%)
           Missed loci:     157/36914   (  0.4%)
            Novel loci:  747780/787440  ( 95.0%)

Reference used was Ensembl mouse from igenomes. The options used for cuffcompare were the following:

Code:

~/cufflinks-2.2.0.Linux_x86_64/cuffcompare -s ~/igenomes/Mus_musculus/Ensembl/NCBIM37/Sequence/Bowtie2Index/genome.fa -r ~/igenomes/Mus_musculus/Ensembl/NCBIM37/Annotation/Genes/genes.gtf -p Ensembl ~/cuffmerge/merged.gtf

In addition, I got the following class codes:

Code:

grep -v "gene_name" Ensembl.combined.gtf | awk '{print $18}' | sort | uniq -c

 739578 "u";
grep "gene_name" Ensembl.combined.gtf | awk '{print $22}' | sort | uniq -c

 555263 "=";
 380367 "j";
   2684 "o";
  12920 "x";

739578 novel transfrags seems a bit much to me.

**dhir_kumar** · 09-10-2014, 04:54 PM

Cuffcompare: Low specificity of transcript assembly

Hi,
Cuffcompare introductory page at http://cufflinks.cbcb.umd.edu/manual.html states the following.

" Cuffcompare produces the following output files:
1) <outprefix>.stats

Cuffcompare reports various statistics related to the "accuracy" of the transcripts in each sample when compared to the reference annotation data. The typical gene finding measures of "sensitivity" and "specificity" (as defined in Burset, M., Guigó, R. : Evaluation of gene structure prediction programs (1996) Genomics, 34 (3), pp. 353-367. doi: 10.1006/geno.1996.0298) are calculated at various levels (nucleotide, exon, intron, transcript, gene) for each input file and reported in this file."

As highlighted in the mentioned 1996 reference's figure 1(Attached) it appears that exons metioned in the annotation GTF would be considered as True prositives and any novel transcript/exon would be considered False positives while calculating sensitivity and specificity by cuffcompare. This explains why we have low specificity measures for whole transcriptome assembly which might have a large number of novel transcripts.

It seems that we can ignore specificity measure for assembly from whole RNA samples. However, to increase specificity FPKM fileters might be effective.

Attached Files

Cuffcumpare_reference.jpg (50.0 KB, 6 views)

Topics	Statistics	Last Post
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity by SEQadmin2 Started by SEQadmin2, 07-02-2026, 11:08 AM	0 responses 18 views 0 reactions	Last Post by SEQadmin2 07-02-2026, 11:08 AM
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, 06-30-2026, 05:37 AM	0 responses 19 views 0 reactions	Last Post by SEQadmin2 06-30-2026, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 21 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 54 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM

Unconfigured Ad

Cuffcompare stats: High sensitivity and Low specificity....... what does it mean?

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News