Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • qiudao
    Member
    • May 2008
    • 23

    cufflinks questions

    Dear Friends,

    I am new to RNA-Seq. I have tried to use cufflinks' cuffdiff to compare two aligned data to get differentially expressed transcripts/exons ...

    However if I used:
    cuffdiff -o cuffdiff Homo_sapiens.GRCh37.62.gtf ./read1/accepted_hits.bam ./read2/accepted_hits.bam

    in the "gene_exp.diff", all the "ln(fold change)" is 0.

    If I used assembled "merged.gtf" instead of the above "Homo_sapiens.GRch37.62.gtf", I can get ln(fold change) and p values, everything looks fine, but I can not get annotate gene symbols for each transcript.

    Could someone let me know what's wrong I was doing for the first analysis?
    And
    How could I also obtained the genes annotation for second anlaysis using "merged.gtf"?

    It seems something is wrong. Did I used right gene annotation gtf file? (see below)


    Another question is: what is the human annotation.gtf generally used for cuffdiff? I downloaded it from ensemble (see above), is this the right one?. does it have an equivalent from ucsc genome browser? I had expected cufflinks manual would have specially pointed out where to download this for an example, unfortunately, it has no such info. Maybe in the future it can be added.
    Thanks.




    Thank you very much.



    for your information, I also did cuffcompare between annotation I used an merged.gtf:
    #==========================================
    # Cuffcompare v1.0.3 | Command line was:
    #cuffcompare -o ./cuffcompare -s ./hg18.fa -r ./Homo_sapiens.GRCh37.62.gtf ./merged.gtf
    #

    #= Summary for dataset: ./merge/merged.gtf :
    # Query mRNAs : 65079 in 61573 loci (27950 multi-exon transcripts)
    # (2938 multi-transcript loci, ~1.1 transcripts per locus)
    # Reference mRNAs : 166159 in 50490 loci (145130 multi-exon)
    # Corresponding super-loci: 0
    #--------------------| Sn | Sp | fSn | fSp
    Base level: 0.0 0.0 - -
    Exon level: 0.0 0.0 0.0 0.0
    Intron level: 0.0 0.0 0.0 0.0
    Intron chain level: 0.0 0.0 0.0 0.0
    Transcript level: 0.0 0.0 0.0 0.0
    Locus level: 0.0 0.0 0.0 0.0
    Missed exons: 509677/509677 (100.0%)
    Wrong exons: 126588/126588 (100.0%)
    Missed introns: 330351/330351 (100.0%)
    Wrong introns: 62961/62961 (100.0%)
    Missed loci: 0/50490 ( 0.0%)
    Wrong loci: 36503/61573 ( 59.3%)

    Total union super-loci across all input datasets: 61573
    #=================================================
    Last edited by qiudao; 06-05-2011, 09:47 PM. Reason: add another question
  • Morten
    Junior Member
    • Jun 2010
    • 9

    #2
    Hi.
    I don't know if you managed to solve your problem.
    Anyway, it turned out I had the same problem and your post helped me correct it.
    I was using the same annotation file af you did at first and also got all "ln(fold change)" 0 in "gene_exp.diff".

    I downloaded the UCSC annotation file here:


    I couldn't find it on the UCSC website. (It must be there somehere, though...)
    The annotation file I am linking to worked well for me.

    -Morten

    Comment

    Latest Articles

    Collapse

    • GATTACAT
      Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
      by GATTACAT
      Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
      07-01-2026, 11:43 AM
    • SEQadmin2
      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
      by SEQadmin2


      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

      Here are nine questions we think about, in roughly the order they matter, before...
      06-18-2026, 07:11 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by SEQadmin2, 07-02-2026, 11:08 AM
    0 responses
    11 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-30-2026, 05:37 AM
    0 responses
    13 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-26-2026, 11:10 AM
    0 responses
    20 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-17-2026, 06:09 AM
    0 responses
    54 views
    0 reactions
    Last Post SEQadmin2  
    Working...