Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • cufflinks questions

    Dear Friends,

    I am new to RNA-Seq. I have tried to use cufflinks' cuffdiff to compare two aligned data to get differentially expressed transcripts/exons ...

    However if I used:
    cuffdiff -o cuffdiff Homo_sapiens.GRCh37.62.gtf ./read1/accepted_hits.bam ./read2/accepted_hits.bam

    in the "gene_exp.diff", all the "ln(fold change)" is 0.

    If I used assembled "merged.gtf" instead of the above "Homo_sapiens.GRch37.62.gtf", I can get ln(fold change) and p values, everything looks fine, but I can not get annotate gene symbols for each transcript.

    Could someone let me know what's wrong I was doing for the first analysis?
    And
    How could I also obtained the genes annotation for second anlaysis using "merged.gtf"?

    It seems something is wrong. Did I used right gene annotation gtf file? (see below)


    Another question is: what is the human annotation.gtf generally used for cuffdiff? I downloaded it from ensemble (see above), is this the right one?. does it have an equivalent from ucsc genome browser? I had expected cufflinks manual would have specially pointed out where to download this for an example, unfortunately, it has no such info. Maybe in the future it can be added.
    Thanks.




    Thank you very much.



    for your information, I also did cuffcompare between annotation I used an merged.gtf:
    #==========================================
    # Cuffcompare v1.0.3 | Command line was:
    #cuffcompare -o ./cuffcompare -s ./hg18.fa -r ./Homo_sapiens.GRCh37.62.gtf ./merged.gtf
    #

    #= Summary for dataset: ./merge/merged.gtf :
    # Query mRNAs : 65079 in 61573 loci (27950 multi-exon transcripts)
    # (2938 multi-transcript loci, ~1.1 transcripts per locus)
    # Reference mRNAs : 166159 in 50490 loci (145130 multi-exon)
    # Corresponding super-loci: 0
    #--------------------| Sn | Sp | fSn | fSp
    Base level: 0.0 0.0 - -
    Exon level: 0.0 0.0 0.0 0.0
    Intron level: 0.0 0.0 0.0 0.0
    Intron chain level: 0.0 0.0 0.0 0.0
    Transcript level: 0.0 0.0 0.0 0.0
    Locus level: 0.0 0.0 0.0 0.0
    Missed exons: 509677/509677 (100.0%)
    Wrong exons: 126588/126588 (100.0%)
    Missed introns: 330351/330351 (100.0%)
    Wrong introns: 62961/62961 (100.0%)
    Missed loci: 0/50490 ( 0.0%)
    Wrong loci: 36503/61573 ( 59.3%)

    Total union super-loci across all input datasets: 61573
    #=================================================
    Last edited by qiudao; 06-05-2011, 09:47 PM. Reason: add another question

  • #2
    Hi.
    I don't know if you managed to solve your problem.
    Anyway, it turned out I had the same problem and your post helped me correct it.
    I was using the same annotation file af you did at first and also got all "ln(fold change)" 0 in "gene_exp.diff".

    I downloaded the UCSC annotation file here:


    I couldn't find it on the UCSC website. (It must be there somehere, though...)
    The annotation file I am linking to worked well for me.

    -Morten

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Understanding Genetic Influence on Infectious Disease
      by seqadmin




      During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

      Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
      09-09-2024, 10:59 AM
    • seqadmin
      Addressing Off-Target Effects in CRISPR Technologies
      by seqadmin






      The first FDA-approved CRISPR-based therapy marked the transition of therapeutic gene editing from a dream to reality1. CRISPR technologies have streamlined gene editing, and CRISPR screens have become an important approach for identifying genes involved in disease processes2. This technique introduces targeted mutations across numerous genes, enabling large-scale identification of gene functions, interactions, and pathways3. Identifying the full range...
      08-27-2024, 04:44 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Yesterday, 02:44 PM
    0 responses
    8 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 09-06-2024, 08:02 AM
    0 responses
    143 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 09-03-2024, 08:30 AM
    0 responses
    151 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 08-27-2024, 04:40 AM
    0 responses
    158 views
    0 likes
    Last Post seqadmin  
    Working...
    X