  • Cufflinks low FPKMs and other wonders

    Hi all

    I am very new to RNAseq, and while I know Perl and some R, I am not exactly a computer wizard, so please bear with me; it's probably something stupid.
    We have paired end RNAseq data generated from a mouse tissue on Illumina Hiseq 2000, 50 bp, ~180M reads for each of the 4 conditions (both ends).
    We want to do several things. The first is to identify and quantify expressed isoforms (preferably finding new ones as well) and to call differential expression of genes between the conditions. Because of some size/memory constraints, we run each lane as several files and merge the Cufflinks assemblies at the end (is that OK?).

    These are the commands we used:
    tophat command:
    Code:
    tophat-1.3.0.Linux_x86_64/tophat -r 50 .../data/all .../read1_X ..../read2_0
    cufflinks command:
    Code:
    cufflinks-1.0.3.Linux_x86_64/cufflinks -g ..../mm9_refGene ..../accepted_hits.bam
    cuffmerge command:
    Code:
    cuffmerge -s ..../all.fa -g ..../mm9_refGene assemblies.txt
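
For reference, cuffmerge reads the assemblies to merge from a plain text file with one GTF path per line (the assemblies.txt above). A minimal sketch of building that file, assuming each per-file Cufflinks run wrote its transcripts.gtf into its own subdirectory under one root; the directory layout and the helper name write_manifest are illustrative assumptions, not something taken from the commands above:

```python
from pathlib import Path


def write_manifest(run_root, manifest="assemblies.txt", gtf_name="transcripts.gtf"):
    """Write one GTF path per line, the input format cuffmerge expects.

    run_root is a hypothetical directory holding one Cufflinks output
    directory per input file; gtf_name is the assembly file Cufflinks writes.
    """
    # Sort so the manifest is reproducible between runs.
    paths = sorted(str(p) for p in Path(run_root).rglob(gtf_name))
    if not paths:
        raise FileNotFoundError(f"no {gtf_name} found under {run_root}")
    Path(manifest).write_text("".join(p + "\n" for p in paths))
    return paths
```

The resulting file is what would be passed as the last argument of the cuffmerge command.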
    1. After running TopHat + Cufflinks we get very low FPKM values (from 4.96066e-324 to ~32), with FPKM_lo and FPKM_hi being 0 for all of them. This makes no sense to me, but maybe I am absolutely wrong? Can that happen if the TopHat insert size is not accurate? I am asking because we first ran TopHat with -r 200, which was too large, and all insert sizes in the SAM files were 0. We then reran using a newer version of TopHat (1.3.0) with -r 50 (which is smaller than the true value) and used the insert size column to estimate the parameter, which seems to be ~120; this is being processed.

    2. Is there a simple way to get summary data for how many known genes are expressed, and how many known and new isoforms of these genes were identified? Are there novel transcripts (not from known genes), and how many? Are there confidence criteria for these expression values?

    3. Also, at first we did something much more simple-minded: we used a different aligner (mrFAST/mrsFAST) to map the reads to the mouse genome, without using the pairing info, and calculated RPKM values. These have absolutely no relation to the FPKM values from Cufflinks (which we suspect are not right anyway).
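
On question 1, a note on the arithmetic behind -r. TopHat's -r (--mate-inner-dist) is the expected distance between the two mates, not the full fragment length, so it is roughly the fragment size minus twice the read length. The sketch below estimates the fragment size from the insert size column of a SAM file (column 9, TLEN) and converts it. It assumes that the ~120 estimate is a full fragment length; the function names and the max_span cutoff are illustrative choices, and TLEN for pairs that straddle an intron is inflated by the intron, which is why large values are dropped and a median is used.

```python
import statistics

READ_LENGTH = 50  # from the post: 50 bp paired-end reads


def median_fragment_size(sam_lines, max_span=1000):
    """Median of positive TLEN values (SAM column 9).

    max_span is an arbitrary, assumed cutoff that discards pairs whose
    genomic span is probably inflated by an intron.
    """
    spans = []
    for line in sam_lines:
        if line.startswith("@"):  # skip header lines
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:  # not a complete alignment record
            continue
        tlen = int(fields[8])
        # Positive TLEN appears once per pair (on the leftmost mate).
        if 0 < tlen <= max_span:
            spans.append(tlen)
    if not spans:
        raise ValueError("no usable TLEN values found")
    return statistics.median(spans)


def mate_inner_distance(fragment_size, read_length=READ_LENGTH):
    """Inner distance between mates: fragment size minus both reads."""
    return int(round(fragment_size - 2 * read_length))


# Toy records (made up for illustration), one line per mate.
toy_sam = [
    "@HD\tVN:1.0\tSO:coordinate",
    "r1\t99\tchr1\t100\t255\t50M\t=\t170\t120\t*\t*",
    "r1\t147\tchr1\t170\t255\t50M\t=\t100\t-120\t*\t*",
    "r2\t99\tchr1\t300\t255\t50M\t=\t368\t118\t*\t*",
    "r2\t147\tchr1\t368\t255\t50M\t=\t300\t-118\t*\t*",
    "r3\t99\tchr1\t500\t255\t50M\t=\t574\t124\t*\t*",
    "r3\t147\tchr1\t574\t255\t50M\t=\t500\t-124\t*\t*",
    "r4\t99\tchr1\t900\t255\t50M\t=\t50850\t50000\t*\t*",  # spans an intron
    "r4\t147\tchr1\t50850\t255\t50M\t=\t900\t-50000\t*\t*",
]

print(mate_inner_distance(median_fragment_size(toy_sam)))  # 20 for this toy input
```

Under that assumption, a ~120 bp fragment with 50 bp reads gives an inner distance of about 20, so -r 50 would be larger than the true value rather than smaller. If the ~120 was instead already an inner distance, the subtraction does not apply.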

    Thanks in advance for the help
    Yehudit
    Yu
