Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Cufflinks returning elevated FPKM values for 'new' transcripts

    I am running RNA-Seq in a Tuxedo pipeline. In the final Cufflinks output, I am getting massively elevated FPKM values for some transcripts. These inflated values are only returned for the transcripts newly discovered by Cufflinks/TopHat (i.e. transcripts that were not previously annotated). The transcripts from annotated genes seemed normal.

    In one analysis, for example, the known genes had an average FPKM value of 214.5, and a maximum FPKM value of 147,473. The newly discovered transcripts, however, returned a mean FPKM of 139,234.6 and a maximum of 74,769,200; 2.5% of the new transcripts had vales greater than the maximum FPKM detected for the annotated genes. The results for the new transcripts clearly contain artifacts.

    My pipeline is below
    tophat \
    --min-anchor-length 10 \
    --splice-mismatches 1 \
    --min-intron-length 5 \
    --microexon-search \
    --fusion-search \
    --transcriptome-index=/gpfs/group1/f/flyinv/working_index/Dpse3_0_1_exons_ \
    -o "/gpfs/group1/f/flyinv/Outputs_TopHat/transcriptomeSequence_exons/AR_MSH126_Male" \
    /gpfs/group1/f/flyinv/working_index/Dpse3_0_1 \
    /gpfs/group1/f/flyinv/RNASeq/AR_MSH126_Male_1_sequence.txt \
    /gpfs/group1/f/flyinv/RNASeq/AR_MSH126_Male_2_sequence.txt \

    cufflinks \
    --output-dir "/gpfs/group1/f/flyinv/Outputs_CuffLinks/transcriptomeSequence_exons/AR_MSH126_Male" \
    --GTF-guide /gpfs/group1/f/flyinv/working_index/Dpse3_0_Exons.gff3 \
    --upper-quartile-norm \
    --min-intron-length 5 \
    --quiet \
    /gpfs/group1/f/flyinv/Outputs_TopHat/transcriptomeSequence_exons/AR_MSH126_Male/accepted_hits.bam \


    Does anyone know how or why this occurs? How can it be prevented? How can such artifacts be screened out of downstream analyses?

Latest Articles

Collapse

  • seqadmin
    Non-Coding RNA Research and Technologies
    by seqadmin




    Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

    Nobel Prize for MicroRNA Discovery
    This week,...
    10-07-2024, 08:07 AM
  • seqadmin
    Recent Developments in Metagenomics
    by seqadmin





    Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
    09-23-2024, 06:35 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 06:55 AM
0 responses
10 views
0 likes
Last Post seqadmin  
Started by seqadmin, 10-02-2024, 04:51 AM
0 responses
108 views
0 likes
Last Post seqadmin  
Started by seqadmin, 10-01-2024, 07:10 AM
0 responses
114 views
0 likes
Last Post seqadmin  
Started by seqadmin, 09-30-2024, 08:33 AM
1 response
118 views
0 likes
Last Post EmiTom
by EmiTom
 
Working...
X