Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • nwalwort
    Junior Member
    • Feb 2012
    • 4

    Cufflinks reporting differnt FPKMS for the same gene

    Hello,

    I am analyzing some bacterial RNA-seq data with cufflinks. Since in bacterial RNA-seq, splicing isn't an issue, I am using a mapping program that does not take splicing into account. I just wanted to get FPKM's for my genes and do some differential expression even though I am aware that cufflinks was created for euk RNA-seq. In the FAQ's, it states that cufflinks will work with bacterial RNA-seq given that I map with a fasta file of already annotated genes. I know cufflinks assembles transcripts, but when I feed it my sam file (generated from mapping program perM by mapping reads to a mulitfasta file of genes in the genome), cufflinks returns multiple locations of one gene in separate rows with all of their own FPKM's. I wanted just an FPKM for each gene. Does anyone have any way to resolve this issue? Cheers.
  • Nicolas
    Member
    • Apr 2009
    • 41

    #2
    Could it be multiple isoforms of the same gene?

    Comment

    • nwalwort
      Junior Member
      • Feb 2012
      • 4

      #3
      Hey Nicolas,

      thats what I was thinking it might be. However, I looked at the read pileups with IGV, and cufflinks is just assembling different transcripts de novo of the same gene based on clustering reads. So, for example, looking at one gene that I am mapping reads too, instead of calculating the FPKM for all reads hitting that gene, cufflinks is splitting the gene up into thirds based on where reads are piling up and calculating three different FPKMs for each region of the gene and then reporting it as different "genes" (transcripts). So, rather than different isoforms, it looks like it is just splitting up genes based on where reads fall. I am also using a mapping program unaware of splicing. I am trying my luck with a few other programs to compare.

      Comment

      • Nicolas
        Member
        • Apr 2009
        • 41

        #4
        Could you post the command you're using?
        Which Cufflinks mode are you using, de novo (default), with a reference annotation (-G) or RABT (-g)?
        Is there a complete coverage of your gene? If not (and if you're using de novo mode), then Cufflinks has no information supporting the fact that the 3 regions are actually one single gene...

        Please provide more info.

        Comment

        • nwalwort
          Junior Member
          • Feb 2012
          • 4

          #5
          Hello Nicolas, my command is below:

          cufflinks -N -u seq1_380-380_r1_out.sorted.sam

          It is in default mode i think. I aligned my reads to a multifasta file with annotated genes in hopes that it would be sufficient for cufflinks to assign reads to only these genes but I was wrong, and cufflinks assembled transcripts because no GTF was supplied. I was trying to search for anything on how to obtain or generate a reference GTF file for my bacterium, but I cannot seem to find it. Surely, that would probably fix my problem. do you know how I might generate one with an annotated reference genome in fasta format. Thank you for your inquiries! I am still quite new to this

          Comment

          • Nicolas
            Member
            • Apr 2009
            • 41

            #6
            Does your multifasta file contains one entry per gene?
            If so, it should be easy to count the number of reads mapping to each entry (samtools idxstats <aln.bam> for instance). You can then normalize by exon size and library depth to achieve something similar to FPKM.
            I don't think Cufflinks could do what you want, but I am also not sure you really need it!

            Comment

            • nwalwort
              Junior Member
              • Feb 2012
              • 4

              #7
              Thank you for the replies Nicolas. Much appreciated. good luck with everything

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM
              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, Yesterday, 11:10 AM
              0 responses
              7 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              42 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-09-2026, 11:58 AM
              0 responses
              103 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-05-2026, 10:09 AM
              0 responses
              125 views
              0 reactions
              Last Post SEQadmin2  
              Working...