Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Cufflinks/cuffdiff and overlapping genes

    Hey,

    I have been using cuffdiff to determine DE genes in S. pombe which has a large number of overlapping gene. I am noticing that all the genes that overlap seem to be merged as one gene in the final testing. Is this a known issue with cuffdiff and is there a way to handle it.

    I'd like to keep using this software package because all the other published DE studies in S. pombe out there seem to use this pipeline. Is this something that's generally ignored in those papers or is there a simple fix i can't seem to figure out?

    Thanks for you help!

  • #2
    Do you have a stranded dataset? If not, then that's probably what's causing the gene merging. The alternative is to just use cufflinks/cuffquant for quantification with a GTF file and not try to modify existing or look for new features.

    Comment


    • #3
      It is stranded data do you think it would make difference if i use a gff3 vs gtf file?

      Comment


      • #4
        It shouldn't matter whether you use a gff3 or gtf file. Make sure you tell cufflinks and cuffdiff about your stranded library, not doing so can substantially degrade the results.

        Comment


        • #5
          yeah it was telling it the correct library type that fixed it thanks!

          Comment


          • #6
            actually that helps with cufflinks. But then cuffdiff moves back to merging them even though i tell it the library type to use. It seems to be just the the genes that overlap on the exact same value like...

            SPAC19D5.06c.1 5220220 5221641
            SPAC19D5.11c.1 5220220 5221383

            any thoughts?

            Comment


            • #7
              Please excuse my ignorance. What is a "stranded dataset"?

              Thank you

              Comment


              • #8
                we use the scriptseq library preparation kit so we have information on which strand the reads come from.

                Comment


                • #9
                  @kswithers: Did you use the right --library-type with cuffdiff as well?

                  Comment


                  • #10
                    yeah same one --library-type fr-secondstand

                    Comment


                    • #11
                      No clue then. Have a look in the merged GTF and see if cuffmerge merged them.

                      Comment


                      • #12
                        yeah something is definitely being lost in the cuffmerge part. The individual transcript.gtf files are fine then after cuffmerge this gene merging phenomena seems to be happening both when i include a reference gtf file and without it.

                        Comment


                        • #13
                          If I remember that right: bacteria -> polycystronic mRNA.
                          It's just assumed that overlapping (or very close) genes are on one mRNA, therefore they're treated as one locus.

                          If you take a look at your GTF file, you'll see in the last column all the identifiers, not only the "transcript locus", but also the old IDs (oID, I think).
                          You can tell cuffdiff in one of the options (don't ask me, I'm not at my work computer) to use these alternative IDs to do the DE.

                          Comment


                          • #14
                            That would be awesome and an easy fix! Any idea what that option may be i can't seem to find anything in the man or manual pages

                            Comment


                            • #15
                              I've just checked now, and I can't find it o_O.
                              Maybe I did some conversions with another program, but then also no clue what I did at that time point (the merged orfs are fine for me, so I didn't bother).

                              I'd try to simple replace the oID entry with geneID, and the other way round. Maybe that's already enough.

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Essential Discoveries and Tools in Epitranscriptomics
                                by seqadmin




                                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                                04-22-2024, 07:01 AM
                              • seqadmin
                                Current Approaches to Protein Sequencing
                                by seqadmin


                                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                                04-04-2024, 04:25 PM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, Today, 11:49 AM
                              0 responses
                              13 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, Yesterday, 08:47 AM
                              0 responses
                              16 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-11-2024, 12:08 PM
                              0 responses
                              61 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 04-10-2024, 10:19 PM
                              0 responses
                              60 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X