Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Multiple genes, one fpkm value

    Hi everyone,

    My apologies if this question has been asked before, but my searches on the forum came up with nothing.

    I got sequencing results back from mouse RNA (75 bp PE, 50 mln reads) and then ran it through a Tophat-Cufflinks-Cuffmerge-Cuffdiff pipeline. This results in a list with differentially transcribed genes. So far, so good.

    However, some genes seem to have been 'combined' somewhere along the pipeline, that it is there are multiple gene symbols on one line but only one fpkm value, chromosomal location, etc. Below are two examples:

    XLOC_002706 XLOC_002706 Neurod4,Vmn2r84,Vmn2r85,Vmn2r86,Vmn2r87 chr10:130268058-130542669 SC4 SC2 OK 889.112 524.056 -0.762645 -256.757 5,00E-05 0.00061898 yes

    XLOC_003158 XLOC_003158 2410006H16Rik,Snord49a,Snord49b chr11:62601222-62670908 SC4 SC2 OK 327.662 373.866 0.19031 0.453688 0.3605 0.742145 no

    As you can see, this can be found both when there is or is no significant change.

    Has anyone had this problem before? If so, what is the best solution? Should the reads be trimmed to avoid them overlapping multiple genes and if so, how much trimming is recommended?

    Many thanks for your input.

  • #2
    These are indeed most probably overlapping genes for which the read counter can't determine to which feature these reads belong... There is likely no perfect solution.

    Comment


    • #3
      ^As mentioned above. They are overlapping genes, I would suggest taking a look at them through NCBI IGV to get a better idea on how things are.

      Also check to see that it might be Isoforms if you have are trying to find novel transcripts of the genes.

      There are some programs like MISO and NCBI IUTA that might help with Isoform detection.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Quality Control Essentials for Next-Generation Sequencing Workflows
        by seqadmin




        Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.

        Nucleic Acid Quality Control
        Preparing for NGS starts with isolating the...
        02-10-2025, 01:58 PM
      • seqadmin
        An Introduction to the Technologies Transforming Precision Medicine
        by seqadmin


        In recent years, precision medicine has become a major focus for researchers and healthcare professionals. This approach offers personalized treatment and wellness plans by utilizing insights from each person's unique biology and lifestyle to deliver more effective care. Its advancement relies on innovative technologies that enable a deeper understanding of individual variability. In a joint documentary with our colleagues at Biocompare, we examined the foundational principles of precision...
        01-27-2025, 07:46 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 02-07-2025, 09:30 AM
      0 responses
      71 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 02-05-2025, 10:34 AM
      0 responses
      112 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 02-03-2025, 09:07 AM
      0 responses
      86 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 01-31-2025, 08:31 AM
      0 responses
      48 views
      0 likes
      Last Post seqadmin  
      Working...
      X