Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • lollysticky
    Junior Member
    • Aug 2010
    • 4

    RNAseq transcriptome analysis: workflow problem

    Hi all,


    I will be performing RNAseq transcriptome analysis on a certain organism under a specified number of conditions (let's say 10). The goal is to construct a tab-delimited file which contains the expression values (the raw read counts, not RPKM/FPKM values) for each gene under all conditions.
    I am the first one in the lab to perform such an experiment and we don't have a standard workflow developed. Therefore I would ask you, the community, to review what I sketched so far and respond to my questions if possible!

    my design so far:
    1) generate RNAseq data
    2) preprocessing the data: FASTX-toolkit (quality check, trimming, clipping, filtering)
    3) aligning the reads -> TopHat (SAMformat output)
    4) Iterate 1-3 for each condition...
    5) construct file
    6) further analysis


    Questions:
    a) what to do with isoforms? Do I take them into consideration (using Cufflinks or so), or not? My organism has very few introns, and I expect to see little isoform transcripts. Nevertheless, any isoform information is valuable.
    b) how to tackle multireads? As far as I understand it, TopHat does not carry out some multi-read re-distribution like ERANGE does...
    c) how to proceed to raw read counts? TopHat reports RPKM values, but I need raw read counts -> I could use some sort of comparison script which uses my annotation files to construct a read count for each gene? (BEDtools can do this I think).

    If anybody has a better suggestion for a workflow and/or possible answers to my questions, please post them here


    thanks!
  • john_mu
    Member
    • May 2010
    • 88

    #2
    SpliceMap does separate multi-reads, you could check the paper to see if it suits your purpose. Although the paper does describe an older version, it gives the main idea.

    The link is in my signature.
    SpliceMap: De novo detection of splice junctions from RNA-seq
    Download SpliceMap Comment here

    Comment

    • lollysticky
      Junior Member
      • Aug 2010
      • 4

      #3
      We have used TopHat in the lab before and would like to continue using it
      that's why I'm searching for a solution

      Comment

      • frozenlyse
        Senior Member
        • Sep 2008
        • 135

        #4
        One thing to note - TopHat will only work with sequences of all the same length, so trimming adaptors is a bit of a *****

        Comment

        • jameslz
          Member
          • Nov 2009
          • 20

          #5
          Originally posted by frozenlyse View Post
          One thing to note - TopHat will only work with sequences of all the same length, so trimming adaptors is a bit of a *****

          I met the same situation, but it sees to be better if trimming the low quality base(such as 'B') from the 3' end.

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM
          • SEQadmin2
            Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
            by SEQadmin2


            With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


            Introduction

            Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
            05-22-2026, 06:42 AM
          • SEQadmin2
            Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
            by SEQadmin2

            Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


            Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
            05-06-2026, 09:04 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, Yesterday, 08:59 AM
          0 responses
          12 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-02-2026, 12:03 PM
          0 responses
          21 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-02-2026, 11:40 AM
          0 responses
          18 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 05-28-2026, 11:40 AM
          0 responses
          31 views
          0 reactions
          Last Post SEQadmin2  
          Working...