Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • lollysticky
    Junior Member
    • Aug 2010
    • 4

    RNAseq transcriptome analysis: workflow problem

    Hi all,


    I will be performing RNAseq transcriptome analysis on a certain organism under a specified number of conditions (let's say 10). The goal is to construct a tab-delimited file which contains the expression values (the raw read counts, not RPKM/FPKM values) for each gene under all conditions.
    I am the first one in the lab to perform such an experiment and we don't have a standard workflow developed. Therefore I would ask you, the community, to review what I sketched so far and respond to my questions if possible!

    my design so far:
    1) generate RNAseq data
    2) preprocessing the data: FASTX-toolkit (quality check, trimming, clipping, filtering)
    3) aligning the reads -> TopHat (SAMformat output)
    4) Iterate 1-3 for each condition...
    5) construct file
    6) further analysis


    Questions:
    a) what to do with isoforms? Do I take them into consideration (using Cufflinks or so), or not? My organism has very few introns, and I expect to see little isoform transcripts. Nevertheless, any isoform information is valuable.
    b) how to tackle multireads? As far as I understand it, TopHat does not carry out some multi-read re-distribution like ERANGE does...
    c) how to proceed to raw read counts? TopHat reports RPKM values, but I need raw read counts -> I could use some sort of comparison script which uses my annotation files to construct a read count for each gene? (BEDtools can do this I think).

    If anybody has a better suggestion for a workflow and/or possible answers to my questions, please post them here


    thanks!
  • john_mu
    Member
    • May 2010
    • 88

    #2
    SpliceMap does separate multi-reads, you could check the paper to see if it suits your purpose. Although the paper does describe an older version, it gives the main idea.

    The link is in my signature.
    SpliceMap: De novo detection of splice junctions from RNA-seq
    Download SpliceMap Comment here

    Comment

    • lollysticky
      Junior Member
      • Aug 2010
      • 4

      #3
      We have used TopHat in the lab before and would like to continue using it
      that's why I'm searching for a solution

      Comment

      • frozenlyse
        Senior Member
        • Sep 2008
        • 135

        #4
        One thing to note - TopHat will only work with sequences of all the same length, so trimming adaptors is a bit of a *****

        Comment

        • jameslz
          Member
          • Nov 2009
          • 20

          #5
          Originally posted by frozenlyse View Post
          One thing to note - TopHat will only work with sequences of all the same length, so trimming adaptors is a bit of a *****

          I met the same situation, but it sees to be better if trimming the low quality base(such as 'B') from the 3' end.

          Comment

          Latest Articles

          Collapse

          • GATTACAT
            Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by GATTACAT
            Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
            07-01-2026, 11:43 AM
          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, Yesterday, 11:08 AM
          0 responses
          6 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-30-2026, 05:37 AM
          0 responses
          11 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-26-2026, 11:10 AM
          0 responses
          19 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          53 views
          0 reactions
          Last Post SEQadmin2  
          Working...