RNA-seq Analysis 101

thinkRNA

Member

Join Date: Jan 2010

Posts: 94
- Share
- Tweet
#1

RNA-seq Analysis 101

02-11-2010, 09:09 AM

I am starting out analyzing unpaired Illumina RNA-seq reads to determine differential expression between two samples (and eventually want to look at alternative splicing).

From all my reading so far, it seems these are the step to follow and there are TONS of programs for each step. There are also integrated softwares out there which will supposedly do all these for you.

The thread here is to find out (1) Are these steps right? (2) Which programs work the best for all of you. I am asking the masters out there, what worked for them and to fill in the gaps.

Steps:
(1) Obtain a data set, preferably a small set your computer and brain can handle. Tons of data available now in NCBI Trace archive.
(2) Align it to the reference genome (I am using Bowtie, I know there are many other software). Tweak the parameters and get maximum reads aligned. Understand what is happening to unmapped reads as well as the mapped reads. Get familiar with output. Ultimately get it in a format you understand. I am using SAM format. This step is critical, I think.
(3) Since I am interested in alternative splicing, I plan on using Tophat (which uses Bowtie) to align reads to known junctions. In the future, I plan on providing my own set of junctions.
(4) Determine differential expression. Which program is best for this and has worked well?? Cufflinks, DegSeq. I understand that this is step where you will perform any normalization and implement complex statistics (Poisson or determine likelihood) to determine if a gene is differentially expressed between two samples. Which program has worked well for you, what are the pros and cons?
(5) Use a visualization tool to look at your data. you can also do this after step (2)

So, does this sound right? What are the challenges in analyzing RNA-seq data, besides choosing from a large number of options. I am more interested in knowing from people who have successfully determined differential expression of genes which step requires caution and is time consuming, where in lies the challenge?
Tags: None
steven

Senior Member

Join Date: Aug 2009

Posts: 269
- Share
- Tweet
#2

02-15-2010, 10:17 AM

Originally posted by thinkRNA View Post

The thread here is to find out (1) Are these steps right?

Looks pretty good!
Some more info maybe in this paper (nature methods).
Comment
crazyhottommy

Senior Member

Join Date: Apr 2012

Posts: 165
- Share
- Tweet
#3

08-28-2013, 07:28 PM

The best way is trying out the whole process by yourself. Mapping by Tophat may be tricky sometimes...and it took me long time even I do it on the computing cluster...
Comment

Previous template Next

Current Approaches to Protein Sequencing

by seqadmin

Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
- Channel: Articles
04-04-2024, 04:25 PM
Strategies for Sequencing Challenging Samples

by seqadmin

Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
- Channel: Articles
03-22-2024, 06:39 AM

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 25 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 29 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 24 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

RNA-seq Analysis 101

Comment

Comment

Latest Articles

ad_right_rmr

News