Seqanswers Leaderboard Ad

**honey** · 09-29-2010, 06:11 AM

I think it is right place and is very useful

**natstreet** · 10-26-2010, 01:03 AM

The guide is really useful thanks. In it you use the data from Li et al 2008 as an example dataset. Can you point me to where I could download the fasta files you detail?

**poisson200** · 10-26-2010, 02:41 AM

Dear Matt,
It is a very good place for a document like this. Someone asked me to detail how to perform RNA-seq gene diff-ex analyses on short read data; this document is an excellent example. I think it will really help a lot of people and save a lot of time (I would have done a couple of things differently but that is just personal experience/preference).

Thank you for the contribution.

Actually, a Next Generation Sequencing wiki, if it does not exist already, is a great idea.

John.

**hanifk** · 10-27-2010, 11:35 PM

I have spent a lot time to find such a tutorial
but it seems that very little material is availble
thanks for your help

**huyvuong** · 11-11-2010, 06:30 PM

Hi Matt,
Thank very much for sharing your guide. Would you please let me know the link to download the Li Prostate cancer dataset you mentioned in the guide, i.e the 7 fa files? I couldn't find them in the publication's supporting information. Thanks

**diya** · 11-15-2010, 09:24 AM

Very useful document for beginners in deep-sequencing

Hi Matt,

I have been searching so much for such kind of tutorial. The tutorial is very helpful.

Thanks,

Diya

**MDY** · 11-15-2010, 07:50 PM

Hi everyone,

Sorry for the slow reply, I somehow managed to miss the replies. For those asking where to get the seven fasta files used in this guide, they are using the data used in the referenced paper, Li et al 2008 ( http://www.ncbi.nlm.nih.gov/sites/en...,f1000m,isrctn ). As far as I know, the files aren't stored on GEO, but the authors were happy to send the data when contacted by email. The 7 files are 3 treated and 4 untreated lanes of RNA-seq.

Cheers,

Matt

**flyyuan** · 12-03-2010, 06:03 AM

Thanks Matt for this nice guide, now, I am tring to analysis some soybean rna-seq data following this article. However, I am very new to this work, could anybody give me some suggestions to solve following problems:

1. I try to use makeTranscriptDbFromBiomart to get the information of soybean in phytozome database, but it seems there many organisms in phytozome database, how can I select the G.max which I need?

2.bowtie software map the RNA-seq tag to reference gene, what is the criterion for match or does not match.

thanks in advance!

**nancyelatimer** · 12-07-2010, 07:33 AM

Matt - Awesome super-polished resource for those with or without experience in NGS or RNA-seq! Please feel free to share any other resources you have created. Thank you.

**Optimistix** · 12-09-2010, 07:39 PM

Thanks a lot for the nice guide and sharing it with all of us, Matt!

**MDY** · 12-13-2010, 06:54 PM

flyyuan - I'm not sure what the answer to your first question about biomart. A detailed description of how bowtie decides on a valid match can be found on the bowtie webpage and in particular the manual. You might want to look at this http://bowtie-bio.sourceforge.net/ma...alignment-mode

In brief, in the default mode bowtie will report a read as matching if it has fewer than -n mismatches from the reference in the seed and the sum of the quality scores at ANY mismatching base within the entire read is less than -e.

**KevinLam** · 12-18-2010, 02:04 AM

Excellent. This should be a sticky!

**colindaven** · 01-25-2011, 04:07 AM

Excellent introductory guide, thank you!

**Azazel** · 01-26-2011, 08:47 PM

Hi Matt,

thanks for putting up this excellent tutorial.

I have one constructive critisism or discussion point though; as I understand it, when checking for differential expression (DE) you only consider reads "overlapping some annotation object, which is usually something like a collection of genes downloaded from the UCSC."

So you suggest checking DE only for something like RefSeq, and taking the number of reads within each RefSeq (or other object) as the expression level.

I think this discards not only much of the information gained by RNA-seq, but also some of the most important information: the most interesting genes are often among the non-annotated genes. Consider for example two cellular states, a very interesting gene might only be expressed in the very unusual state B, and be very highly expressed; while in state A it's not or so lowly expressed that it didn't make it into the annotation. So with this approach a researcher would miss this gene and others like it entirely because it's not in the annotation, although these might be the very genes which explain the biological question at hand.

If I'd use RNA-seq just to identify DE genes which are already annotated in UCSC, I almost might as well have used a tiling array spanning the annotated genes only. (sure RNA-seq is "digital", but the point I'm trying to make is that with UCSC or similar annotation one would ignore 90%+ of the RNA-seq data elsewhere in the genome!)

So I think a better approach would be first to use the RNA-seq data to produce an ad hoc annotation, including information from all sequenced conditions, then check DE against this annotation.

Now the question is of course, what is a very good way to create an annotation, i.e. how to identify the regions spanned by genes, from RNA-seq?

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 18 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 22 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 47 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Guide/tutorial for the analysis of RNA-seq data

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News