**Jim Robinson** · 11-01-2015, 12:14 AM

Hi, what protocol was used to prepare the library?

**weirdkid** · 11-01-2015, 12:32 AM

Hi Jim,

do you mean the library for the reference genome? If so this is what I used:

bowtie2-build /path/to/421K17.fa /path/to/421K17/421K17

thanks

**Jim Robinson** · 11-01-2015, 12:48 AM

No, I mean the library prep for the RNA, e.g. dUTP.

**weirdkid** · 11-01-2015, 01:17 AM

In that case I'm not sure. I got this data from another lab and I don't think they passed that information on to me.

**Jim Robinson** · 11-01-2015, 08:04 AM

I ask because it doesn't appear to be strand preserving. I would verify that with the lab you received it from.

**weirdkid** · 11-01-2015, 02:13 PM

I'll check this with the lab and come back to you on that then. But what makes you think that it is not strand preserving?

**Jim Robinson** · 11-01-2015, 02:56 PM

The reads look more or less evenly distributed between strands when colored by first-in-pair strand. The junctions also look more or less even between strands. It just has the look of an unstranded library; if it is, no amount of informatics will recover the strand information. I thought that was the gist of your original post, sorry if I misunderstood.
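
For anyone who wants to reproduce the "first-in-pair strand" tally outside a viewer, here is a minimal sketch working from SAM FLAG values. It assumes paired, mapped reads; the function names and the example flags are made up for illustration. The counts are relative to the reference, so they only say something about strandedness when taken over a gene whose strand you know.

```python
from collections import Counter

def first_in_pair_strand(flag: int) -> str:
    """Return the strand of read 1 of the pair, given either mate's SAM FLAG."""
    if flag & 0x40:
        # This record is read 1: use its own reverse-strand bit (0x10).
        return "-" if flag & 0x10 else "+"
    # This record is read 2: read 1 is the mate, so use the mate-reverse bit (0x20).
    return "-" if flag & 0x20 else "+"

def strand_counts(flags):
    """Tally first-in-pair strands over an iterable of SAM FLAG integers."""
    return Counter(first_in_pair_strand(f) for f in flags)

# Hypothetical flags: 99/147 are one proper pair, 83/163 another.
example_flags = [99, 147, 83, 163]
print(strand_counts(example_flags))
```

In a stranded library, reads over a single gene should fall almost entirely into one of the two classes; a roughly even split over many genes is what an unstranded library looks like.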

**weirdkid** · 11-01-2015, 04:27 PM

So I asked, and from what I was told the samples were sequenced with Illumina's TruSeq RNA kit (total RNA) and were strand specific.

**Michael.Ante** · 11-02-2015, 01:27 AM

What is the difference between the two subsets in your IGV example?
In terms of strand preservation, Illumina's TruSeq stranded achieves 90 % - 97 %; so if you see 950 reads in the correct strand orientation and 50 in the wrong orientation, your result is acceptable.
For a sample-wide analysis, you can use RSeQC's infer_experiment.py script on the complete data set (accepted_hits.sort).
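
As a quick worked version of the 950 / 50 example, a small sketch (the function name is just for illustration):

```python
def strand_preservation(correct: int, wrong: int) -> float:
    """Fraction of reads found in the expected strand orientation."""
    total = correct + wrong
    if total == 0:
        raise ValueError("no reads counted")
    return correct / total

fraction = strand_preservation(950, 50)
print(f"{fraction:.1%}")  # 950 of 1000 reads in the expected orientation
```

That comes out at 95 %, inside the 90 % - 97 % range quoted above.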

**weirdkid** · 11-02-2015, 10:03 PM

I've tried using RSeQC's infer_experiment script, but I'm not sure what it means when it asks for the reference genome in bed format. My reference genome is in fasta format, and converting from fasta to bed doesn't make sense to me...

**Michael.Ante** · 11-03-2015, 12:17 AM

RSeQC needs gene/transcript information. Thus, you need to convert your gtf/gff (Fgenesh_EST_aligned.gff) into a 12-column BED file. You can either script it yourself or use, for instance, a tool like this.
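
If you go the script-it-yourself route, a rough sketch could look like the following. It assumes GTF-style input where each `exon` line carries a `transcript_id "..."` attribute; I have not seen your Fgenesh file, so the attribute name may well differ and the regex would need adapting. Since no CDS information is read, thickStart/thickEnd are simply set to the transcript bounds. The function name and the example records are hypothetical.

```python
import re
from collections import defaultdict

# Assumption: GTF-style attributes such as: gene_id "g1"; transcript_id "t1";
TRANSCRIPT_ID = re.compile(r'transcript_id "([^"]+)"')

def gtf_to_bed12(gtf_lines):
    """Group exon records by transcript and emit one BED12 line per transcript."""
    exons = defaultdict(list)  # transcript_id -> [(0-based start, end)]
    info = {}                  # transcript_id -> (chrom, strand)
    for line in gtf_lines:
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) < 9 or cols[2] != "exon":
            continue
        match = TRANSCRIPT_ID.search(cols[8])
        if match is None:
            continue
        tid = match.group(1)
        # GTF is 1-based inclusive; BED is 0-based half-open.
        exons[tid].append((int(cols[3]) - 1, int(cols[4])))
        info[tid] = (cols[0], cols[6])

    bed = []
    for tid, blocks in exons.items():
        blocks.sort()
        chrom, strand = info[tid]
        start = blocks[0][0]
        end = max(e for _, e in blocks)
        sizes = ",".join(str(e - s) for s, e in blocks) + ","
        starts = ",".join(str(s - start) for s, _ in blocks) + ","
        fields = [chrom, start, end, tid, 0, strand,
                  start, end, 0, len(blocks), sizes, starts]
        bed.append("\t".join(str(f) for f in fields))
    return bed

# Hypothetical two-exon transcript on a contig named "contig1".
example = [
    'contig1\tsrc\texon\t101\t200\t.\t+\t.\tgene_id "g1"; transcript_id "t1";',
    'contig1\tsrc\texon\t301\t400\t.\t+\t.\tgene_id "g1"; transcript_id "t1";',
]
print("\n".join(gtf_to_bed12(example)))
```

Check a few transcripts of the result against the annotation by eye before feeding it to RSeQC, since an off-by-one in the coordinates is easy to miss.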

**weirdkid** · 11-03-2015, 10:44 PM

Finally managed to do it. I couldn't quite get the gff/gtf to convert to bed so I just did it manually.
Anyway, here is what I got:

This is PairEnd Data
Fraction of reads failed to determine: 0.0000
Fraction of reads explained by "1++,1--,2+-,2-+": 0.1667
Fraction of reads explained by "1+-,1-+,2++,2--": 0.8333

Therefore the data is strand-specific paired-end data. But I'm not quite sure how to interpret the difference between the two. If the majority of the reads are explained by "1+-,1-+,2++,2--" (0.8333), does this mean that I should have analyzed the data in tophat with fr-secondstrand instead of fr-firststrand?

**Michael.Ante** · 11-04-2015, 01:46 AM

According to this post, you should use fr-firststrand. When I analysed TruSeq stranded libraries, I got the same orientation, but higher percentages. For non-model organisms it might be different due to incomplete annotation.
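
To make the mapping explicit, here is a small sketch that parses the infer_experiment report and suggests the corresponding TopHat library type. The function names and the 0.8 cutoff are my own assumptions for illustration, not a threshold recommended by RSeQC or TopHat.

```python
import re

PATTERN = re.compile(r'Fraction of reads explained by "([^"]+)":\s*([0-9.]+)')

def parse_infer_experiment(text):
    """Extract the two 'explained by' fractions from the report text."""
    fractions = {}
    for line in text.splitlines():
        match = PATTERN.match(line.strip())
        if match:
            fractions[match.group(1)] = float(match.group(2))
    return fractions

def suggest_library_type(fractions, min_fraction=0.8):
    """Map the dominant read orientation to a TopHat --library-type value."""
    # Read 1 on the same strand as the gene.
    same = fractions.get("1++,1--,2+-,2-+", 0.0)
    # Read 1 on the opposite strand from the gene (dUTP-style libraries).
    opposite = fractions.get("1+-,1-+,2++,2--", 0.0)
    if opposite >= min_fraction:
        return "fr-firststrand"
    if same >= min_fraction:
        return "fr-secondstrand"
    # Roughly equal fractions would point to an unstranded library.
    return "unclear"

report = '''This is PairEnd Data
Fraction of reads failed to determine: 0.0000
Fraction of reads explained by "1++,1--,2+-,2-+": 0.1667
Fraction of reads explained by "1+-,1-+,2++,2--": 0.8333
'''
print(suggest_library_type(parse_infer_experiment(report)))
```

With the numbers posted above, the "1+-,1-+,2++,2--" class dominates, which is the fr-firststrand orientation.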

**weirdkid** · 11-04-2015, 04:20 PM

Hmmm, well I used fr-firststrand so I guess that can't be it.

I'll just keep trying a few different things and see where it gets me.

Help with RNA-seq and strand specific mapping