Unconfigured Ad

**EricHaugen** · 07-15-2015, 01:32 PM

Better to determine this setting empirically:
Run TopHat+Cufflinks pipeline separately with either firststrand or secondstrand options.
Then assuming your annotation file matches your library somewhat,
the version with much larger alignment and FPKM numbers will be the correct option for your library prep method.

**Mchicken** · 07-15-2015, 09:27 PM

First of all thanks for your advice. I already read this way of library determination somewhere.
Nevertheless there should be a logical explanation anyway. The company, which sequenced our samples told us yesterday, that indeed the R1 read corresponds to the sense/coding strand, like I observed when I mapped my paired-end data with TopHat2 (using library-type unstranded).

**Brian Bushnell** · 07-15-2015, 10:17 PM

Honestly... when I first wrote the code to handle firststrand/secondstrand, it took me a week of going back and forth and talking to different people who make libraries because the description in the Tuxedo package is so incredibly confusing. They should be named clearly, as in:

READ1-PLUS protocol and READ1-MINUS protocol, or READ1-SENSE, or something like that.

Every time I am asked questions about this I have to go back to the comments in my source code because the names are so vague and the official descriptions so opaque as to be meaningless.

**HESmith** · 07-16-2015, 03:53 AM

Brian's right; the terminology is confusing.

Regarding your original questions, the orientation of the gene on the DNA (Watson or Crick strand) is irrelevant. The quoted statement ["the leftmost end of the fragment (in transcript coordinates) is the first sequenced"] indicates that read1 proceeds in the 5'->3' orientation of the mRNA.

As for your second question, strandedness (for TopHat) refers to the sequence being generated. In diagram 5a, the first cDNA strand is the template, which means that the sequence is identical to the second cDNA strand.

**Mchicken** · 07-16-2015, 05:02 AM

Okay now to summarize:

In my case, indeed the library-type is fr-secondstrand as the R1 (forward) read maps in 5' to 3' direction of the mRNA.

And the reason to call it fr-secondstrand is that the first cDNA strand only served as template for the generation of the R1 read, which is identical to the "second strand" (leading to the name fr-secondstrand).

Up to now I used fr-unstranded as library type parameter, which also gave me good results. But I think in future I will be using the correct library type and hope that this will improve my result further.

Thank your very much guys, this issue has been a mystery for a long time for me and now I finally get it

**kenietz** · 08-17-2015, 07:04 PM

Hi guys,
i apologize for reviving the thread but i am also a bit confused about the stranded RNA-seq.
I have some Illumina PE data which is stranded but i dont know how the library was generated. I received bam files aligned with TopHat. So i used RSeQC's 'infer_experiment.py' command to tell me how the libraries are stranded.
So for one of them i get: 1++,1–,2+-,2-+ and for the other 1+-,1-+,2++,2–. Now my problem is to link this info to TopHat fr-firststrand or fr-secondstrand. From what i have read so far on the web it seems to me that:
- fr-secondstrand corresponds to 1++,1–,2+-,2-+
- fr-firststrand corresponds to 1+-,1-+,2++,2–

Is that right?

Asking because i wonder if the alignment could be improved if the appropriate library type is used. As of now default unstranded was used.

Thank you for your help time

Topics	Statistics	Last Post
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity by SEQadmin2 Started by SEQadmin2, 07-02-2026, 11:08 AM	0 responses 18 views 0 reactions	Last Post by SEQadmin2 07-02-2026, 11:08 AM
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, 06-30-2026, 05:37 AM	0 responses 19 views 0 reactions	Last Post by SEQadmin2 06-30-2026, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 21 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 54 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM

Unconfigured Ad

strand-specific libraries / firststrand /secondstrand

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News