Originally posted by sdriscoll
View Post
Unconfigured Ad
Collapse
X
-
but this is not necessarily correct strand information if i use an unstranded RNA-seq data, isn't it?
-
-
Since it wasn't mentioned yet I'll add that cufflinks determines the strand of the assembled isoforms from the value of the XS attribute in the alignments (generated by STAR with --outSAMstrandField intronMotif set at runtime). The XS attribute is only populated with strand information for spliced reads. The 4-bp motif at the splice site informs STAR what the strand is if the motif is a known one. If it is an unknown motif then there is no strand information. 90+% of splices will have those known motifs in mammalian genomes. The only other way cufflinks can determine strand is if you provide a reference GTF for assembly in which case it will use the strand information from that for matching assembled isoforms from the data.
Leave a comment:
-
-
hi,
If you are worried about a scenario where a gene locus has no/minimal sense transcription but very high antisense transcription and then Cufflinks not able to differentiate then you might need to do prepare a Stranded library before sequencing.
If not then at data analysis step there is very minimal you could do -
1) Do you know the sequence of these antisense? Do they maintain the exon intron boundary (introns spliced off), but just in complementary strand? Or do they read through introns? If they read through introns then you can set an arbitrary threshold (depending on your read length) saying -
If a read extends beyond the exon boundary into the intron sequence for at least 'n' bases, then it might be from an unspliced transcript/ antisense. Hence discard the read. then use the filtered reads only for transcript assembly.
Doing so genome-wide would be very tricky as there might be genuine transcripts with alternate exon start-ends.
I'm not aware of your organism, but if it is something that has been widely studied then there would be datasets available around & PCR validations to cross-check your results for.
Leave a comment:
-
-
thanks a lot, amitm. if the strand attribute from STAR feeding into cufflink is not really the strand information, is it going to affect how cufflink uses the info to assemble the transcripts? How should i deal with the sense and antisense assembled transcripts to reduce false positives?
Leave a comment:
-
-
hi,
Your un-stranded data doesn't get 'converted to stranded'. An un-stranded data would have reads from both strands as PCR amplification (during library prep.) amplifies both strands of the DNA.
The derived strand by STAR is based on alignment of any particular read and is not necessarily reflecting the strand due to the above reason.
Regarding whether assembly would be affected or not => Cufflinks wont run without the XS attribute in the SAM/BAM file.
Leave a comment:
-
-
using STAR+Cufflinks for transcript assembly turns unstranded RNA-seq to stranded?
I am trying to use STAR+Cufflinks to do a reference based transcript assembly using unstranded RNA-seq data.
As mentioned in the STAR manual "If you have un-stranded RNA-seq data, and wish to run Cufflinks/Cuffdiff on STAR alignments, you will
need to run STAR with --outSAMstrandField intronMotif option, which will generate the XS strand attribute for all alignments that contain splice junctions"
Thus in the generated SAM file, strand will be derived from the intron motif. Unstranded RNA-seq data will be assigned a strand, which results in a lot of genes have both sense and antisense transcripts in the merged transcript assembly.
My questions are:
1) how reliable is the derived strand info from intron motif?
2) Is the assembled transcripts affected by this?
Thank you very much!
Runxuan
Latest Articles
Collapse
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
-
by SEQadmin2
With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.
Introduction
Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...-
Channel: Articles
05-22-2026, 06:42 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, 06-05-2026, 10:09 AM
|
0 responses
14 views
0 reactions
|
Last Post
by SEQadmin2
06-05-2026, 10:09 AM
|
||
|
Started by SEQadmin2, 06-04-2026, 08:59 AM
|
0 responses
24 views
0 reactions
|
Last Post
by SEQadmin2
06-04-2026, 08:59 AM
|
||
|
Started by SEQadmin2, 06-02-2026, 12:03 PM
|
0 responses
28 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 12:03 PM
|
||
|
Started by SEQadmin2, 06-02-2026, 11:40 AM
|
0 responses
22 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 11:40 AM
|
Leave a comment: