
**ParthavJailwala** · 03-19-2012, 08:40 AM

Hi zorph,

I too am trying to get cufflinks to read a GSNAP-generated SAM file. Just curious, were those XS:A:- or XS:A:+ tags automatically inserted by GSNAP, or did you insert them manually?
If you inserted these tags manually, how did you figure out the strand information (+ or -)?
Thanks

**jnfass** · 05-02-2012, 03:25 AM

I'll third this complaint ... with SAM generated by GMAP. I've checked, and all of the records with N's in CIGAR strings do have XS:A:[+-] tags. I had wondered if there was a specific order that cufflinks is expecting the tags to be in, but the OP's example doesn't deviate from the example SAM lines given in the manual, so that seems unlikely.

Please post if either of you crack this case.

**rnaseek** · 05-07-2012, 12:03 PM

The latest version of GSNAP (released on 2012-04-27) says that it adds the XS tags, so one does not have to do this manually.

The XS tag is added to spliced reads and indicates which strand the read came from (not the strand it aligned to). The Cufflinks manual says:

This attribute, which must have a value of "+" or "-", indicates which strand the RNA that produced this read came from. While this tag can be applied to any alignment, including unspliced ones, it must be present for all spliced alignment records (those with a 'N' operation in the CIGAR string).
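A quick way to check a SAM file against that requirement is to scan for spliced records (an `N` in the CIGAR) whose XS tag is missing or is not `+` or `-`. The following is only a sketch: the function names are made up for illustration, and it assumes plain tab-separated SAM text with optional tags starting at the 12th column.

```python
def xs_value(fields):
    """Return the value of the XS:A tag in a split SAM record, or None if absent."""
    for tag in fields[11:]:  # optional tags start at column 12
        if tag.startswith("XS:A:"):
            return tag[5:]
    return None


def find_bad_spliced_records(sam_lines):
    """Return read names of spliced records lacking an XS:A:+ or XS:A:- tag."""
    bad = []
    for line in sam_lines:
        if line.startswith("@"):  # skip header lines
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:  # not a complete alignment record
            continue
        cigar = fields[5]
        if "N" in cigar and xs_value(fields) not in ("+", "-"):
            bad.append(fields[0])
    return bad
```

Passing an open SAM file handle as `sam_lines` should work, since the function only iterates over lines.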

Note that the strand a read aligned to is easy to get from the SAM flag. But getting the strand of the RNA it came from is tricky in unstranded sequencing. TopHat uses splice junction information to infer that. One can try to add the XS tag manually based on the sequence at the splice junction of the alignment. The TopHat manual says:

With long (>=75bp) reads, "GT-AG", "GC-AG" and "AT-AC" introns will be found ab initio. With shorter reads, TopHat only reports alignments across "GT-AG" introns
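As a rough illustration of adding the tag by hand from the junction sequence, the sketch below walks the CIGAR string, pulls the first and last two reference bases of each `N` gap, and compares them with the three intron motifs named above and their reverse complements. It is not TopHat's or GSNAP's actual logic. The function name, the choice to pass the reference as a plain string, and the decision to return `None` when junctions are non-canonical or disagree are all assumptions made for this example.

```python
import re

# Intron (donor, acceptor) dinucleotides as read on the forward reference strand.
PLUS_MOTIFS = {("GT", "AG"), ("GC", "AG"), ("AT", "AC")}
# The same introns when the transcript lies on the minus strand (reverse complements).
MINUS_MOTIFS = {("CT", "AC"), ("CT", "GC"), ("GT", "AT")}


def infer_xs(ref_seq, pos, cigar):
    """Guess the XS strand for one alignment.

    ref_seq: reference sequence of the chromosome/contig as a string
    pos:     1-based leftmost mapping position (SAM POS column)
    cigar:   SAM CIGAR string
    Returns "+", "-", or None when there is no junction or no clear call.
    """
    ref_pos = pos - 1  # convert to 0-based
    calls = set()
    for length, op in re.findall(r"(\d+)([MIDNSHP=X])", cigar):
        length = int(length)
        if op == "N":
            intron = ref_seq[ref_pos:ref_pos + length].upper()
            motif = (intron[:2], intron[-2:])
            if motif in PLUS_MOTIFS:
                calls.add("+")
            elif motif in MINUS_MOTIFS:
                calls.add("-")
            else:
                calls.add("?")
        if op in "MDN=X":  # operations that consume reference bases
            ref_pos += length
    if calls == {"+"}:
        return "+"
    if calls == {"-"}:
        return "-"
    return None
```

For a read with several junctions, the sketch only makes a call when every junction agrees; anything else is left undetermined.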

**jnfass** · 05-07-2012, 12:44 PM

I should have mentioned that I found my issue. I was careless before in saying that all my spliced alignments had XS:A:[+-] tags. Some of them instead have XS:A:? tags (presumably where the transcript's strand couldn't be determined from the sequence at the edges of the splice?), and once I removed these undetermined XS tags, Cufflinks no longer gave me that error. Hope this helps someone.
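For anyone wanting to do the same cleanup, here is a minimal sketch of dropping the undetermined tags from SAM text. The function name is made up, and it assumes the tag appears exactly as `XS:A:?` among the optional fields. It leaves header lines and all other fields untouched.

```python
def strip_undetermined_xs(line):
    """Remove an XS:A:? optional field from one SAM line, leaving the rest as-is."""
    if line.startswith("@"):  # header lines pass through unchanged
        return line
    newline = "\n" if line.endswith("\n") else ""
    fields = line.rstrip("\n").split("\t")
    # Columns 1-11 are mandatory; only the optional tags after them are filtered.
    kept = fields[:11] + [tag for tag in fields[11:] if tag != "XS:A:?"]
    return "\t".join(kept) + newline
```

Applied line by line while copying one SAM file to another, this reproduces the removal described above. Whether to drop those records entirely instead is a separate judgment call.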


manual XS:A[-|+] assignment for cufflinks
