Announcement
Collapse
No announcement yet.
X
-
Thanks for your reply. I wasn't sure whether Cufflinks would continue in the wrong way or give an error message, had it been wrong.
-
If cufflinks runs then you shouldn't have any problems arising from the sorting. It sorted the chromosomes as strings, not as numbers, but as long as the positions are sorted numerically, it should be fine, and they are (it would have given you an error otherwise)
Leave a comment:
-
Cufflinks SAM file sort problem
The Cufflinks manual states that SAM files should be sorted according to the following:
Code:sort -k 3,3 -k 4,4n hits.sam > hits.sam.sorted
Code:chr1 chr11 [...] chr19 chr2 chr20 chr3 [...] chr9
I've been running Cufflinks with my SAM files ordered like this, but I've no idea if it will make a difference or not.
Note: I started with Bioscope BAM files (PE, strand-specific), which were converted to SAM with SAMtools. The 'XS:A:' field was added based on strand info from field 2. A sample of my SAM files is below:
Code:1384_723_1125 0 chr1 121 0 25M * 0 0 CATTTTCCTCTAGAGTCAGAAACGN IH8IIIIIIIIIEII77IIIIHEI! NH:i:0 RG:Z:20100828211420290 CS:Z:G3130002022232221112200133 CQ:Z:BB'2BBB?2BB?42BB776BA:/75 SM:i:0 CM:i:2 XS:A:+ 200_1536_1533 73 chr1 7467 1 18H28M4H * 0 0 GTTTTTCCTAATTTGATATTTAAAAAAA //-.2.**787;033""".*)--4F>., NH:i:0 RG:Z:20100828211420290 CS:Z:T12132211201311202001000020230300122130030000002000 CQ:Z:6??<=?><;A;AA?/-%,)')%*)&%&2'1+&.&%))&%%)%07('&2-* SM:i:2 CM:i:2 XS:A:+ 2234_1292_1060 129 chr1 8334 33 25M chr8 47073575 0 GAGATCCCCAAGAATCCTTACCTTT +EIII))))519IIIA5:8/%:D4& NH:i:1 RG:Z:20100828211420290 CS:Z:G0222320001022032020320200 CQ:Z:'%ABB<)=>)-%5=B=%1*/<%6/& SM:i:1 CM:i:3 XS:A:+ 21_385_199 89 chr1 10073 0 3H27M20H * 0 0 AGCCCCGAAAAAAAAAATAAATATCAG 72/@I91=B?@4/03E@<II%%,/(0I NH:i:0 RG:Z:20100828211420290 CS:Z:T01223203301132231023222233100330000000002300032030 CQ:Z:&,87'*/%%%.*-((/%&+*5:0((%%684)8.&+%01/4*(28)',,)6 SM:i:3 CM:i:2 XS:A:-
Tags: None
Latest Articles
Collapse
-
by seqadmin
At the intersection of cytogenetics and genomics lies the exciting field of cytogenomics. It focuses on studying chromosomes at a molecular scale, involving techniques that analyze either the whole genome or particular DNA sequences to examine variations in structure and behavior at the chromosomal or subchromosomal level. By integrating cytogenetic techniques with genomic analysis, researchers can effectively investigate chromosomal abnormalities related to diseases, particularly...-
Channel: Articles
09-26-2023, 06:26 AM -
-
by seqadmin
Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...-
Channel: Articles
09-07-2023, 11:15 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 09:36 AM
|
0 responses
6 views
0 likes
|
Last Post
by seqadmin
Today, 09:36 AM
|
||
Started by seqadmin, Yesterday, 07:14 AM
|
0 responses
11 views
0 likes
|
Last Post
by seqadmin
Yesterday, 07:14 AM
|
||
Started by seqadmin, 09-29-2023, 09:38 AM
|
0 responses
13 views
0 likes
|
Last Post
by seqadmin
09-29-2023, 09:38 AM
|
||
Started by seqadmin, 09-27-2023, 06:57 AM
|
0 responses
14 views
0 likes
|
Last Post
by seqadmin
09-27-2023, 06:57 AM
|
Leave a comment: