Hi,
Can you help me with this? I have paired-end reads in SAM format, and I have followed all the steps to create the sorted SAM file according to the pasilla tutorial. I have successfully created the maize GTF file using the first Python script. This is what my SAM file looks like:
Code:
GALZUI2_0001:8:100:0:142#0/1 4 * 0 0 * * 0 0 NCCTGGTGGAGACCGGAGGAGCCTCGGCAGAGATCG #0--011+++858::7=386>?;=:9?9==8??### RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:0:1654#0/1 16 chr1 128019411 0 35M1S * 0 0 TTTTCTCTGTTGATATTTCAATCTTCTTCCTCAGAN FFFFFFFFFFFFFF=FFFFFF??===44574,,00# MD:Z:35 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:0:1914#0/1 0 chr5 84824688 0 1S24M240N11M * 0 0 NTTTGCAGCTGATGCTGAGAGCAAGATTGTCCCTGC #################################### MD:Z:8X26 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1 YS:A:+
GALZUI2_0001:8:100:0:1953#0/1 4 * 0 0 * * 0 0 NCATAAACGATGCCGACCAGGGATCAGCGAGATCGG #################################### RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:0:1957#0/1 0 chr8 166503511 0 1S35M * 0 0 NACAAGGTAGGCCTCAGCCGCCTCCTGCAGCGCGGA #################################### MD:Z:35 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:0:2018#0/1 0 chr8 48995004 0 1S35M * 0 0 NAAGGGTATAACATCTCTGATGTTCTCCATTCCGGT #/-,+//.,,9:???==<<=FFF<<=:=?=881:8= MD:Z:35 RG:Z:s_8_sequence.txt.gz IH:i:2 NH:i:2 YR:i:1
GALZUI2_0001:8:100:0:2018#0/1 0 chr8 49024165 0 1S35M * 0 0 NAAGGGTATAACATCTCTGATGTTCTCCATTCCGGT #/-,+//.,,9:???==<<=FFF<<=:=?=881:8= MD:Z:35 RG:Z:s_8_sequence.txt.gz IH:i:2 NH:i:2 YR:i:1
GALZUI2_0001:8:100:0:287#0/1 4 * 0 0 * * 0 0 NCGGGGTTTCTTATGCGTGGATCCGGGAGATCGGAA #################################### RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:0:356#0/1 0 chr8 175216115 0 1S35M * 0 0 NTCGAATACATGTCCTCTCTTCTGGTTCAGAACACC #################################### MD:Z:35 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:0:362#0/1 4 * 0 0 * * 0 0 NAACAGCATGGATCCACCTTTTTCCCAACCTTTGAG #*+('(++)+8::::1:60:==:<;=====FFF;FF RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:0:434#0/1 16 chr1 300430123 0 35M1S * 0 0 ACCTTACTCTATGCAAGGCATGCCTTACTATCCTGN #################################### MD:Z:35 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:0:596#0/1 0 chr2 104752145 0 1S35M * 0 0 NGATGTGGTTGCGAAGAATGGCATGACGATGGTTGA #################################### MD:Z:35 RG:Z:s_8_sequence.txt.gz IH:i:4 NH:i:4 YR:i:1
GALZUI2_0001:8:100:0:596#0/1 0 chr4 68856217 0 1S35M * 0 0 NGATGTGGTTGCGAAGAATGGCATGACGATGGTTGA #################################### MD:Z:35 RG:Z:s_8_sequence.txt.gz IH:i:4 NH:i:4 YR:i:1
GALZUI2_0001:8:100:0:596#0/1 0 chr5 193863017 0 1S35M * 0 0 NGATGTGGTTGCGAAGAATGGCATGACGATGGTTGA #################################### MD:Z:35 RG:Z:s_8_sequence.txt.gz IH:i:4 NH:i:4 YR:i:1
GALZUI2_0001:8:100:0:596#0/1 0 chr9 90763379 0 1S35M * 0 0 NGATGTGGTTGCGAAGAATGGCATGACGATGGTTGA #################################### MD:Z:35 RG:Z:s_8_sequence.txt.gz IH:i:4 NH:i:4 YR:i:1
GALZUI2_0001:8:100:1000:1005#0/1 16 chr1 31201451 0 36M * 0 0 AAATATGGCACATATCAGGTGAACAGTGACCAAAAC =886A<A?8)CEB=.CFEF88>CEHHDHBHEEEGEG MD:Z:36 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:1000:1011#0/1 4 * 0 0 * * 0 0 GCTACATCGACCTTTCGAAGCGTCGCGAGATCGGAA AGEFBAEGGGECFHHHGCHDHHHFFHDHFHHH?HBC RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:1000:1039#0/1 4 * 0 0 * * 0 0 TCTCATGTGATGAGAAGTAGAACTAGTGGAGAGATC FFFFF:FFEF4EBFGFE>FFEFFFFFFFFFFFFFFF RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:1000:1073#0/1 4 * 0 0 * * 0 0 ACCAGAGCCTGTCCGTGGATGGGACCGGAGATCGGA HHEEHCHEHEGFHHEE=H/HEH9DHFEEF?=@GA## RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:1000:1076#0/1 4 * 0 0 * * 0 0 GACAAGTTGGCCCACCAGAATATGAGCCTACAGGAA GH1HHHEHHHHHHHFFFF6FFFFFFFF?FFF-FEF< RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:1000:1089#0/1 4 * 0 0 * * 0 0 GCTCGATGGCGGATGAAAATCAGGCAGATCGGAAGA HHCHHEHDHHF@EFF6=C;AE?BFFEE?EFBDA?FF RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:1000:1099#0/1 16 chr9 15614973 0 2S34M * 0 0 TCTCAACGTTTGAAGAAAAACCGTGAGATATACCGG 8HHHGHGE=BCHHHHFGDB6EHFGEHHBDHD?GFFG MD:Z:34 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:1000:11#0/1 4 * 0 0 * * 0 0 GTGGAGACGCAGGCGTGGAAGAGATCGGAAGAGCGG AAA@>6>@/<DG;C?GGGDGGCGGCEEGEGGGGAGG RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:1000:1145#0/1 0 chr6 147883848 0 32M4S * 0 0 ATTGTCGGCAACGGCGGGAAGCACCGCTGCCCCGCC FGFG>DDHDHGEFADA#################### MD:Z:32 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:1000:1173#0/1 16 chr1 269036723 0 36M * 0 0 TGTCAGGGACATGAAGGAGAAGCTCGCCTACATTGC FD?BDFCC?6<<CC@>C6:1AGGEAG???7@AAC@= MD:Z:36 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:1000:1195#0/1 16 chr3 135445725 0 36M * 0 0 ACGGCGCCTGCCGCAAAGATCATAGATACAGTTGGA ###@?=6.GGCCGGGFG6GDGGGGGCGGCGGGGGGG MD:Z:36 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:1000:121#0/1 0 chr3 3313715 0 34M2S * 0 0 GCCTCACTGAATATTCCAGCAGCTGTTGGCTGGGAG HHHHHHHHHHIHHHHHHHHHHHHHHHHHHHHHHFGH MD:Z:34 RG:Z:s_8_sequence.txt.gz IH:i:3 NH:i:3 YR:i:1
GALZUI2_0001:8:100:1000:121#0/1 0 chr8 25015676 0 34M2S * 0 0 GCCTCACTGAATATTCCAGCAGCTGTTGGCTGGGAG HHHHHHHHHHIHHHHHHHHHHHHHHHHHHHHHHFGH MD:Z:34 RG:Z:s_8_sequence.txt.gz IH:i:3 NH:i:3 YR:i:1
GALZUI2_0001:8:100:1000:121#0/1 16 chr4 63782093 0 2S34M * 0 0 CTCCCAGCCAACAGCTGCTGGAATATTCAGTGAGGC HGFHHHHHHHHHHHHHHHHHHHHHHIHHHHHHHHHH MD:Z:34 RG:Z:s_8_sequence.txt.gz IH:i:3 NH:i:3 YR:i:1
GALZUI2_0001:8:100:1000:1217#0/1 16 chr10 68420921 0 36M * 0 0 ACCTGAAGAGTGTTAGGGAATTGATCTACAAAAGAG 50?EEEGEGEGEEBGED??@;CDGGDC?CCBBD=DD MD:Z:36 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:1000:1249#0/1 0 chr1 222562953 0 36M * 0 0 AAAGGATGAGACCAGAGAATCATAGCAATCAGCTCA HHGCHH@HHGDHHEHAGE5GHHHHHHF7HHHHEHHE MD:Z:36 RG:Z:s_8_sequence.txt.gz IH:i:3 NH:i:3 YR:i:1
GALZUI2_0001:8:100:1000:1249#0/1 16 chr2 41634471 0 36M * 0 0 TGAGCTGATTGCTATGATTCTCTGGTCTCATCCTTT EHHEHHHH7FHHHHHHG5EGAHEHHDGHH@HHCGHH MD:Z:36 RG:Z:s_8_sequence.txt.gz IH:i:3 NH:i:3 YR:i:1
GALZUI2_0001:8:100:1000:1249#0/1 16 chr5 161906550 0 36M * 0 0 TGAGCTGATTGCTATGATTCTCTGGTCTCATCCTTT EHHEHHHH7FHHHHHHG5EGAHEHHDGHH@HHCGHH MD:Z:36 RG:Z:s_8_sequence.txt.gz IH:i:3 NH:i:3 YR:i:1
GALZUI2_0001:8:100:1000:1371#0/1 0 chr10 89544264 0 35M1S * 0 0 CTTATGCTTCACTTTTACTATAGGCTCAGAACTTTT GGGFAHHHHHHHHHHHHGGHHHHGHHHGHHHHHHHE MD:Z:35 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:1000:1465#0/1 4 * 0 0 * * 0 0 CGGTTCAGCAGGAATGCCGAGATCGGAAGAGCGGTT >BBEEBEEEF3FCC@FFFEFFFFFBFEFFF=FBFFE RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:1000:1489#0/1 16 chr2 176558351 0 36M * 0 0 CGTGACAAGTGCAGGAAACAAACCACTGAAAAGAAT @=DA@6F<<@>F?DDG>7C=?6==4;A@6ADEBEBE MD:Z:36 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:1000:1496#0/1 4 * 0 0 * * 0 0 CCAGATCAGCGTCGACTCATTTCGGGAGATCGGAAG GHEHHHHHFHGGGG??AG=GEGGGGGGGDGBGEGGE RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:1000:1536#0/1 16 chr9 94009423 0 36M * 0 0 AGTCAAATGTAACAACTGGTTTTAGCTTGATCTCTT FHHHHHHGHHHEHHHHFHHHHHHHGHHHHHHHHHHH MD:Z:36 RG:Z:s_8_sequence.txt.gz IH:i:1 NH:i:1
GALZUI2_0001:8:100:1000:1555#0/1 4 * 0 0 * * 0 0 CGTCGTTGGGGTAGTAGACGGCAGATCGGAAGAGCG FFBBFFHHHFHHHFEHHHHAHHHHHFDHHHFHHGHH RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
GALZUI2_0001:8:100:1000:1559#0/1 4 * 0 0 * * 0 0 GCAGATTTCACCAAGTGTTGGATTGTTCAGATCGAA HHDGHFHHHHHFH@B?HGHCHHHHE:HHFFHEHHFH RG:Z:s_8_sequence.txt.gz IH:i:0 NH:i:0 YU:Z:unmapped
I'm getting the following error. Although the reads are paired-end, the program does not recognize them as paired-end. How do I get the program to run? If I don't specify the parameter (i.e., I leave out --pair=yes), it gives me an output with all 0 counts.
Code:
  File "dexseq_count.py", line 132, in <module>
    for af, ar in HTSeq.pair_SAM_alignments( HTSeq.SAM_Reader( sam_file ) ):
  File "/usr/lib64/python2.6/site-packages/HTSeq-0.5.4p3-py2.6-linux-x86_64.egg/HTSeq/__init__.py", line 612, in pair_SAM_alignments
    raise ValueError, "'pair_alignments' needs a sequence of paired-end alignments"
ValueError: 'pair_alignments' needs a sequence of paired-end alignments
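
One thing that may be worth checking is the FLAG column (the second field) of the SAM records. In the SAM format, bit 0x1 of the flag marks a read as belonging to a pair, and the records shown in this post carry flags 0, 4 and 16, none of which has that bit set. That would be consistent with the "needs a sequence of paired-end alignments" error, though I have not confirmed it is the cause. Below is a minimal sketch for counting how many records are flagged as paired, assuming whitespace-separated SAM lines; `count_paired_flags` is a hypothetical helper written for this post, not part of HTSeq or DEXSeq.

```python
def count_paired_flags(sam_lines):
    """Return (paired, total) for the alignment records in sam_lines."""
    paired = 0
    total = 0
    for line in sam_lines:
        # Skip blank lines and SAM header lines, which start with '@'.
        if not line.strip() or line.startswith("@"):
            continue
        flag = int(line.split()[1])  # FLAG is the second column
        total += 1
        if flag & 0x1:  # bit 0x1: read is part of a pair
            paired += 1
    return paired, total


# First three columns of three records from this post (remaining columns omitted).
sample = [
    "GALZUI2_0001:8:100:0:142#0/1 4 *",
    "GALZUI2_0001:8:100:0:1654#0/1 16 chr1",
    "GALZUI2_0001:8:100:0:1914#0/1 0 chr5",
]
print(count_paired_flags(sample))  # (0, 3): no record has the paired bit set
```

To run it on a whole file, pass an open file handle instead of the list, e.g. `count_paired_flags(open("sorted.sam"))`, where `sorted.sam` is a placeholder for the actual file name.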