Header Leaderboard Ad

Collapse

Help with sorting sam files in python!

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Help with sorting sam files in python!

    Hello All!

    I am new to coding. I have written a python script to analyze a bunch of fastq files from RNASeq analysis. I am using the subprocess module to call bowtie2 in order to index the genome and align the unpaired short reads to the genome.My script generates a sam file which needs to be sorted before running the cuffdiff module. The linux sort command works perfectly on my mac, however, in spite of installing the Unxupdates on my windows at work, I am unable to make my script work. I am hoping for some help/advice here to sort my sam file in python. Here is the script I tried.

    #!/usr/bin/env python2.7

    import os, sys

    inputFile = open(‘Outputfile1.txt’, 'r')

    lineList = inputFile.readlines()

    for line in sorted(lineList):

    print(line.rstrip());

    With this script, the stdout is written to the screen. But this recognizes only the text file. Sam files are not recognized.

    Any help is much appreciated.

    Thanks in advance

  • #2
    I am not familiar with Python, but why do you need it? This forum seems to work with PHP and I know it a little bit so if you will need PHP help, I can do it for you. Just let me know.

    Comment


    • #3
      Here's an example of how you can sort a SAM file in Python using the pysam library:

      code:
      import pysam # Open the input SAM file samfile = pysam.AlignmentFile("input.sam", "r") # Create an output file for the sorted SAM data sorted_samfile = pysam.AlignmentFile("sorted.sam", "wb", template=samfile) # Sort the input SAM file by read name for read in samfile.fetch(until_eof=True): sorted_samfile.write(read) # Close the input and output files samfile.close() sorted_samfile.close()
      This code will open an input SAM file called "input.sam", sort the reads by read name, and write the sorted data to an output file called "sorted.sam".


      You can also sort the SAM file by other criteria such as reference name, position, etc. by using the appropriate sort function from pysam. For example, to sort the SAM file by reference name and position, you can use the following code:


      code:
      sorted_samfile = pysam.AlignmentFile("sorted.sam", "wb", template=samfile) sorted_samfile.sort("-o", "sorted.sam", "input.sam") sorted_samfile.close()
      This code will use the pysam sort function to sort the input SAM file by reference name and position and write the sorted data to the output file "sorted.sam".

      Regards
      Jamal Shah

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Improved Targeted Sequencing: A Comprehensive Guide to Amplicon Sequencing
        by seqadmin



        Amplicon sequencing is a targeted approach that allows researchers to investigate specific regions of the genome. This technique is routinely used in applications such as variant identification, clinical research, and infectious disease surveillance. The amplicon sequencing process begins by designing primers that flank the regions of interest. The DNA sequences are then amplified through PCR (typically multiplex PCR) to produce amplicons complementary to the targets. RNA targets...
        03-21-2023, 01:49 PM
      • seqadmin
        Targeted Sequencing: Choosing Between Hybridization Capture and Amplicon Sequencing
        by seqadmin




        Targeted sequencing is an effective way to sequence and analyze specific genomic regions of interest. This method enables researchers to focus their efforts on their desired targets, as opposed to other methods like whole genome sequencing that involve the sequencing of total DNA. Utilizing targeted sequencing is an attractive option for many researchers because it is often faster, more cost-effective, and only generates applicable data. While there are many approaches...
        03-10-2023, 05:31 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Yesterday, 11:44 AM
      0 responses
      8 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-24-2023, 02:45 PM
      0 responses
      18 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-22-2023, 12:26 PM
      0 responses
      18 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-17-2023, 12:32 PM
      0 responses
      19 views
      0 likes
      Last Post seqadmin  
      Working...
      X