Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Editing SAM files

    Dear all,

    I want to remove reads that map to ribosomal RNAs. I tried to modify the sam file (output of TopHat), and remove the reads that correspond to rRNAs. However, I could not run cufflinks on the modified sam files. Then I tried to sort them by converting them to bam (using samtools), but even then, it still did not work. I got the following error messages from cuffdiff.

    [22:39:41] Inspecting maps and determining fragment length distributions.
    SAM error on line 66185: CIGAR op has zero length
    SAM error on line 94994: CIGAR op has zero length
    SAM error on line 135374: CIGAR op has zero length

    Has anyone seen this error? What does it mean?

    What would you recommend to remove ribosomal reads?

    thanks,
    Grace

  • #2
    How did you removed the reads from SAM which correspond to rRNA?

    Comment


    • #3
      I just looked at the coordinates of each SAM output, and removed lines that overlapped with rRNA locations, as I read in each line.

      I reasoned that since my SAM file was sorted before editing, it should have remained sorted if I just moved certain lines.

      Comment


      • #4
        Regarding the error message you got, there is some discussion at:http://seqanswers.com/forums/showthread.php?t=3551
        If you moving out the reads (from sam file) corresponding to rRNA, you can remove them from your original fastq file (using read IDs) and then re-run the alignment.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 11:49 AM
        0 responses
        15 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-24-2024, 08:47 AM
        0 responses
        16 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        61 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Working...
        X