  • Recovering the original 5' position of reverse strand reads from SAM files

    Hello everyone,

    I need to get the precise 5' position of my mapped directional RNA-seq reads. The problem is that, for reads that map to the reverse strand, the position reported in the BAM/SAM output of my mapping software (TMAP) seems to be the left-most position relative to the reference, i.e. the last *aligning* base of the read. In other words, it's the 3'-most base (using the orientation of the read) after clipping non-aligning bases.

    I've tried just adding the length of the read onto the position, but this fails for reads with bad 3' ends or several indels in the alignment. I suppose I could write a script to count up the number of "S"s etc. in the CIGAR string, but I am a wet-lab person and the thought of that just makes me sad.

    Is there a better way to recover the original 5' position of the seed alignment for reverse reads? Thanks in advance.

  • #2
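    The left position is just the POS field; the right position is POS plus the sum of the lengths of the M/=/X/D/N operations in the CIGAR string, minus 1 (POS is 1-based and the right position is inclusive). The S operations are irrelevant here, because soft-clipped bases do not consume any reference positions.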

    If you are using a SAM/BAM library to process your data, there will probably be a method/function call already available to do this for you.
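
For anyone who would rather not pull in a library, the rule above can be sketched in a few lines of plain Python. This is only a minimal sketch, not TMAP-specific: the function names (`reference_span`, `five_prime_position`, `five_prime_from_sam_line`) are made up for illustration, and it assumes standard tab-separated SAM records with FLAG in column 2, POS in column 4 and CIGAR in column 6.

```python
import re

# CIGAR operations that consume reference bases according to the SAM spec.
# I (insertion), S (soft clip), H (hard clip) and P (padding) do not.
REF_CONSUMING = set("MDN=X")
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def reference_span(cigar):
    """Return the number of reference bases covered by the alignment."""
    return sum(int(n) for n, op in CIGAR_OP.findall(cigar) if op in REF_CONSUMING)


def five_prime_position(pos, flag, cigar):
    """Return the 1-based reference coordinate of the read's 5'-most aligned base.

    Returns None for unmapped reads or reads without a CIGAR string.
    """
    if flag & 0x4 or cigar == "*":
        return None
    if flag & 0x10:
        # Reverse strand: the 5' end of the read is the right-most aligned base.
        return pos + reference_span(cigar) - 1
    # Forward strand: the 5' end is simply POS.
    return pos


def five_prime_from_sam_line(line):
    """Hypothetical helper: parse one SAM alignment line (not a header line)."""
    fields = line.rstrip("\n").split("\t")
    flag, pos, cigar = int(fields[1]), int(fields[3]), fields[5]
    return five_prime_position(pos, flag, cigar)
```

For a reverse-strand read with POS 100 and CIGAR `5S20M2D10M1I8M3S`, the reference span is 20 + 2 + 10 + 8 = 40, so the 5' position comes out as 139; the soft clips and the insertion are ignored. If you do use a library, pysam's `AlignedSegment` exposes `reference_start`, `reference_end` and `is_reverse`, but note that those coordinates are 0-based, so check the conventions before mixing them with raw POS values.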
