Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • RedLightPanic
    Junior Member
    • Jan 2013
    • 9

    How does bedtools - bamToBed handle S operation in cigar string?

    Hi,

    I've been recently using bamToBed utility from the bedtools suite in order to transform the alignments in a BAM file into features of a BED file.

    I have one question regarding the way bamToBed handles the "S" operation in the cigar string.

    For example, I have this alignment (in BAM/SAM format, obtained using STAR mapper. I'm omitting certain not relevant parts of the alignment):
    Code:
    HANNIBAL_0054_FC62M9BAAXX:7:53:2326:12457#0/1   0   chr1   28166   251   4S25M7S   *   0   0   ...
    that bamToBed transforms to the following BED feature:
    Code:
    chr1    28165   28190   HANNIBAL_0054_FC62M9BAAXX:7:53:2326:12457#0/1   251     +
    I was just wondering why the BED feature is 25 nts. long (25M), and starts at the same position of the BAM alignment. Considering that the first operation in the cigar string is 4S, shouldn't the BED feature start 4 nucleotides after the position of the BAM alignment?

    Thanks a lot in advance.
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    The start position in the SAM file refers to the 5th base of the read in your example. Soft-clipped bases don't offset the start of the mapped position in SAM.

    Comment

    • RedLightPanic
      Junior Member
      • Jan 2013
      • 9

      #3
      Than you so much Devon.

      Didn't find this in the sam specs.

      Comment

      • dpryan
        Devon Ryan
        • Jul 2011
        • 3478

        #4
        Yeah, the SAM spec is unfortunately not always very explicit. This is only covered in an example on the first page rather than the main part of the spec.

        Comment

        Latest Articles

        Collapse

        • GATTACAT
          Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by GATTACAT
          Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
          07-01-2026, 11:43 AM
        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 07-02-2026, 11:08 AM
        0 responses
        10 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-30-2026, 05:37 AM
        0 responses
        13 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-26-2026, 11:10 AM
        0 responses
        20 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        54 views
        0 reactions
        Last Post SEQadmin2  
        Working...