Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Calculating total no. of multimapped reads using samtools for RNA-Seq data analysis

    Dear all ,

    please excuse me if my questionaire is wrong.

    I am confused in estimating proper command for calculating number of multimapped reads using samtools for my paired end RNA-Seqdata, when i tried searching forums , i found two
    threads for calculating total no. of multimapped reads

    Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc

    Application of sequencing to RNA analysis (RNA-Seq, whole transcriptome, SAGE, expression analysis, novel organism mining, splice variants)


    I tried using both , the results were very different , please suggest me proper command to calculate multimapped reads using samtools

  • #2
    I'd try filtering with FLAG 0x100 or 256. Sometimes I have seen inconsistencies depending on the mapper being used (e.g GEM mapper).

    Comment


    • #3
      I have used Tophat for mapping my genomes .

      Comment


      • #4
        If I recall correctly, I wasn't able to actually find this in the TopHat documentation, but take a look at this discussion. According to the top answer, the mapping quality encodes the number of mappings.

        255 = unique mapping
        3 = maps to 2 locations in the target
        1 = maps to 3-4 locations
        0 = maps to 5 or more locations (up to the number defined in "--prefilter-multihits")
        So I guess you could do something like

        Code:
        samtools view file.bam | awk '$5 < 255 {count++} END {print count}'
        Edit: this might be inefficient, I'm really not sure. But I think it'll get the job done.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 08:47 AM
        0 responses
        10 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        57 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 09:21 AM
        0 responses
        53 views
        0 likes
        Last Post seqadmin  
        Working...
        X