  • Filippo
    Junior Member
    • Dec 2011
    • 2

    badly sorted BAM

    Hi,

    I have some BAM files in which the chromosomes are sorted as character strings, e.g. the first chromosome is 10, the second is 11, and the last two are 8 and 9.

    What is the quickest way to fix this? I tried samtools sort with no success.

    Thanks!
  • svl
    Member
    • Sep 2009
    • 43

    #2
    Hi Filippo, I haven't tried this myself, but perhaps you can use SortSam in Picard tools.

    Why do you want this specific order in an indexed file? Is it required by some tool?


    • Filippo
      Junior Member
      • Dec 2011
      • 2

      #3
      Hi, thanks for the answer, I will try it.
      Do you think it would automatically sort the chromosomes or do I have to specify something?

      I need them in a regular order (1,2,3 ... 22,X,Y) so that the following files (for example pileups made out of the bam) will also be in the regular order. The scripts that I made so far go crazy if they get chromosome 10 instead of 1.
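
If the downstream scripts only need to visit chromosomes in the regular order, one option is to sort the names with a custom key instead of re-sorting the BAM itself. The following is a minimal Python sketch, not taken from the thread: the function name `chrom_sort_key`, the optional `chr` prefix handling, and the placement of M/MT after Y are all assumptions made for illustration.

```python
import re

def chrom_sort_key(name):
    """Hypothetical sort key: numeric chromosomes first (1..22), then X, Y, M/MT, then the rest."""
    # Strip an optional "chr" prefix so "chr10" and "10" are treated alike.
    bare = re.sub(r"^chr", "", name, flags=re.IGNORECASE)
    if bare.isdecimal():
        # Compare numeric names as integers, so 2 comes before 10.
        return (0, int(bare), "")
    special = {"X": 0, "Y": 1, "M": 2, "MT": 2}
    if bare.upper() in special:
        return (1, special[bare.upper()], "")
    # Anything else (unplaced contigs, etc.) goes last, in plain string order.
    return (2, 0, bare)

# Names in the character-sorted order described above.
names = ["10", "11", "1", "2", "X", "8", "9", "Y", "22"]
ordered = sorted(names, key=chrom_sort_key)
print(ordered)
```

A script could loop over `ordered` and fetch each chromosome's records in turn, which avoids depending on the order stored in the file.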


      • gringer
        David Eccles (gringer)
        • May 2011
        • 845

        #4
        Originally posted by Filippo:
        I need them in a regular order (1,2,3 ... 22,X,Y) so that the following files (for example pileups made out of the bam) will also be in the regular order. The scripts that I made so far go crazy if they get chromosome 10 instead of 1.
        Then you should probably change your scripts so that they can at least handle what samtools does. The people who write samtools also write the specification for SAM/BAM, so you need to be able to handle that format.

        The SAM specification says that the coordinate sort order in a SAM/BAM file must be based on the order of SQ lines in the header (p. 2, tag SO).
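
To illustrate why a coordinate sort keeps chromosome 10 first when the header lists it first, here is a small Python sketch. It works on plain header text and `(reference name, position)` tuples rather than a real BAM file; the header contents, the `LN` values, the records, and the helper name `sq_order` are all made up for the example.

```python
def sq_order(header_text):
    """Map each reference name to its rank, following the order of @SQ lines."""
    rank = {}
    for line in header_text.splitlines():
        if line.startswith("@SQ"):
            # Header fields are tab-separated TAG:VALUE pairs.
            fields = dict(f.split(":", 1) for f in line.split("\t")[1:])
            rank[fields["SN"]] = len(rank)
    return rank

# Made-up header whose @SQ lines are in character order (10 before 8 and 9).
header = (
    "@HD\tVN:1.4\tSO:coordinate\n"
    "@SQ\tSN:10\tLN:1000\n"
    "@SQ\tSN:8\tLN:1000\n"
    "@SQ\tSN:9\tLN:1000\n"
)

# Made-up alignments as (reference name, position) pairs.
records = [("9", 500), ("10", 200), ("8", 50), ("10", 100)]

rank = sq_order(header)
# Coordinate order: first by @SQ rank, then by position.
sorted_records = sorted(records, key=lambda rec: (rank[rec[0]], rec[1]))
print(sorted_records)
```

Under these assumptions, the records on chromosome 10 come out first because that is where 10 sits in the header, so getting a different chromosome order means changing the order of the @SQ lines, not just re-running the sort.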
