Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • sunsnow86
    Member
    • Jul 2010
    • 17

    how to split BED file according to chromsome

    Does anyone know a program which can split BED file according to the chromosome? I have generate a BED file which contains the data for all chromosome, but it is not sorted. When I did sorting using BedSort, the output was not ordered according the numeric order, it always give chr10 on the top and then followed chr11, up to chr19. It seems I have to do the sorting for each chr respectively, I wonder whether there is a program which can split BED file according to the chromosome. Thanks
  • zee
    NGS specialist
    • Apr 2008
    • 249

    #2
    You could try the following with your bed file:

    Code:
    sort -k 1V,1 -k 2n,2 file.bed -o file.sorted.bed
    if you want to split your bed file you could do with bash:

    Code:
    mkdir -p split_results
    for chr in `cut -f 1 file.bed | sort | uniq`; do
                    grep -w $chr file.bed > split_results/$chr.output.bed
    
    done

    Comment

    • adamdeluca
      Member
      • Jul 2010
      • 95

      #3
      An alternative:
      Code:
      awk '{close(f);f=$1}{print > f".bed"}'

      Comment

      • quinlana
        Senior Member
        • Sep 2008
        • 119

        #4
        Similar to adamdeluca's suggestion, here is another simple awk solution. Note that the ">>" creates and appends to files named CHROM.bed, where CHROM is column 1 of the bed input bed file (in this case, example.bed).

        So, in plain English, the awk command prints each entire line ($0) from example.bed to distinct files that are each named by the chrom field ($1).

        This strategy is useful in many other cases where you want to do a context-based "grep", and route the results to distinct files.

        Code:
        $ awk '{print $0 >> $1".bed"}' example.bed
        
        $ ls -1 *.bed
        chr1.bed
        chr2.bed
        ... (snip)
        chrY.bed
        example.bed
        arq

        Comment

        • sunsnow86
          Member
          • Jul 2010
          • 17

          #5
          Thank you !

          Many thanks to you guys! I have worked it out.

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM
          • SEQadmin2
            Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
            by SEQadmin2


            With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


            Introduction

            Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
            05-22-2026, 06:42 AM
          • SEQadmin2
            Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
            by SEQadmin2

            Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


            Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
            05-06-2026, 09:04 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, Today, 08:59 AM
          0 responses
          10 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-02-2026, 12:03 PM
          0 responses
          21 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-02-2026, 11:40 AM
          0 responses
          17 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 05-28-2026, 11:40 AM
          0 responses
          31 views
          0 reactions
          Last Post SEQadmin2  
          Working...