
  • Filter sequence from bed file

    Hello,

    I am very new to bioinformatics. I have a BED file and I want to keep only the sequences that start with GGG. I tried grep, but it returned every sequence that contains GGG anywhere.

    This is example file:

    chr1 3165857 3165877 GGGGGGGTCGCCTTTAATAC_494 559.876 +
    chr1 3172959 3172979 ACGAGGGGGGTCATCTTTTT_1280 166.748 -
    chr1 3176088 3176108 ATCGAGGGGGTGATGTTTTT_2924 29.7413 +
    chr1 3207150 3207170 CCGGGGGAATCGACTTTGGA_265 795.823 -
    chr1 3207151 3207171 ACCGGGGGAATCGACTTTGG_186 884.041 -
    chr1 3207154 3207174 CCGACCGGGGGAATCGACTT_182 888.415 -
    chr1 3220405 3220425 TTGGGTGGGGGGCAGAGTCT_273 786.893 +

    Is there any way to tell grep (or another tool) to match only at the beginning of the string?

    Thanks,
    Alan

  • #2
    If you want to exclude things that begin with GGG then do this (fields separated by space):

    Code:
    $ awk -F " " '$4 !~ /^GGG/ {print $0}' yourfile > new_file
    If you want to keep things that start with GGG, then do this:

    Code:
    $ awk -F " " '$4 ~ /^GGG/ {print $0}' yourfile > new_file
    If your file is tab-delimited then use

    Code:
    $ awk -F "\t" '$4 ~ /^GGG/ {print $0}' yourfile > new_file
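Since the question asked about grep specifically, here is a minimal sketch that runs both approaches on the example rows from the question. The file names (example.bed, starts_with_ggg.bed, starts_with_ggg_grep.bed) are made up for illustration, and the sample is assumed to be space-separated with no leading whitespace. A plain grep '^GGG' would anchor to the start of the line (the chromosome column), so the grep pattern first skips three whitespace-delimited fields.

```shell
# Sample rows copied from the question (hypothetical file name).
cat > example.bed <<'EOF'
chr1 3165857 3165877 GGGGGGGTCGCCTTTAATAC_494 559.876 +
chr1 3172959 3172979 ACGAGGGGGGTCATCTTTTT_1280 166.748 -
chr1 3176088 3176108 ATCGAGGGGGTGATGTTTTT_2924 29.7413 +
chr1 3207150 3207170 CCGGGGGAATCGACTTTGGA_265 795.823 -
chr1 3207151 3207171 ACCGGGGGAATCGACTTTGG_186 884.041 -
chr1 3207154 3207174 CCGACCGGGGGAATCGACTT_182 888.415 -
chr1 3220405 3220425 TTGGGTGGGGGGCAGAGTCT_273 786.893 +
EOF

# awk: with no -F, fields are split on any run of spaces or tabs,
# and a pattern without an action prints the matching line.
awk '$4 ~ /^GGG/' example.bed > starts_with_ggg.bed

# grep: from the start of the line, skip exactly three
# "non-whitespace then whitespace" fields, then require GGG.
grep -E '^([^[:space:]]+[[:space:]]+){3}GGG' example.bed > starts_with_ggg_grep.bed
```

On this sample both commands keep only the first row, the one whose fourth column is GGGGGGGTCGCCTTTAATAC_494. The awk form is usually easier to read and to adapt if the sequence lives in a different column.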
