Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • joseangelivan
    Junior Member
    • May 2016
    • 5

    Help

    Hi everyone

    I have a file 1 with a column of several NCBI ID's like NC_014251.1 and a file B with several lines containing ID's and the description, i want to generate a file C with my IDs from file 1 ans its description from file B

    for one record i did this
    grep -w "NC_014251.1" ident_allbac160615.txt > NNSV15DNABNNSV15RNAB_R1R1q20allhg38nmIDsnum1.txt

    but when i am trying to do it for all file A with
    grep -wf NNSV15DNABNNSV15RNAB_R1R1q20allhg38nmIDsnum.txt ident_allbac160615.txt > NNSV15DNABNNSV15RNAB_R1R1q20allhg38nmIDsnumA.txt

    i am losing IDs and its order (from 34140 just 1440 ), any idea whats going on?

    I will appreciate any help
  • wdecoster
    Member
    • Oct 2015
    • 97

    #2
    Just as a side note, perhaps you should consider a more informative title for your post...

    Comment

    • skbrimer
      Member
      • Mar 2014
      • 55

      #3
      you can use the join command, below is a quick tutorial

      Tutorial on using join, a UNIX and Linux command to join lines of two files on a common field. Examples of joining two files, sorting before joining, specifying a field separator and specifying the output format.

      Comment

      • joseangelivan
        Junior Member
        • May 2016
        • 5

        #4
        Hi everyone

        This time i will try to be more clear, i am using grep to search a string on a file, individually works fine but now i have a file with a column of strings many of them are the same and the result is only one for those that are the same.

        grep -wFf file_A Pattern_file > result


        Like:

        Pattern_file
        aQ2nVEJb
        aQ2nVEJb
        DFGTYHDF
        aQ2nVEJb

        file_A
        aQ2nVEJb 12
        DFGTYHDF 15
        sdfgRTYH 18

        The result is
        DFGTYHDF 15
        aQ2nVEJb 12

        I would like this
        aQ2nVEJb 12
        aQ2nVEJb 12
        DFGTYHDF 15
        aQ2nVEJb 12

        Any idea how to solve this?

        Tks

        Comment

        • skbrimer
          Member
          • Mar 2014
          • 55

          #5
          I'm not sure if grep does duplicates so I don't know how to do it in grep. However if you do this it will work:
          Code:
          sort pattern_file > pattern_file_sorted
          sort file_a > file_a_sorted
          join pattern_file_sorted file_a_sorted > results
          You will get the list you are after.

          Comment

          Latest Articles

          Collapse

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, 06-05-2026, 10:09 AM
          0 responses
          12 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-04-2026, 08:59 AM
          0 responses
          24 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-02-2026, 12:03 PM
          0 responses
          28 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-02-2026, 11:40 AM
          0 responses
          22 views
          0 reactions
          Last Post SEQadmin2  
          Working...