Header Leaderboard Ad

Collapse

Extract the high value column from a big file

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Extract the high value column from a big file

    Hi,

    As I am new in this field. I am trying to get the best score with larger start length from below file. Here, the file header is like Chromosome location, score, and start length. I want top start length w.r.t its score and other details.

    chr9:136028339-136029648-|NM_021996|GBGT1 5.629998 1303 TGCTCAAGTACACTCATTTCA
    chr9:136028339-136029648-|NM_021996|GBGT1 5.629998 1304 GCTCAAGTACACTCATTTCAT
    chr9:136028339-136029648-|NM_021996|GBGT1 13.2 1301 TGTGCTCAAGTACACTCATTT
    chr9:136028339-136029648-|NM_021996|GBGT1 10.8 1302 GTGCTCAAGTACACTCATTTC
    chr12:54735989-54739299+|NM_016057|COPZ1 5.629998 216 GAGCCAGATGCTGAGTACTAT
    chr12:54735989-54739299+|NM_016057|COPZ1 10.8 217 AGCCAGATGCTGAGTACTATG
    chr16:21868579-21893272-|None|None 6.0 473 TTTAATGAGTATTCTGGATTG
    chr16:21868579-21893272-|None|None 6.0 5880 TTGATCCTCCCTTAACCTATC
    chr16:21868579-21893272-|None|None 6.0 5923 CTTCCTATTCCTCCAGCATAC
    chr16:21868579-21893272-|None|None 6.0 6463 TGAAGTCATCTATCTGGTTTG

    I want the output like this
    chr9:136028339-136029648-|NM_021996|GBGT1 13.2 1301 TGTGCTCAAGTACACTCATTT
    chr9:136028339-136029648-|NM_021996|GBGT1 10.8 1302 GTGCTCAAGTACACTCATTTC
    chr12:54735989-54739299+|NM_016057|COPZ1 5.629998 216 GAGCCAGATGCTGAGTACTAT
    chr12:54735989-54739299+|NM_016057|COPZ1 10.8 217
    chr16:21868579-21893272-|None|None 6.0 5923 CTTCCTATTCCTCCAGCATAC
    chr16:21868579-21893272-|None|None 6.0 6463 TGAAGTCATCTATCTGGTTTG


    Any help is much appreciated.


    Thanks

  • #2
    Got it

    Yahoo, I figure it out..... Anyway, any other way if we can do it then also fine...

    My Answer is sort -k1,1 -k3,3nr -k2,2n infile.txt | sort -u -k1,2 --merge

    Comment

    Latest Articles

    Collapse

    • seqadmin
      How RNA-Seq is Transforming Cancer Studies
      by seqadmin



      Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
      09-07-2023, 11:15 PM
    • seqadmin
      Methods for Investigating the Transcriptome
      by seqadmin




      Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

      Whole Transcriptome RNA-seq
      Whole transcriptome sequencing...
      08-31-2023, 11:07 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 09-22-2023, 09:05 AM
    0 responses
    14 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 09-21-2023, 06:18 AM
    0 responses
    12 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 09-20-2023, 09:17 AM
    0 responses
    13 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 09-19-2023, 09:23 AM
    0 responses
    28 views
    0 likes
    Last Post seqadmin  
    Working...
    X