Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • GFF file formatting

    Hello,

    I downloaded some GFF files from ensembl website and only need a small subset of rows in these files. So, I opened them up in Excel and chose whatever I wanted.

    I saved them as tab delimited text files and everything looked ok. But, I am having trouble with downstream analysis.

    My question is, are there any software to check the formatting of my edited GFF files? If so, I will be really happy if you could share them with me.

    Also, is there any better way to edit GFF files than opening them in Excel. I heard line endings could also cause some problems between Mac and Linux.

    Thank you,
    Neel

  • #2
    There are some tools for manipulating gff files;
    If you have a little of experience with perl you could use them.

    Comment


    • #3
      Originally posted by naluru View Post
      Also, is there any better way to edit GFF files than opening them in Excel.
      If you are on linux you can edit text files using grep, gawk, sed, perl, etc.
      There are nice linux tutorials around.

      Originally posted by naluru View Post
      I heard line endings could also cause some problems between Mac and Linux.
      I tried this:
      http://www.google.com/search?hl=en&s...l=&oq=dos2unix
      http://www.google.com/search?hl=en&s...ql=&oq=mac2lin

      Comment


      • #4
        I've tried to do something similar with gff file and it works fine for me. Just need to make sure that the information in each column in excel spreadsheet is ok. I coppied it into 010 Editor and saved as a gff file(I did it only because I had too many rows to put it all into one spreadsheet).

        Comment


        • #5
          Another reason why opening an annotation file with Excel should be avoided: gene names can be automatically changed

          Comment


          • #6
            Quick way to tell if it is an end of line issue - if you type: more my_file from the command line you will see the funky EOL characters that you won't necessarily see just opening the txt file.

            There are methods to fix the problem, but I agree with steven that the best thing is to do it in unix. Most likely grep is what you need (to select certain rows, based on whether they have the pattern you are looking for). cut may also be useful- this will select by column. There are plenty of online guides for these.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Advanced Methods for the Detection of Infectious Disease
              by seqadmin




              The recent pandemic caused worldwide health, economic, and social disruptions with its reverberations still felt today. A key takeaway from this event is the need for accurate and accessible tools for detecting and tracking infectious diseases. Timely identification is essential for early intervention, managing outbreaks, and preventing their spread. This article reviews several valuable tools employed in the detection and surveillance of infectious diseases.
              ...
              11-27-2023, 01:15 PM
            • seqadmin
              Strategies for Investigating the Microbiome
              by seqadmin




              Microbiome research has led to the discovery of important connections to human and environmental health. Sequencing has become a core investigational tool in microbiome research, a subject that we covered during a recent webinar. Our expert speakers shared a number of advancements including improved experimental workflows, research involving transmission dynamics, and invaluable analysis resources. This article recaps their informative presentations, offering insights...
              11-09-2023, 07:02 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Yesterday, 08:23 AM
            0 responses
            8 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 12-01-2023, 09:55 AM
            0 responses
            21 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 11-30-2023, 10:48 AM
            0 responses
            20 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 11-29-2023, 08:26 AM
            0 responses
            15 views
            0 likes
            Last Post seqadmin  
            Working...
            X