  • from CDS/genome sequences to gff file

    I inherited a database of CDS and genome sequences. I wonder if there is an easy way to generate a GFF3 file, or any other version of a GFF file, from them.

    Thanks.

  • #2
    Hi,
    Which database are you referring to? You may have to look carefully, as some databases provide annotation files that give the location of each CDS on the genome. The file may be in another format, which you would then have to convert to GFF3.
    If you cannot find one, you may have to align the CDS to the genome. However, this option has some limitations:
    - The alignment parameters may not guarantee exact locations (e.g., paralogs).
    - Some CDS may not align even though you are sure they are from that genome.

    In any case, this would be the lizard over the crocodile choice.
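
    For illustration only, here is a minimal Python sketch of the conversion step once the CDS have been aligned. It assumes the aligner output has already been parsed into simple records; the record layout (cds_id, seqid, strand, blocks), the function name hits_to_gff3 and the source label are all hypothetical and not tied to any particular tool. A CDS that hits more than one location (e.g. paralogs) is reported rather than written, and a CDS with no hit simply never shows up, so you would compare the output against your FASTA IDs.

```python
from collections import defaultdict


def hits_to_gff3(hits, source="cds_alignment"):
    """Turn parsed alignment hits into GFF3 lines.

    Each hit is a dict with a hypothetical layout: cds_id, seqid, strand,
    and blocks, a list of 1-based inclusive (start, end) genome intervals.
    Returns (gff3_lines, ambiguous_ids).
    """
    by_cds = defaultdict(list)
    for hit in hits:
        by_cds[hit["cds_id"]].append(hit)

    lines = ["##gff-version 3"]
    ambiguous = []
    for cds_id, cds_hits in by_cds.items():
        if len(cds_hits) > 1:
            # More than one candidate location (e.g. paralogs): do not guess.
            ambiguous.append(cds_id)
            continue
        hit = cds_hits[0]
        seqid, strand = hit["seqid"], hit["strand"]
        blocks = sorted(hit["blocks"])
        lines.append("\t".join([
            seqid, source, "mRNA", str(blocks[0][0]), str(blocks[-1][1]),
            ".", strand, ".", f"ID={cds_id}",
        ]))
        # Phase is computed in transcription order, so reverse for '-' strand.
        ordered = blocks if strand == "+" else blocks[::-1]
        phase = 0
        rows = []
        for start, end in ordered:
            rows.append((start, end, phase))
            phase = (3 - (end - start + 1 - phase) % 3) % 3
        # Write the CDS segments back out in genomic order.
        for start, end, seg_phase in sorted(rows):
            lines.append("\t".join([
                seqid, source, "CDS", str(start), str(end),
                ".", strand, str(seg_phase),
                f"ID={cds_id}.cds;Parent={cds_id}",
            ]))
    return lines, ambiguous


if __name__ == "__main__":
    # Made-up example hits, only to show the expected input shape.
    demo_hits = [
        {"cds_id": "cdsA", "seqid": "chr1", "strand": "+",
         "blocks": [(101, 200), (301, 350)]},
        {"cds_id": "cdsB", "seqid": "chr1", "strand": "+",
         "blocks": [(1000, 1299)]},
        {"cds_id": "cdsB", "seqid": "chr2", "strand": "-",
         "blocks": [(500, 799)]},
    ]
    gff_lines, skipped = hits_to_gff3(demo_hits)
    print("\n".join(gff_lines))
    print("ambiguous:", skipped)
```

    The sketch only writes mRNA and CDS features; whether you also need gene or exon features depends on the tool that will read the file.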

    Hope this helps!



    • #3
      Thank you for your reply.

      The databases are sequence databases (both are FASTA files).



      • #4
        Hi capricy,

        I am in an almost identical situation (I have protein and genome sequences and am looking to generate a GFF file), so I am interested to know which approach you take to achieve it.

        Thanks.



        • #5
          Hi capricy,
          I was asking for the name of the database. Can you also provide the URL of the datasets you downloaded? I would like to take a look at them. As I mentioned earlier, in the absence of an annotation file, you may have to align the CDS to the genome with your preferred tools.



          • #6
            @fahmida, the same goes for you. There should be information about the CDS that gave rise to those proteins. It is more 'comfortable' to align CDS than proteins.



            • #7
              Hi, Apexy,

              My databases came from a collaborator and are not downloadable yet; they are plain sequence text. As you pointed out, alignment itself has issues, so we decided to use other databases that have GFF files available.

              Thanks a lot for your reply.

              Capricy
