Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Download fasta file for pdb id from uniprot

    Hi Members,
    I've a list of pdb id [some ~80].
    I want to download their fasta file from Uniprot.

    I have been unable to find URL which can be then utilized using wget or curl.

    example: 2rh1
    Please help.
    Bioinformaticscally calm

  • #2
    If you have 80 perhaps getting the fasta files for UniProt/SwissProt and UniProt/TreEmbl (ftp://ftp.uniprot.org/pub/databases/...base/complete/) then parsing out the sequences you need.

    Comment


    • #3
      Thank you GenoM for your reply.

      I downloaded sprot [swissprot] file. On going through it, I found protein name as:

      >sp|Q6GZX3|002L_FRG3G Uncharacterized protein 002L OS=Frog virus 3 (isolate Goorha) GN=FV3-002L PE=4 SV=1

      I have pdb ids with me, and not much information other than that.
      I was wondering how would I be able to get hold my data from this header.

      Appreciate your inputs.

      Originally posted by GenoMax View Post
      If you have 80 perhaps getting the fasta files for UniProt/SwissProt and UniProt/TreEmbl (ftp://ftp.uniprot.org/pub/databases/...base/complete/) then parsing out the sequences you need.
      Bioinformaticscally calm

      Comment


      • #4
        I guess 6GZX are the PDB ids. I am randomly grep-ping the ids and have hit few of them.
        But there are multiple entries for one such id.
        Code:
        $ grep -i "2m3g" uniprot_sprot.fasta
        >sp|Q2M3G0|ABCB5_HUMAN ATP-binding cassette sub-family B member 5 OS=Homo sapiens GN=ABCB5 PE=1 SV=3

        >sp|Q2M3G4|SHRM1_HUMAN Protein Shroom1 OS=Homo sapiens GN=SHROOM1 PE=1 SV=1
        Bioinformaticscally calm

        Comment


        • #5
          PDB to Uniprot ID's: http://www.uniprot.org/docs/pdbtosp

          Comment


          • #6
            Wow. That helps a lot. Thank you.

            FASTA sequence/file can be downloaded using wget from a URL:
            Ex:
            pdb id -2E9Q
            Code:
            wget "http://www.uniprot.org/uniprot/?query=database%3A%28type%3Apdb+2E9Q%29&format=fasta"
            Note: There can be multiple sequences of one PDB id.

            Last edited by bio_informatics; 09-12-2014, 06:45 AM.
            Bioinformaticscally calm

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Recent Advances in Sequencing Analysis Tools
              by seqadmin


              The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
              05-06-2024, 07:48 AM
            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin




              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
              04-22-2024, 07:01 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 05-14-2024, 07:03 AM
            0 responses
            15 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-10-2024, 06:35 AM
            0 responses
            37 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-09-2024, 02:46 PM
            0 responses
            45 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 05-07-2024, 06:57 AM
            0 responses
            39 views
            0 likes
            Last Post seqadmin  
            Working...
            X