  • Biopython stops querying database after ~10 seconds

    Hi everyone,

    I am a novice at Biopython, but have gotten a few things to work so far. Previously, I used Biopython to pull nucleotide and protein sequences for a number of genes that were differentially expressed in my RNA-seq analysis. I am now trying to perform GO analysis on my dataset, and am trying to use Biopython to gather the Entrez gene IDs (needed for the gene2go annotations in the GO analysis R package) from the GenBank nucleotide IDs.

    My script seems to be working fine, but the problem comes after about 10-60 seconds of running. At that point, it appears to stop querying the database and becomes "stuck". I've attempted to put in a "try-except" loop for when it gets stuck, but this doesn't seem to work. I'll post my code below, along with the error message after I hit Ctrl-C to exit the program.

    NOTE: my output file is correct up to the point where biopython stops querying the database. Every run gets "stuck" at a different point, so I don't think there is anything wrong with my files.


    What the file that needs to be parsed looks like:
    >ABCA2|NM_001606.4
    ATGGGC...TGA
    >ABHD15|NM_198147.2
    ATGCCG...TAG

    etc...

    The output file will be identical, but with the Entrez ID appended after the GenBank ID, e.g.:
    >ABCA2|NM_001606.4|20
    etc...

    my code:
    Code:
    from Bio import Entrez
    import glob
    import re
    
    Entrez.email = "[email protected]"
    
    filenames = glob.glob("*_cds.fas")
    for file in filenames:
        print "working on %s"%file
        ofile = open(file)
        wfile = open(file+"_entrez",'w')
        n=0
        for line in ofile:
            if line.startswith(">"):
                line = [x.strip() for x in line.split("|")]
                handle = Entrez.esearch(db="gene",term=line[1].strip())
                EntrezID = Entrez.read(handle)
                EntrezID = EntrezID["IdList"][0]+"\n"
                wfile.write('|'.join(x for x in line+[EntrezID]))
                n+=1
                if n%100 == 0:
                    print "processed %s sequences"%n
            else:
                wfile.write(line)
        print "finished, processed %s entries"%n
        ofile.close()
        wfile.close()
    and the error:

    Code:
    KeyboardInterrupt                         Traceback (most recent call last)
    /Users/XXX/Desktop/XXX/XXX/XXX/XXX/Add_Entrez_IDs.py in <module>()
         21         if line.startswith(">"):
         22             line = [x.strip() for x in line.split("|")]
    ---> 23             handle = Entrez.esearch(db="gene",term=line[1].strip())
         24             EntrezID = Entrez.read(handle)
         25             EntrezID = EntrezID["IdList"][0]+"\n"
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/Bio/Entrez/__init__.pyc in esearch(db, term, **keywds)
        187                  'term': term}
        188     variables.update(keywds)
    --> 189     return _open(cgi, variables)
        190 
        191 
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages/Bio/Entrez/__init__.pyc in _open(cgi, params, post)
        464             # HTTP GET
        465             cgi += "?" + options
    --> 466             handle = _urlopen(cgi)
        467     except _HTTPError as exception:
        468         raise exception
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/urllib2.pyc in urlopen(url, data, timeout, cafile, capath, cadefault, context)
        152     else:
        153         opener = _opener
    --> 154     return opener.open(url, data, timeout)
        155 
        156 def install_opener(opener):
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/urllib2.pyc in open(self, fullurl, data, timeout)
        429             req = meth(req)
        430 
    --> 431         response = self._open(req, data)
        432 
        433         # post-process response
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/urllib2.pyc in _open(self, req, data)
        447         protocol = req.get_type()
        448         result = self._call_chain(self.handle_open, protocol, protocol +
    --> 449                                   '_open', req)
        450         if result:
        451             return result
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/urllib2.pyc in _call_chain(self, chain, kind, meth_name, *args)
        407             func = getattr(handler, meth_name)
        408 
    --> 409             result = func(*args)
        410             if result is not None:
        411                 return result
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/urllib2.pyc in http_open(self, req)
       1225 
       1226     def http_open(self, req):
    -> 1227         return self.do_open(httplib.HTTPConnection, req)
       1228 
       1229     http_request = AbstractHTTPHandler.do_request_
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/urllib2.pyc in do_open(self, http_class, req, **http_conn_args)
       1192 
       1193         try:
    -> 1194             h.request(req.get_method(), req.get_selector(), req.data, headers)
       1195         except socket.error, err: # XXX what error?
       1196             h.close()
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/httplib.pyc in request(self, method, url, body, headers)
       1051     def request(self, method, url, body=None, headers={}):
       1052         """Send a complete request to the server."""
    -> 1053         self._send_request(method, url, body, headers)
       1054 
       1055     def _set_content_length(self, body, method):
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/httplib.pyc in _send_request(self, method, url, body, headers)
       1091         for hdr, value in headers.iteritems():
       1092             self.putheader(hdr, value)
    -> 1093         self.endheaders(body)
       1094 
       1095     def getresponse(self, buffering=False):
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/httplib.pyc in endheaders(self, message_body)
       1047         else:
       1048             raise CannotSendHeader()
    -> 1049         self._send_output(message_body)
       1050 
       1051     def request(self, method, url, body=None, headers={}):
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/httplib.pyc in _send_output(self, message_body)
        891             msg += message_body
        892             message_body = None
    --> 893         self.send(msg)
        894         if message_body is not None:
        895             #message_body was not a string (i.e. it is a file) and
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/httplib.pyc in send(self, data)
        853         if self.sock is None:
        854             if self.auto_open:
    --> 855                 self.connect()
        856             else:
        857                 raise NotConnected()
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/httplib.pyc in connect(self)
        830         """Connect to the host and port specified in __init__."""
        831         self.sock = self._create_connection((self.host,self.port),
    --> 832                                            self.timeout, self.source_address)
        833 
        834         if self._tunnel_host:
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/socket.pyc in create_connection(address, timeout, source_address)
        564             if source_address:
        565                 sock.bind(source_address)
    --> 566             sock.connect(sa)
        567             return sock
        568 
    
    /Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/socket.pyc in meth(name, self, *args)
        226 
        227 def meth(name,self,*args):
    --> 228     return getattr(self._sock,name)(*args)
        229 
        230 for _m in _socketmethods:
    
    KeyboardInterrupt:
    any help would be great!

  • #2
    The slowdown might be the NCBI throttling your searches. Have you looked into using elink rather than esearch? If that is possible, you should be able to submit batches of queries at once.

    I suspect, however, that there is a more appropriate way to do this; you can probably download all the accessions for human genes in one go...
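
    For what it's worth, a rough sketch of an elink-based lookup (replacing the esearch call in your loop) would look something like the code below. I'm assuming elink accepts an accession.version such as NM_001606.4 as the id for nuccore; if it doesn't, you'd need one extra esearch against nuccore to get the numeric UID first. Untested, so treat it as a starting point rather than a drop-in fix.
    Code:
    from Bio import Entrez

    Entrez.email = "you@example.com"  # put your real address here

    def gene_ids_for_accession(acc):
        # Link from the nucleotide record straight to its gene record(s).
        handle = Entrez.elink(dbfrom="nuccore", db="gene", id=acc)
        record = Entrez.read(handle)
        handle.close()
        linksets = record[0].get("LinkSetDb", [])
        if not linksets:
            return []  # nothing linked to this accession
        return [link["Id"] for link in linksets[0]["Link"]]

    print(gene_ids_for_accession("NM_001606.4"))
    Batching several accessions per request is also possible with epost and the history server, but keeping the accession-to-gene mapping straight takes a bit more care, so this keeps it to one accession per call.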

    • #3
      yeah, I thought that could be it, too. However, I never received an email from NCBI saying that I was pinging them too fast. (According to the Biopython cookbook tutorial, they will send you an email if they are limiting your access.)

      Oh well, I'll figure something else out. It's fairly trivial to parse a .gff file to pull Entrez gene IDs. Thanks for your help!

      • #4
        Have you considered the possibility that it may be your institutional firewall that is blocking access (not sure what port you are using)?

        • #5
          I don't think so. I was previously able to pull sequences for all the genes in my analysis using basically the same code.

          Also (I don't know if this matters), the program does work; it just stops collecting data after a few seconds. I'm probably mistaken, but if my institution were blocking access, wouldn't that mean I couldn't get any data at all?

          I also tried to do this at home with no success...

          Thanks for your reply

          • #6
            Can you put in a pause after retrieving every 2-3 records to see if that helps? BTW: which Entrez ID are you referring to? The example above must be a dummy.
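
            Something along these lines is what I had in mind; the 30-second timeout, the pause length, and the retry count are all arbitrary, and the timeout mainly means a stalled connection raises an error instead of hanging forever.
            Code:
            import socket
            import time
            from Bio import Entrez

            Entrez.email = "you@example.com"  # use your real address

            # Give up on any single request after 30 seconds instead of blocking forever.
            socket.setdefaulttimeout(30)

            def esearch_with_retry(term, retries=3, pause=1.0):
                for attempt in range(retries):
                    try:
                        handle = Entrez.esearch(db="gene", term=term)
                        result = Entrez.read(handle)
                        handle.close()
                        return result
                    except (IOError, socket.timeout):
                        time.sleep(pause * (attempt + 1))  # back off before retrying
                return None
            Calling this in place of Entrez.esearch in your loop, plus a short time.sleep() every few records, would at least tell you whether the stalls are on the network side.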

            • #7
              Originally posted by lstbl View Post
              yeah, I thought that could be it, too. However, I never received an email from NCBI saying that I was pinging them too fast. (According to the Biopython cookbook tutorial, they will send you an email if they are limiting your access.)
              Well, in theory, the NCBI says "The value of email will be used only to contact developers if NCBI observes requests that violate our policies, and we will attempt such contact prior to blocking access."
              The Entrez Programming Utilities (E-utilities) are a set of nine server-side programs that provide a stable interface into the Entrez query and database system at the National Center for Biotechnology Information (NCBI). The E-utilities use a fixed URL syntax that translates a standard set of input parameters into the values necessary for various NCBI software components to search for and retrieve the requested data. The E-utilities are therefore the structured interface to the Entrez system, which currently includes 38 databases covering a variety of biomedical data, including nucleotide and protein sequences, gene records, three-dimensional molecular structures, and the biomedical literature.

              Originally posted by lstbl View Post
              Oh well, I'll figure something else out. It's fairly trivial to parse a .gff file to pull Entrez gene IDs. Thanks for your help!
              If you are dealing with 1000s of IDs, this ought to be far more reliable and faster than making all those online requests.
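
              As a rough illustration (the file name and the exact attribute layout are assumptions on my part; check them against whichever RefSeq GFF3 you download), pulling the GeneID out of the Dbxref attributes could look something like this:
              Code:
              import re

              # Placeholder file name; RefSeq GFF3 mRNA lines carry attributes like
              # "Dbxref=GeneID:20,Genbank:NM_001606.4" (verify against your own file).
              acc_to_geneid = {}
              with open("GRCh38_latest_genomic.gff") as gff:
                  for line in gff:
                      if line.startswith("#"):
                          continue
                      attributes = line.rstrip("\n").split("\t")[-1]  # 9th GFF column
                      gene = re.search(r"GeneID:(\d+)", attributes)
                      acc = re.search(r"Genbank:([A-Z]{2}_\d+\.\d+)", attributes)
                      if gene and acc:
                          acc_to_geneid[acc.group(1)] = gene.group(1)
              One pass over the annotation file gives you a local accession-to-GeneID dictionary with no web requests at all.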
