Downloading metagenomes from ncbi from terminal

vsindorf

Junior Member

Join Date: Apr 2014

Posts: 2
- Share
- Tweet
#1

Downloading metagenomes from ncbi from terminal

04-27-2014, 12:15 AM

I am attempting to access metagenomes referenced in a paper (Thurber et al. 2009) which are housed on NCBI. I have the project numbers and genome project ID's. I found the genbank record of one of the genomes I want (wanting to download one first to run a test analysis, to see if my idea will even work). I can click on the WGS ID link and that takes me to a page where I can click on a "downloads" tab and it has three file formats listed. I download the FASTA file and get this:

ã3ËQ‘˝Àédª≤-ˆ˘Á
B∆€≥#Ä≈{Ÿ"™Ø*\ ›ï$®s?ær⁄É4ìúÛ»\KqŒ^È····∆áŸ∞a∆¡ˇÒø¸ˇ˚À„«ÁÀ◊ÁÀÀˇ/ˇÔˇûˇüø˛_?^~l_/ˇ√ÀˇèÚﬂ˛ˇ”ˇÚˇÎ˛ˇ”˘œˇ˙ﬂ˛◊ˇ¸¸«ˇı?ˇ∑ˇÂ?ˇÉø˚èˇ˝˛oˇ«˘?ˇÎ¸Ôˇ˘ø˝üˇ˘_ˇ?ˇô2}ïﬂ_ı˜W˚˝UÌÉf~“_Èa˚'µ¶øQä}≥˛ÊÚo©Ê∑˙_”Áì¸ä¸˛ÉµŸØ*_çˇLŸ^Ì~æ=ë‰Ω˚áØ˙`{√j_˙˚[˛*ÚÜy|•˛ﬁ¸™ÒÜÚ⁄¢O˝É˝ØñÒmo20˙ «'®„'ÙCyœd_Î∆Ä^c˛(}oﬂ„˜+“ˇhŒX8ØëÖ”
·œî˚,ÍxËá§ü±yc¶ŸÃ\S˘˝ê>¸Ô˝~/25ùVdäh¶ç’2lÛ∂p‡øˇj+ç˛8ˇÕ÷£πw´c˚.…ÃRu#˜„'å‹[hÀ…ßﬁl¢ïP∂øX∑Å$≥yŸ—¬ßü¸~Ç>s£Å†…˝˝øî…J⁄!€Øn/‹ﬁåﬂ0ÀÊêeëe]eyÔ\‰ÔÚÍ§q∑olÆ±‚∂Wà®º»¯£mi€+ª¡z¿`ΩáÎ˜ÍŸ>¨Ó≤FK•ÚÓîœNˇ≠MMTkËsnØÙ=ÿ∫œê˘ßY~¶¶ÎØ»Á=X›/pÛß†y”Ó˚ü˝W¶FÛ˘˚ˇìﬂè?æ`†>b´*ãcÆ44J4G¥˜M¨ÿAkh[<ïVê˘OR˜”ﬂÑ∞ık∫ÌÈù≥â*Éƒ˛´l÷∆ã#Àe9˝Í∏2ÊùkËÜÊ3Ë™ÿËﬂ˜∑…ê‘qµ∫M›ˆ£ﬂèrÎæJ6ÚN-ˇ˙Ì™∂!.›±f≥;dTºÅ t˘»üMÙΩ
AÙ2›¬µØŒ⁄W†¸!7000_°Åi»Pù'ä&¥X∑Aa∑,ªoõTﬁ(∫”$‹*FÎ{üÀ}t⁄à≥ÚCﬁ√Iˆ≤›ÜÇ5ƒˇÁ˚˝4≤D›®º√®<B£AbÑa≤ﬁÏ/i!îB·*—≤°Õ∑Y˘˚á<®mãá¸±–ºÉ}7é]=ÏõÁ∆KŸøï˙k˚+e[⁄y{˚ÏGÂ
FÂgdTZ·OΩm˝òU¶H`ôná™Ö¥V
∏íãﬂc¯G8ÔQKê¿.:Ω˙Ay˘ÒWúÓ‘Î¶r≈ÎŒ‹Ó∫øwE≤>óﬂ}ÓàåU>óH)^b)≈ˆâ¯è+ç˝`tJ£S≤¯ò*†¨ fëßÚÊn˘ªmHŸÈH.—AÁ πˆ¿ú’µˆ*y‰/Uùn?C˛IﬂS‹-lÀ/a∞º€∏NwB:¬ƒìç`wÇŸ©˙›RÁﬁnº´èèüÅ_B∏nQvŒv%˘± ç≈Ê4≥dNE
ß∫˝põ€>UÇ-˝rn˛ó$©’øCøëÂ9v±„7˙°eÂ 1ÅO?r_B ∑
\%≥–MΩÒ8¡hÔ
G∏

etc.

Now either I'm showing my complete ignorance or this is not in fact anything usable. Why is it giving me complete gibberish?

Ideally I would like to do this all through the terminal, and I tried it there as well using wget with the same results.

This if my first time trying to access sequence data from NCBI, I must be missing something. Any help would be much appreciated, especially tips on doing this cleanly and sexily through CLI.

Thanks!
Tori
Tags: None
GenoMax

Senior Member

Join Date: Feb 2008

Posts: 7140
- Share
- Tweet
#2

04-27-2014, 05:01 PM

Can you post a representative GenBank ID? I can't access the paper at the moment to lookup the GenBank record #'s.
Comment

Previous template Next

Essential Discoveries and Tools in Epitranscriptomics

by seqadmin

The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
- Channel: Articles
Yesterday, 07:01 AM
Current Approaches to Protein Sequencing

by seqadmin

Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
- Channel: Articles
04-04-2024, 04:25 PM

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 57 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 53 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 45 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 55 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Downloading metagenomes from ncbi from terminal

Comment

Latest Articles

ad_right_rmr

News