Problem while using Gbrowse 2.54 with hapmap data

Votinhkiem90

Member

Join Date: Sep 2013

Posts: 30
- Share
- Tweet
#1

Problem while using Gbrowse 2.54 with hapmap data

10-03-2014, 07:56 PM

Hi all.
I want to visualize hapmap data (on site hapmap project) using Gbrowse. I have download all data that Gbrowse need on hapmap site.
I also performed all step following Readme file (bellow), of course, i changed follow me (ie: name database, description ...). and all the step did'nt get any error. but when i access with firefox browse, it no get anything.
What should i do to slove it.
Readme's content
#---------------------------------
# HAPMAP GBROWSE
#---------------------------------

This guide and the contents of this directory are intended to help users install their
local copy of the HapMap GBrowse feature viewer.

### Note on hardware requirements

Loading the GFF-files into Bio:B::GFF requires a substantial amount of memory (1.5-2Gb). Also,
running the MySQL server with the resulting database giving acceptable performance requires a fairly
powerful machine. Make sure you have sufficient hardware before attempting to install the HapMap
GBrowse locally. A typical workstation will almost certainly not be enough, a server-class machine
with at least 4Gb RAM is a bare minimum.

### Get data

Download all available GFFs from the HapMap website:
ncftpget ftp://www.hapmap.org/gbrowse/latest/gff/*gff.gz
This will get you the following data from the HapMap project:
gt+allele_freqs_*.gff.gz : genotyped SNP features, with allele and genotype frequencies
recomb_hotspots.gff.gz : recombination hotspot features
recomb_rate.gff.gz : recombination rate features
and bincounts for SNP and gene features, in low (500Kb) and high (5-20Kb) resolution windows:
*density*.gff.gz

and various annotations from external sources. For all feature types, see GBrowse help pages for
more information:

http://www.hapmap.org/cgi-perl/gbrowse/gbrowse/hapmap/?help=citations

Download FASTA-files for the genome assembly from UCSC (NB command gets B35 assembly used by the
current HapMap release (rel#21), adjust to get more recent assemblies). Note that even if you do
not plan to use the DNA-sequence in your local installation, the HapMap frequency glyph require it.
ncftpget ftp://hgdownload.cse.ucsc.edu/apache...es/chr*.fa.zip

### Get software

Install the latest GBrowse/Bio:B::GFF package, and make sure you have a working installation
by installing some of the test data bundled with GBrowse (as described in tutorial).

Generic Model Organism Database Project - Browse /Generic Genome Browser at SourceForge.net

http://sourceforge.net/project/showfiles.php?group_id=27707&package_id=34513

Generic Model Organism Database Project /Generic Genome Browser files. Browse /Generic Genome Browser files for Generic Model Organism Database Project

Download and install custom HapMap glyphs & plugins into your GBrowse, and make sure
they do not have compilation errors due to missing libraries in your local machine etc.

http://www.hapmap.org/downloads/gbrowse/latest/plugins/

http://www.hapmap.org/downloads/gbrowse/latest/glyph/

Install the latest Bioperl distribution, so you definately have working GFF3 handling
in the Bio:B::GFF library (there was a buggy version in circulation a while ago).

Page not found · GitHub Pages

http://www.bioperl.org/Core/Latest/index.shtml

Download a patched Bio:B::GFF bulkloader script from the HapMap website and add to
your existing Bioperl installation. The patch does the following:
-forces the attributes of genotyped SNP features for each population to 'pile up'
on a single fdata table entry, instead of the default behaviour of creating seperate
features per SNP per population.
-enables decompression of zipped (.zip) FASTA files from UCSC on the fly.

http://www.hapmap.org/downloads/gbrowse/latest/misc/bulk_load_gff.PLS

### Load feature data into MySQL

To get around 'full-table' problems with the fattribute_to_feature table, make sure you have at least MySQL 5.0.6
so you can have tables >4Gb without modifying server parameters or table properties (see
http://dev.mysql.com/doc/refman/5.0/en/full-table.html).

Technical Difficulties

http://dev.mysql.com/downloads/mysql/5.0.html

To load GFFs, a two-step procedure is needed to get around a Perl memory problem in the bulk loader. The problem
may not exist on newer Unix platforms & Perl distributions, so you can try just loading the entire set of GFFs
in one fell swoop with the bulk loader:

-Load part I: run the bulkloader on all GFFs *except* the dbSNP file, plus the dir containing the FASTA-files.
This takes 6-7 hours on our machine:
nohup time bulk_load_gff -c -d dbi:mysql:[dbname] --user [user name] --password [XXXX]
--gff3_munge --maxfeature 10000000000 --local --fasta dna/* *.gff* &

-Load part II: run the regular loader on the dbSNP files to add them to the bulkloaded data. This takes
8-10 hours on our machine:
nohup time load_gff --dsn dbi:mysql:[dbname] -gff3_munge --local dbsnp_b124_on_B34.gff3.gz &
[NOTE: You can combine both the steps if you are reasonably confident of the dataload procedure]

Set proper permissions on the database, so the user running Apache (typically 'nobody') can read it:
mysql>grant select on [dbname].* to nobody;
mysql>flush privileges;

Download the HapMap GBrowse config, copy to your config dir, modify and test:

http://www.hapmap.org/downloads/gbrowse/latest/conf/hapmap.conf

Finally, if everything is working, optimize MySQL database tables to get better performance. This
can take an hour or even more:
mysql>optimize table, fattribute, fattribute_to_feature, fdata,fdna,fgroup,fmeta,ftype;

-------------------------------
[email protected]

So it
Tags: None

Previous template Next

Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing

by GATTACAT

Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
- Channel: Articles
Today, 11:43 AM
Nine Things a Sample Prep Scientist Thinks About Before Sequencing

by SEQadmin2

I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

Here are nine questions we think about, in roughly the order they matter, before...
- Channel: Articles
06-18-2026, 07:11 AM
From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data

by SEQadmin2

Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.

The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...
- Channel: Articles
06-02-2026, 10:05 AM

Topics	Statistics	Last Post
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, Yesterday, 05:37 AM	0 responses 8 views 0 reactions	Last Post by SEQadmin2 Yesterday, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 17 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 52 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 110 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM

Unconfigured Ad

Problem while using Gbrowse 2.54 with hapmap data

Latest Articles

ad_right_rmr

News