# This is an example configuration file for FastQ Screen ############################ ## Bowtie, Bowtie 2 or BWA # ############################ ## If the Bowtie, Bowtie 2 or BWA binary is not in your PATH, you can set ## this value to tell the program where to find your chosen aligner. Uncomment ## the relevant line below and set the appropriate location. Please note, ## this path should INCLUDE the executable filename. #BOWTIE /usr/local/bin/bowtie/bowtie BOWTIE2 /usr/lib/bowtie2/bin/bowtie2 #BWA /usr/local/bwa/bwa ############################################ ## Bismark (for bisulfite sequencing only) # ############################################ ## If the Bismark binary is not in your PATH then you can set this value to ## tell the program where to find it. Uncomment the line below and set the ## appropriate location. Please note, this path should INCLUDE the executable ## filename. BISMARK /data/Bismark/bismark ############ ## Threads # ############ ## Genome aligners can be made to run across multiple CPU cores to speed up ## searches. Set this value to the number of cores you want for mapping reads. THREADS 8 ############## ## DATABASES # ############## ## This section enables you to configure multiple genomes databases (aligner index ## files) to search against in your screen. For each genome you need to provide a ## database name (which can't contain spaces) and the location of the aligner index ## files. ## ## The path to the index files SHOULD INCLUDE THE BASENAME of the index, e.g: ## /data/public/Genomes/Human_Bowtie/GRCh37/Homo_sapiens.GRCh37 ## Thus, the index files (Homo_sapiens.GRCh37.1.bt2, Homo_sapiens.GRCh37.2.bt2, etc.) ## are found in a folder named 'GRCh37'. ## ## If, for example, the Bowtie, Bowtie2 and BWA indices of a given genome reside in ## the SAME FOLDER, a SINLGE path may be provided to ALL the of indices. The index ## used will be the one compatible with the chosen aligner (as specified using the ## --aligner flag). ## ## The entries shown below are only suggested examples, you can add as many DATABASE ## sections as required, and you can comment out or remove as many of the existing ## entries as desired. We suggest including genomes and sequences that may be sources ## of contamination either because they where run on your sequencer previously, or may ## have contaminated your sample during the library preparation step. ## ## Human - sequences available from ## ftp://ftp.ensembl.org/pub/current/fasta/homo_sapiens/dna/ #DATABASE Human /data/Bismark/Human/Human ## ## Mouse - sequence available from ## ftp://ftp.ensembl.org/pub/current/fasta/mus_musculus/dna/ #DATABASE Mouse /data/Bismark/Mouse/ ## ## Ecoli- sequence available from EMBL accession U00096.2 DATABASE Daphnia /data/Bismark/GenomeDaphnia/ ## ## PhiX - sequence available from Refseq accession NC_001422.1 #DATABASE PhiX /data/Bismark/Control/ ## ## Adapters - sequence derived from the FastQC contaminats file found at: www.bioinformatics.babraham.ac.uk/projects/fastqc #DATABASE Adapters /data/public/Genomes/Contaminants/Contaminants ## ## Vector - Sequence taken from the UniVec database ## http://www.ncbi.nlm.nih.gov/VecScreen/UniVec.html #DATABASE Vectors /data/public/Genomes/Vectors/Vectors