I'm trying to get BFAST working as an aligner for me to use to attempt to detect human contamination in a bacterial metagenomic sample (everything will be 100mer Illumina reads). I am using the ensembl build 36 human genome + some additional novel regions from 2 other human genomes sequenced at the BGI. The total db size is ~3.0Gb, but it consists of 24 chromosomes that are VERY large, and then several thousand small sequences in addition to that. So its kind of a 'lopsided' database.
I successfully ran 'bfast fasta2brg' on the file, but now for the 'bfast index' step I was using the '-d 1' parameter to reduce the memory footprint. From other threads I'd gotten the idea that using '-d 1' would probably keep my memory footprint down to ~8Gb. But all my blade jobs keep dying when I request only 8Gb of memory. What kind of memory can I expect my job to require?
On another matter, I'm using the masks listed in the bfast manual for 'illumina reads > 40bp'. Should those be good enough for me to align Illumina 100mers, or would I be better off defining new masks? My goal is to identify human reads out from amongst bacterial sequences. So I believe I can be fairly relaxed in my search criteria without fear of falsely identifying bacterial reads as human.
I successfully ran 'bfast fasta2brg' on the file, but now for the 'bfast index' step I was using the '-d 1' parameter to reduce the memory footprint. From other threads I'd gotten the idea that using '-d 1' would probably keep my memory footprint down to ~8Gb. But all my blade jobs keep dying when I request only 8Gb of memory. What kind of memory can I expect my job to require?
On another matter, I'm using the masks listed in the bfast manual for 'illumina reads > 40bp'. Should those be good enough for me to align Illumina 100mers, or would I be better off defining new masks? My goal is to identify human reads out from amongst bacterial sequences. So I believe I can be fairly relaxed in my search criteria without fear of falsely identifying bacterial reads as human.
Comment