Hi,
I want to map RNAseq data (from 10 to 30 Go, single and paired-end) on complete human genome with hisat2. To do that, I use a cluster (15 nodes, 24 cores each one, 64Go RAM, workload manager SLURM).
I succeed to align RNAseq files (20Go single reads or 2x20Go paired-end reads) against one human chromosome (chr7), indexed with annotation according to the manual (exons and splice sites extracted by adapted python files) with 1 node/24 cores (with hisat2 option -p 24) or 2 nodes/48 cores (-p 48).
I succeed to map my data against partial genome (chromosomes 1 to 10, without annotation).
But when I want to align with the full human genome (only the chromosomes, index without annotation), ( several tests with 2 to 12 nodes, 12 or 24 cores/node), the execution fails with a error message like this :
slurmstepd: Step 8878.0 exceeded memory limit (3652264 > 2744320), being killed
srun: Exceeded job memory limit
srun: Exceeded job memory limit
srun: Force Terminated job 8878
srun: Job step aborted: Waiting up to 2 seconds for job step to finish.
slurmstepd: *** STEP 8878.0 CANCELLED AT 2016-03-30T11:16:32 ***
srun: error: thau016: task 0: Killed
I understand that I don't have enought memory, even with this cluster. So does anyone know features to align such data against complete human genome? Is the problem something else?
Thanks!
I want to map RNAseq data (from 10 to 30 Go, single and paired-end) on complete human genome with hisat2. To do that, I use a cluster (15 nodes, 24 cores each one, 64Go RAM, workload manager SLURM).
I succeed to align RNAseq files (20Go single reads or 2x20Go paired-end reads) against one human chromosome (chr7), indexed with annotation according to the manual (exons and splice sites extracted by adapted python files) with 1 node/24 cores (with hisat2 option -p 24) or 2 nodes/48 cores (-p 48).
I succeed to map my data against partial genome (chromosomes 1 to 10, without annotation).
But when I want to align with the full human genome (only the chromosomes, index without annotation), ( several tests with 2 to 12 nodes, 12 or 24 cores/node), the execution fails with a error message like this :
slurmstepd: Step 8878.0 exceeded memory limit (3652264 > 2744320), being killed
srun: Exceeded job memory limit
srun: Exceeded job memory limit
srun: Force Terminated job 8878
srun: Job step aborted: Waiting up to 2 seconds for job step to finish.
slurmstepd: *** STEP 8878.0 CANCELLED AT 2016-03-30T11:16:32 ***
srun: error: thau016: task 0: Killed
I understand that I don't have enought memory, even with this cluster. So does anyone know features to align such data against complete human genome? Is the problem something else?
Thanks!
Comment