Seqanswers Leaderboard Ad

**scalabrin** · 01-04-2012, 07:41 AM

Hi Doug,

I just had the same problem and solved it with -XX:MaxPermSize=512m

As you already tried with 1g, it looks like you just need to increase it further... was the 4g enough?

**dgscofield** · 01-04-2012, 09:33 AM

Hi, yep, 4GB was enough. If I recall it died with 2GB. The main challenge was getting enough heap space, had to request 256GB and if I believe the htop stats it was using 221GB at one point :-)

/Doug

**townway** · 02-03-2012, 02:28 PM

The following are the output of picards_markduplicates, I changed some of the options to bigger number but it still give me error.
my file is about 10GB of bam file, and program was running with 24G RAM using version 1.49 and 1.50. Please help me to fix the problem. Thank you so much

net.sf.picard.sam.MarkDuplicates INPUT=accepted_hits_sorted.bam OUTPUT=accepted_hits_sorted.pk.mk.out METRICS_FILE=accepted_hits_sorted.pk.mk.metrics ASSUME_SORTED=true MAX_SEQUENCES_FOR_DISK_READ_ENDS_MAP=500000000 MAX_FILE_HANDLES_FOR_READ_ENDS_MAP=1000 MAX_RECORDS_IN_RAM=500000000 REMOVE_DUPLICATES=false SORTING_COLLECTION_SIZE_RATIO=0.25 READ_NAME_REGEX=[a-zA-Z0-9]+:[0-9]

[0-9]+)

[0-9]+).* OPTICAL_DUPLICATE_PIXEL_DISTANCE=100 TMP_DIR=/tmp/tangwei VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5 CREATE_INDEX=false CREATE_MD5_FILE=false
[Fri Feb 03 16:06:12 EST 2012] Executing as tangwei@p809 on Linux 2.6.18-128.el5 i386; Java HotSpot(TM) Server VM 1.7.0_02-b13
INFO 2012-02-03 16:06:12 MarkDuplicates Start of doWork freeMemory: 63278136; totalMemory: 64356352; maxMemory: 1908932608
INFO 2012-02-03 16:06:12 MarkDuplicates Reading input file and constructing read end information.
INFO 2012-02-03 16:06:12 MarkDuplicates Will retain up to 7575129 data points before spilling to disk.
INFO 2012-02-03 16:06:18 MarkDuplicates Read 1000000 records. Tracking 8778 as yet unmatched pairs. 8778 records in RAM. Last sequence index: 0
......
......
INFO 2012-02-03 16:41:35 MarkDuplicates Read 151000000 records. Tracking 5300425 as yet unmatched pairs. 5300425 records in RAM. Last sequence index: 51
[Fri Feb 03 16:52:03 EST 2012] net.sf.picard.sam.MarkDuplicates done. Elapsed time: 45.84 minutes.
Runtime.totalMemory()=1980170240
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
at java.util.regex.Matcher.<init>(Matcher.java:224)
at java.util.regex.Pattern.matcher(Pattern.java:1088)
at net.sf.picard.sam.AbstractDuplicateFindingAlgorithm.addLocationInformation(AbstractDuplicateFindingAlgorithm.java:61)
at net.sf.picard.sam.MarkDuplicates.buildReadEnds(MarkDuplicates.java:364)
at net.sf.picard.sam.MarkDuplicates.buildSortedReadEndLists(MarkDuplicates.java:298)
at net.sf.picard.sam.MarkDuplicates.doWork(MarkDuplicates.java:117)
at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:169)
at net.sf.picard.sam.MarkDuplicates.main(MarkDuplicates.java:101)

Originally posted by dgscofield View Post

Hi, yep, 4GB was enough. If I recall it died with 2GB. The main challenge was getting enough heap space, had to request 256GB and if I believe the htop stats it was using 221GB at one point :-)

/Doug

**scalabrin** · 02-05-2012, 12:06 AM

Perhaps you need to tell Java to use your memory (Java heap space), if I remember correctly Java allocates only 1Gb of memory if you don't instruct it differently.
You should use the option -Xmx

Have a look, for example, at http://www.ehow.com/how_5347474_set-...eap-space.html

Originally posted by townway View Post

The following are the output of picards_markduplicates, I changed some of the options to bigger number but it still give me error.
my file is about 10GB of bam file, and program was running with 24G RAM using version 1.49 and 1.50. Please help me to fix the problem. Thank you so much

[cut]

[Fri Feb 03 16:52:03 EST 2012] net.sf.picard.sam.MarkDuplicates done. Elapsed time: 45.84 minutes.
Runtime.totalMemory()=1980170240
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
at java.util.regex.Matcher.<init>(Matcher.java:224)
at java.util.regex.Pattern.matcher(Pattern.java:1088)
at net.sf.picard.sam.AbstractDuplicateFindingAlgorithm.addLocationInformation(AbstractDuplicateFindingAlgorithm.java:61)
at net.sf.picard.sam.MarkDuplicates.buildReadEnds(MarkDuplicates.java:364)
at net.sf.picard.sam.MarkDuplicates.buildSortedReadEndLists(MarkDuplicates.java:298)
at net.sf.picard.sam.MarkDuplicates.doWork(MarkDuplicates.java:117)
at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:169)
at net.sf.picard.sam.MarkDuplicates.main(MarkDuplicates.java:101)

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Today, 11:49 AM	0 responses 10 views 0 likes	Last Post by seqadmin Today, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Picard tools out of memory: PermGen

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News