Please help me, guys. Please give me some reply...
-
It may be a good idea to try a subset of your data (select a few large contigs and/or a known sequence with the right repeats) before you start running a large genome file through some of these tools. Depending on the size of the data set, run times can increase sharply.
-
Thank you, GenoMax.
I did that and got the result. I have one more problem.
I have installed RepeatModeler, but when I build the database it shows an error:
./BuildDatabase -name test test.fa
RepModelConfig.pm did not return a true value at ./BuildDatabase line 146.
BEGIN failed--compilation aborted at ./BuildDatabase line 146.
Can you tell me why this error occurs?
-
Originally posted by tnguyen:
Hi Rahul,
How large was your genome? How much memory was needed for your run? I received this error message at the start of Step 2:
"Could not allocate space for sequence"
Change
sequence = (char *) malloc( (2 * MAXLENGTH + 3 * PADLENGTH) * sizeof(char) );
if( NULL == sequence ) {
    fprintf(stderr, "Could not allocate space for sequence\n");
    exit(1);
}
to
sequence = (char *) malloc( (2 * (size_t)MAXLENGTH + 3 * (size_t)PADLENGTH) * sizeof(char) );
if( NULL == sequence ) {
    fprintf(stderr, "Could not allocate space for sequence\n");
    exit(1);
}
Otherwise the calculation with big numbers (files larger than about 1 GB) is not correct and results in much, much bigger memory allocations than necessary. I have had this situation previously under FreeBSD, Linux and Solaris. That change helped me to overcome this allocation error... It is actually running under FreeBSD now :-)
Cheers, sunnyseq
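A minimal sketch of why the cast matters, assuming MAXLENGTH and PADLENGTH are 32-bit ints (the cast in the fix suggests this, but it is not confirmed here). The helper names and the sample values are hypothetical and not taken from RepeatScout. Because signed overflow is undefined behaviour in C, the wrap is simulated with unsigned arithmetic rather than performed on an int:

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Illustrative helper (not RepeatScout code): the intended buffer size,
   computed in 64-bit arithmetic so the sum cannot wrap. */
int64_t wide_size(int maxlength, int padlength)
{
    assert(maxlength >= 0 && padlength >= 0);
    return 2 * (int64_t)maxlength + 3 * (int64_t)padlength;
}

/* The value a 32-bit two's-complement int would hold after wrapping.
   Simulated with unsigned arithmetic, which is well defined. */
int64_t wrapped_to_int32(int64_t wide)
{
    uint32_t low = (uint32_t)wide;  /* keeps only the low 32 bits */
    return low > (uint32_t)INT32_MAX ? (int64_t)low - 4294967296LL
                                     : (int64_t)low;
}

/* The byte count malloc would see once a wrapped value becomes a size_t. */
size_t as_malloc_argument(int64_t wrapped)
{
    return (size_t)wrapped;
}
```

With a hypothetical MAXLENGTH of 1200000000 and PADLENGTH of 20000, wide_size gives 2400060000, which no longer fits in a 32-bit int. The wrapped value is negative, and on a system with a 64-bit size_t it turns into a request of roughly 1.8e19 bytes, which would explain allocations "much bigger than necessary".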
-
Hi guys, I still have the same problem that people in this thread previously had.
I followed the suggestions above, and here is my command for running step 2 of RepeatScout:
RepeatScout -sequence genome.fasta -output genome_repeat.fasta -freq genome.freq -l 14
I get this error: "Could not allocate space for sequence".
I ran the test file and it runs, so the installation is not the problem. However, I noticed that the genome.fasta file in the test contains only a single consensus FASTA sequence, whereas my genome.fasta is an assembly containing multiple contigs (still in FASTA format). I should also add that the machine has plenty of memory, so I doubt that is the problem.
Does anybody have a suggestion?
Thanks a lot, Solidether
-
The error message "Could not allocate space for sequence":
The reason for this error is in the RepeatScout software itself. In the source code file "build_repeat_families.c" there are two places where memory is allocated with the call:
malloc( (2 * MAXLENGTH + 3 * PADLENGTH) * sizeof(char) )
This call tries to allocate the proper amount of memory, based on the size of your input file. However, for some reason the allocation fails when the input file is larger than 2 GB.
I don't know enough about C programming to say why there is this 2 GB limit. Anyhow, for testing purposes I created a modified RepeatScout version (RepeatScout_fixmem) in which the memory allocation is always 5 GB ( malloc( 5000000000 ) ).
After these modifications I was able to run the RepeatScout analysis.
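As an alternative to hard-coding 5 GB, here is a rough sketch of computing the size with explicit overflow checks, so the request scales with the input. This is not the RepeatScout code: the function names are hypothetical, and it assumes the two lengths are available as plain integers.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical helper: stores 2*maxlength + 3*padlength in *out using
   size_t arithmetic. Returns 0 on success, -1 if an input is negative
   or the result would not fit in size_t. */
int sequence_buffer_size(long long maxlength, long long padlength, size_t *out)
{
    assert(out != NULL);
    if (maxlength < 0 || padlength < 0)
        return -1;
    unsigned long long m = (unsigned long long)maxlength;
    unsigned long long p = (unsigned long long)padlength;
    if (m > SIZE_MAX / 2 || p > SIZE_MAX / 3)
        return -1;                 /* a product would overflow */
    size_t a = 2 * (size_t)m;
    size_t b = 3 * (size_t)p;
    if (a > SIZE_MAX - b)
        return -1;                 /* the sum would overflow */
    *out = a + b;
    return 0;
}

/* Hypothetical wrapper: returns NULL if the size is invalid, zero,
   or malloc itself fails. The caller frees the buffer. */
char *alloc_sequence(long long maxlength, long long padlength)
{
    size_t bytes;
    if (sequence_buffer_size(maxlength, padlength, &bytes) != 0 || bytes == 0)
        return NULL;
    return malloc(bytes);
}
```

The point of the design is that a bad size is reported as an error before malloc is called, instead of silently turning into a wrong request.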
-
Originally posted by solidether:
[...] for testing purposes I created a modified RepeatScout version (RepeatScout_fixmem) where the memory allocation is always 5 GB. ( malloc( 5000000000 ) )
After these modifications I was able to run the RepeatScout analysis.
Done allocating headptr
Done building headptr
There are 0 l-mers
Done sorting headptr
OOPS no good lmers
Any ideas?
-
Hello everyone, I get an error when I run the second RepeatScout command. If anyone has an idea, please share.
$ ./RepeatScout -sequence Ca_dromedarius_kacst.fna -output output_repeats -freq output -l 14
RepeatScout(9531,0x7fff9faf2380) malloc: *** mach_vm_map(size=18446744073479073792) failed (error code=3)
*** error: can't allocate region
*** set a breakpoint in malloc_error_break to debug
Could not allocate space for sequence
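The size in the mach_vm_map line looks like a negative 32-bit value that was converted to an unsigned 64-bit size. The sketch below decodes it under that assumption; the function names are hypothetical, and the allocator has probably rounded the figure to a page boundary, so the result is approximate.

```c
#include <assert.h>
#include <stdint.h>

/* Reads an unsigned 64-bit size as the signed value with the same bits,
   without relying on implementation-defined conversions. */
int64_t reinterpret_as_signed(uint64_t request)
{
    if (request <= (uint64_t)INT64_MAX)
        return (int64_t)request;
    uint64_t magnitude = UINT64_MAX - request + 1u;  /* 2^64 - request */
    return -(int64_t)(magnitude - 1u) - 1;
}

/* If a 32-bit int wrapped once to the negative value given, this is the
   size that was intended before the wrap (assumption: a single wrap). */
int64_t size_before_int32_wrap(int64_t wrapped)
{
    assert(wrapped < 0 && wrapped >= INT32_MIN);
    return wrapped + 4294967296LL;
}
```

For size=18446744073479073792 this gives -230477824 as the signed value and about 4064489472 bytes (roughly 4.06 GB) as the intended request. That would fit a 2 * MAXLENGTH term for a genome of about 2 Gb overflowing a 32-bit int.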
-
Hello. I know this is an old thread, but I can't find anyone able to answer.
I'm running RepeatScout. I built the l-mer table, called myfile.freq, from myfile.fa.
Can anyone tell me what the second and third columns of the output mean?
Here is an example:
```
AAAAAAAAGCGGGA 3 107776875
AAAAAAACTGTATG 10 83440519
AAAAAAAAGGCGTA 3 41037187
AAAAAAACTTGAAT 7 94493612
CATACATGCATGCA 1065 125671338
CATACATGCTTGAA 7 121799834
AAAAAAATCATGCA 10 95493021
AAAAAAAGTCCAGT 3 125127980
AATTCACATGTATG 7 102505668
```
Thank you