Originally posted by ohofmann
View Post
Unconfigured Ad
Collapse
X
-
Hi Nils
Just wondering can SRMA be used for rescuing orphaned reads. So we have a dataset of variable insert library as we are sequencing the 5' and 3' end of transcripts. As a result the distance between the mates( <--- --->) is dependent on the length of transcript. To map the reads initially I am first using Mosaik which i belv does a better job with variable insert mate pair data.
After mapping we still see 40% orphaned reads where one read maps and the other doesn't. I am wondering if SRMA can rescue these reads.
Thanks!
-Abhi
Comment
-
-
I have used it and it is fast. I have sometimes had trouble with files in the 100GB range but generally it works fine.Originally posted by ymc View PostDead project now? Are there other alternatives that work on the whole genome?
We have also parallelized the GATK implementation of LR if you are interested. I am not sure which is better at realigning. I do remember comparing SRMA and GATK LR and there are differences but it was not clear to me if one was consistently better than the other. I suspect that Nils would be a better source for info on that.
Comment
-
-
Tried several bams with 0.1.16 but all I got was this:
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
Comment
-
-
Could you post the full error message?Originally posted by ymc View PostTried several bams with 0.1.16 but all I got was this:
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
Comment
-
-
I have been interested in this tool for some time but never got it working:
Input is a sorted bam.
java -Xmx16g -jar srma-0.1.15.jar I=491_full_s.bam O=srma_491.bam R=../NC_002516.fna
[Fri Aug 17 10:00:54 CEST 2012] srma.SRMA INPUT=[491_full_s.bam] OUTPUT=[srma_491.bam] REFERENCE=../NC_002516.fna OFFSET=20 MIN_MAPQ=0 MINIMUM_ALLELE_PROBABILITY=0.1 MINIMUM_ALLELE_COVERAGE=3 MAXIMUM_TOTAL_COVERAGE=100 CORRECT_BASES=false USE_SEQUENCE_QUALITIES=true QUIET_STDERR=false MAX_HEAP_SIZE=8192 MAX_QUEUE_SIZE=65536 GRAPH_PRUNING=false NUM_THREADS=1 TMP_DIR=/tmp/colin2 VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false
java.util.NoSuchElementException
at java.util.Scanner.nextLine(Scanner.java:1503)
at net.sf.picard.reference.FastaSequenceIndex.parseIndexFile(FastaSequenceIndex.java:131)
at net.sf.picard.reference.FastaSequenceIndex.<init>(FastaSequenceIndex.java:55)
at net.sf.picard.reference.IndexedFastaSequenceFile.<init>(IndexedFastaSequenceFile.java:95)
at srma.SRMA.doWork(SRMA.java:131)
at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:156)
at srma.SRMA.main(SRMA.java:98)
Please report bugs to [email protected]
The fasta index file looks like this:
more ../NC_002516.fna.fai
NC_002516.2 6264404 58 70 71
Cheers for any help.
Comment
-
-
There are thousands of lines of these error messages. If I copy the stderr output, it will be too many lines. You can replicate my problem by downloading the pair-ended reads from
ftp://ftp.1000genomes.ebi.ac.uk/vol1...sequence_read/
and then align them using bwa. I got the same bug with SRR098401_*.filt.fastq.gz and SRR035330_*.filt.fastq.gz
Comment
-
-
It looks like your FASTA index is broken. Can you try rebuilding?Originally posted by colindaven View PostI have been interested in this tool for some time but never got it working:
Input is a sorted bam.
java -Xmx16g -jar srma-0.1.15.jar I=491_full_s.bam O=srma_491.bam R=../NC_002516.fna
[Fri Aug 17 10:00:54 CEST 2012] srma.SRMA INPUT=[491_full_s.bam] OUTPUT=[srma_491.bam] REFERENCE=../NC_002516.fna OFFSET=20 MIN_MAPQ=0 MINIMUM_ALLELE_PROBABILITY=0.1 MINIMUM_ALLELE_COVERAGE=3 MAXIMUM_TOTAL_COVERAGE=100 CORRECT_BASES=false USE_SEQUENCE_QUALITIES=true QUIET_STDERR=false MAX_HEAP_SIZE=8192 MAX_QUEUE_SIZE=65536 GRAPH_PRUNING=false NUM_THREADS=1 TMP_DIR=/tmp/colin2 VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false
java.util.NoSuchElementException
at java.util.Scanner.nextLine(Scanner.java:1503)
at net.sf.picard.reference.FastaSequenceIndex.parseIndexFile(FastaSequenceIndex.java:131)
at net.sf.picard.reference.FastaSequenceIndex.<init>(FastaSequenceIndex.java:55)
at net.sf.picard.reference.IndexedFastaSequenceFile.<init>(IndexedFastaSequenceFile.java:95)
at srma.SRMA.doWork(SRMA.java:131)
at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:156)
at srma.SRMA.main(SRMA.java:98)
Please report bugs to [email protected]
The fasta index file looks like this:
more ../NC_002516.fna.fai
NC_002516.2 6264404 58 70 71
Cheers for any help.
I am sorry, please try reducing your read set or the like to a manageable test case. Otherwise, I charge $5KUSD/hourOriginally posted by ymc View PostThere are thousands of lines of these error messages. If I copy the stderr output, it will be too many lines. You can replicate my problem by downloading the pair-ended reads from
ftp://ftp.1000genomes.ebi.ac.uk/vol1...sequence_read/
and then align them using bwa. I got the same bug with SRR098401_*.filt.fastq.gz and SRR035330_*.filt.fastq.gz
Comment
-
Latest Articles
Collapse
-
by GATTACATLove this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
-
Channel: Articles
07-01-2026, 11:43 AM -
-
by SEQadmin2
I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.
Here are nine questions we think about, in roughly the order they matter, before...-
Channel: Articles
-
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, Yesterday, 11:08 AM
|
0 responses
6 views
0 reactions
|
Last Post
by SEQadmin2
Yesterday, 11:08 AM
|
||
|
Started by SEQadmin2, 06-30-2026, 05:37 AM
|
0 responses
11 views
0 reactions
|
Last Post
by SEQadmin2
06-30-2026, 05:37 AM
|
||
|
Started by SEQadmin2, 06-26-2026, 11:10 AM
|
0 responses
19 views
0 reactions
|
Last Post
by SEQadmin2
06-26-2026, 11:10 AM
|
||
|
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population
by SEQadmin2
Started by SEQadmin2, 06-17-2026, 06:09 AM
|
0 responses
53 views
0 reactions
|
Last Post
by SEQadmin2
06-17-2026, 06:09 AM
|
Comment