Originally posted by ohofmann
View Post
Unconfigured Ad
Collapse
X
-
Hi Nils
Just wondering can SRMA be used for rescuing orphaned reads. So we have a dataset of variable insert library as we are sequencing the 5' and 3' end of transcripts. As a result the distance between the mates( <--- --->) is dependent on the length of transcript. To map the reads initially I am first using Mosaik which i belv does a better job with variable insert mate pair data.
After mapping we still see 40% orphaned reads where one read maps and the other doesn't. I am wondering if SRMA can rescue these reads.
Thanks!
-Abhi
Comment
-
-
I have used it and it is fast. I have sometimes had trouble with files in the 100GB range but generally it works fine.Originally posted by ymc View PostDead project now? Are there other alternatives that work on the whole genome?
We have also parallelized the GATK implementation of LR if you are interested. I am not sure which is better at realigning. I do remember comparing SRMA and GATK LR and there are differences but it was not clear to me if one was consistently better than the other. I suspect that Nils would be a better source for info on that.
Comment
-
-
Tried several bams with 0.1.16 but all I got was this:
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
Comment
-
-
Could you post the full error message?Originally posted by ymc View PostTried several bams with 0.1.16 but all I got was this:
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
at java.util.ArrayList$SubList.add(ArrayList.java:965)
Comment
-
-
I have been interested in this tool for some time but never got it working:
Input is a sorted bam.
java -Xmx16g -jar srma-0.1.15.jar I=491_full_s.bam O=srma_491.bam R=../NC_002516.fna
[Fri Aug 17 10:00:54 CEST 2012] srma.SRMA INPUT=[491_full_s.bam] OUTPUT=[srma_491.bam] REFERENCE=../NC_002516.fna OFFSET=20 MIN_MAPQ=0 MINIMUM_ALLELE_PROBABILITY=0.1 MINIMUM_ALLELE_COVERAGE=3 MAXIMUM_TOTAL_COVERAGE=100 CORRECT_BASES=false USE_SEQUENCE_QUALITIES=true QUIET_STDERR=false MAX_HEAP_SIZE=8192 MAX_QUEUE_SIZE=65536 GRAPH_PRUNING=false NUM_THREADS=1 TMP_DIR=/tmp/colin2 VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false
java.util.NoSuchElementException
at java.util.Scanner.nextLine(Scanner.java:1503)
at net.sf.picard.reference.FastaSequenceIndex.parseIndexFile(FastaSequenceIndex.java:131)
at net.sf.picard.reference.FastaSequenceIndex.<init>(FastaSequenceIndex.java:55)
at net.sf.picard.reference.IndexedFastaSequenceFile.<init>(IndexedFastaSequenceFile.java:95)
at srma.SRMA.doWork(SRMA.java:131)
at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:156)
at srma.SRMA.main(SRMA.java:98)
Please report bugs to [email protected]
The fasta index file looks like this:
more ../NC_002516.fna.fai
NC_002516.2 6264404 58 70 71
Cheers for any help.
Comment
-
-
There are thousands of lines of these error messages. If I copy the stderr output, it will be too many lines. You can replicate my problem by downloading the pair-ended reads from
ftp://ftp.1000genomes.ebi.ac.uk/vol1...sequence_read/
and then align them using bwa. I got the same bug with SRR098401_*.filt.fastq.gz and SRR035330_*.filt.fastq.gz
Comment
-
-
It looks like your FASTA index is broken. Can you try rebuilding?Originally posted by colindaven View PostI have been interested in this tool for some time but never got it working:
Input is a sorted bam.
java -Xmx16g -jar srma-0.1.15.jar I=491_full_s.bam O=srma_491.bam R=../NC_002516.fna
[Fri Aug 17 10:00:54 CEST 2012] srma.SRMA INPUT=[491_full_s.bam] OUTPUT=[srma_491.bam] REFERENCE=../NC_002516.fna OFFSET=20 MIN_MAPQ=0 MINIMUM_ALLELE_PROBABILITY=0.1 MINIMUM_ALLELE_COVERAGE=3 MAXIMUM_TOTAL_COVERAGE=100 CORRECT_BASES=false USE_SEQUENCE_QUALITIES=true QUIET_STDERR=false MAX_HEAP_SIZE=8192 MAX_QUEUE_SIZE=65536 GRAPH_PRUNING=false NUM_THREADS=1 TMP_DIR=/tmp/colin2 VERBOSITY=INFO QUIET=false VALIDATION_STRINGENCY=STRICT COMPRESSION_LEVEL=5 MAX_RECORDS_IN_RAM=500000 CREATE_INDEX=false CREATE_MD5_FILE=false
java.util.NoSuchElementException
at java.util.Scanner.nextLine(Scanner.java:1503)
at net.sf.picard.reference.FastaSequenceIndex.parseIndexFile(FastaSequenceIndex.java:131)
at net.sf.picard.reference.FastaSequenceIndex.<init>(FastaSequenceIndex.java:55)
at net.sf.picard.reference.IndexedFastaSequenceFile.<init>(IndexedFastaSequenceFile.java:95)
at srma.SRMA.doWork(SRMA.java:131)
at net.sf.picard.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:156)
at srma.SRMA.main(SRMA.java:98)
Please report bugs to [email protected]
The fasta index file looks like this:
more ../NC_002516.fna.fai
NC_002516.2 6264404 58 70 71
Cheers for any help.
I am sorry, please try reducing your read set or the like to a manageable test case. Otherwise, I charge $5KUSD/hourOriginally posted by ymc View PostThere are thousands of lines of these error messages. If I copy the stderr output, it will be too many lines. You can replicate my problem by downloading the pair-ended reads from
ftp://ftp.1000genomes.ebi.ac.uk/vol1...sequence_read/
and then align them using bwa. I got the same bug with SRR098401_*.filt.fastq.gz and SRR035330_*.filt.fastq.gz
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
-
by SEQadmin2
With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.
Introduction
Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...-
Channel: Articles
05-22-2026, 06:42 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, Today, 10:09 AM
|
0 responses
8 views
0 reactions
|
Last Post
by SEQadmin2
Today, 10:09 AM
|
||
|
Started by SEQadmin2, Yesterday, 08:59 AM
|
0 responses
14 views
0 reactions
|
Last Post
by SEQadmin2
Yesterday, 08:59 AM
|
||
|
Started by SEQadmin2, 06-02-2026, 12:03 PM
|
0 responses
22 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 12:03 PM
|
||
|
Started by SEQadmin2, 06-02-2026, 11:40 AM
|
0 responses
19 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 11:40 AM
|
Comment