Originally posted by sci_guy
View Post
Header Leaderboard Ad
Collapse
Software packages for next gen sequence analysis
Collapse
Announcement
Collapse
No announcement yet.
This topic is closed.
X
This is a sticky topic.
X
X
-
There's another program as well for mapping short reads called gnumap (http://dna.cs.byu.edu/gnumap/) made to increase the accuracy with duplicate matches. Open source, creates viewable output (with Affy's Integrated Genome Browser), and produces results very similar to novocraft's.
Comment
-
Another good program: ZOOM! Zillions of oligos mapped, which is online now at Bioinformatics. ZOOM resembles Eland a lot, but it further improves the spaced-seed method. I think SOLiD read mapper also uses quite a similar strategy of spaced seed, but it indexes genome, while ZOOM indexes reads like Eland.
Also, I believe ZOOM is carefully engineered. Outperforming eland with a similar algorithm is non-trivial, even given the advantages in the algorithm. I just wonder how ZOOM-C/I/P may perform. The authors did not give benchmark.Last edited by lh3; 08-07-2008, 12:04 PM.
Comment
-
I've seen the manuscript of their ZOOM publication, which looks impressive, and followed up with someone at the company that produces it - I was told that the software wasn't yet available, and might not be for some period of time.
Unfortunately, I get the impression ZOOM might be vapourware for the forseable future - though if anyone knows more than I do, please feel free to correct me.Last edited by apfejes; 08-07-2008, 12:11 PM.The more you know, the more you know you don't know. —Aristotle
Comment
-
I do not know more that you, apfejes. I just wonder why they do not release the program when the benchmark is ready. This might be the best time for "short" read aligners. In a year time (probably half-a-year time), the length of Illumina reads will come to >70bp which will effectively cripple a lot of current programs, including maq unfortunately.
Comment
-
Thanks - all I know is that when I spoke to my contact at Bioinformatics Solutions, they were using the core of the software to generate benchmarks, but were still working on the application itself. A lot of my questions weren't answered, unfortunately.
Either way, I agree, once we pass 64bp reads (I've heard we're going to start doing 72bp test runs next week), we're going to leave the realm of short read aligners and need to start dealing with medium length (100-1000bp) read aligners, anyhow. (I'm not sure what's in that space, though: blast, blat, exonerate?)
As an aside, I'll save the term "long read alignments" for when Pacific Biosciences releases their SMRT (single molecule - real time) sequencing machine. 5-25k reads are about as long as I expect are going to be necessary for any application, though at that point, you're probably better off doing assembly than alignments.The more you know, the more you know you don't know. —Aristotle
Comment
-
I will be looking out for some benchmark data for > 70bp Illumina runs to test future versions of novoalign/novopaired on. We had anticipated this would be coming and have some ideas on how to handle these.
If anybody could arrange some , public or under NDA, then we'd like that very much to get something out to the community.
Originally posted by lh3 View PostI do not know more that you, apfejes. I just wonder why they do not release the program when the benchmark is ready. This might be the best time for "short" read aligners. In a year time (probably half-a-year time), the length of Illumina reads will come to >70bp which will effectively cripple a lot of current programs, including maq unfortunately.
Comment
-
Hi zee,
I haven't seen the runs yet - but if you message me in a week or two, I might be able to let you know how things ran here.
Actually, make that early September - I'm on holiday for the last 2 weeks of august. (-:The more you know, the more you know you don't know. —Aristotle
Comment
-
Hi, lh3 and apfejes. Thanks for you opinion on ZOOM. I am one of the developers of ZOOM. In fact, we will release the command-line version of ZOOM next week.
We finished the part of ZOOM dealing with Illumina/Solexa data in January of this year. Due to some personal reason, we came back to it until May. Then we found the release of ABI SOLiD data. We wanted to support color space data in ZOOM. After that, we focused on the GUI part, manual, website... We are trying to release an efficient, useful and easy-to-use tool. So, sorry for letting you wait.The good news is it will appear next week.
Welcome to try ZOOM out. We'd like to provide any help we could.
Originally posted by lh3 View PostI do not know more that you, apfejes. I just wonder why they do not release the program when the benchmark is ready. This might be the best time for "short" read aligners. In a year time (probably half-a-year time), the length of Illumina reads will come to >70bp which will effectively cripple a lot of current programs, including maq unfortunately.
Comment
-
Thanks for the update, spirit.
Maybe you could give us a little bit of information on Zoom, as well, since things may have changed since last time I heard anything about it.
What are the longest and shortest reads it can handle effectively?
how does it compare to Eland or MAQ in reads aligned per minute?
How many mismatches does it handle?
Does it have a gapped mode?
What format is required for the reference genome?
What format are the alignments reported in?
Can you comment on the cost/licenses it will be provided under?
Can you give us the link to the download when it's ready?
I'm sure I'm missing other important information, but those are the first questions that occur to me.
Thanks!The more you know, the more you know you don't know. —Aristotle
Comment
-
And may I ask a question? What is the pair-end data of Illumina/Solexa data like? ZOOM now accepts two types of pair-end reads input. One is two fasta input files recording reads from two ends separately. ZOOM will automatically find the reads paired according to their name before mapping. The other is one fasta input files with two reads of a pair appearing in odd line and even line respectively. Thanks.
Originally posted by lh3 View PostAnother good program: ZOOM! Zillions of oligos mapped, which is online now at Bioinformatics. ZOOM resembles Eland a lot, but it further improves the spaced-seed method. I think SOLiD read mapper also uses quite a similar strategy of spaced seed, but it indexes genome, while ZOOM indexes reads like Eland.
Also, I believe ZOOM is carefully engineered. Outperforming eland with a similar algorithm is non-trivial, even given the advantages in the algorithm. I just wonder how ZOOM-C/I/P may perform. The authors did not give benchmark.
Comment
-
I suppose the PET comes out in the format provided by whichever base caller you use. I imagine it wouldn't be too difficult to convert to the formats ZOOM requires.
I believe that people here are starting to look at alternate base calling programs, so maybe someone who knows more than I do can point you in the direction of the documentation for those applications.The more you know, the more you know you don't know. —Aristotle
Comment
Latest Articles
Collapse
-
by seqadmin
Amplicon sequencing is a targeted approach that allows researchers to investigate specific regions of the genome. This technique is routinely used in applications such as variant identification, clinical research, and infectious disease surveillance. The amplicon sequencing process begins by designing primers that flank the regions of interest. The DNA sequences are then amplified through PCR (typically multiplex PCR) to produce amplicons complementary to the targets. RNA targets...-
Channel: Articles
03-21-2023, 01:49 PM -
-
by seqadmin
Targeted sequencing is an effective way to sequence and analyze specific genomic regions of interest. This method enables researchers to focus their efforts on their desired targets, as opposed to other methods like whole genome sequencing that involve the sequencing of total DNA. Utilizing targeted sequencing is an attractive option for many researchers because it is often faster, more cost-effective, and only generates applicable data. While there are many approaches...-
Channel: Articles
03-10-2023, 05:31 AM -
-
by seqadmin
Using automation to prepare sequencing libraries isn’t a new concept, and most researchers are aware that there are numerous benefits to automating this process. However, many labs are still hesitant to switch to automation and often believe that it’s not suitable for their lab. To combat these concerns, we’ll cover some of the key advantages, review the most important considerations, and get real-world advice from automation experts to remove any lingering anxieties....-
Channel: Articles
02-21-2023, 02:14 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Yesterday, 12:26 PM
|
0 responses
7 views
0 likes
|
Last Post
by seqadmin
Yesterday, 12:26 PM
|
||
Started by seqadmin, 03-17-2023, 12:32 PM
|
0 responses
14 views
0 likes
|
Last Post
by seqadmin
03-17-2023, 12:32 PM
|
||
Started by seqadmin, 03-15-2023, 12:42 PM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
03-15-2023, 12:42 PM
|
||
Started by seqadmin, 03-09-2023, 10:17 AM
|
0 responses
68 views
1 like
|
Last Post
by seqadmin
03-09-2023, 10:17 AM
|
Comment