Seqanswers Leaderboard Ad

**boetsie** · 12-20-2010, 12:40 AM

Hi all,

publication of SSPACE is now available at;

http://bioinformatics.oxfordjournals.org/content/early/2010/12/12/bioinformatics.btq683.short

Boetsie

**ganga.jeena** · 12-20-2010, 02:32 AM

congrats

Its grt to hear such an achievement.
Is your paper freely available.
Can you mail me downloadable software copy
Regards,
Ganga Jeena

**dan** · 12-20-2010, 02:38 AM

Congrats!

Before I get into the paper, can I ask if this tool supports 'hierarchical scaffolding' in the way that Bambus (supposedly) does? i.e. If I want to add in 'scaffolding' information based on gene synteny from some related organisms, can I add that in but with a lower priority than the true PE/MP data?

Does it detect repeats from the graph structure like Bambus does now?

I'm curious because Bambus promises a lot of nice functionality, which is why I keep hammering away at it. However, I'm starting to wonder if it's time to jump ship to a tool that is more robust (if perhaps less feature rich).

Cheers,
Dan.

**dan** · 12-20-2010, 03:27 AM

Nice paper! The question that arises is weather we can feed PE data directly to the algorithm, rather than being shoehorned through Bowtie?

For example, Bowtie may not be the best tool for aligning 454 reads to contigs, but I'd still like to use 454 PE data to scaffold my assembly. Is there some intermediate file or Bowtie like PE format that we can feed to SSPACE?

Unfortunately parts of http://bioinformatics.oxfordjournals.org are down, so I can't see the supplementary figure, sorry if that would help address my question.

**boetsie** · 12-20-2010, 10:13 AM

Hi Dan,

thanks for your reply!

It does not fully supports the same hierarchical scaffolding as Bambus. We use a simple approach;

1) Produce scaffolds using the first library
2) Use scaffolds from 1), and produce scaffolds using the second library
3) and so on...

we do not use a priority for the libraries, like Bambus. We let the user determine what order of library is used.

It is able to detect repeats by determining the number of incoming and outcoming 'links' between contigs. Repeats are outputted by the program.

Bambus has indeed more functionality. However, we found that the input options were too complex for simple scaffolding purposes.

About your question about Bowtie;
Unfortunately, only Bowtie is supported at the moment, as SSPACE was designed for Illumina input (or other short paired reads) and based on Bowtie output.

My question; What program do people use for aligning 454 reads, can it produce similar output as Bowtie?

Cheers,
Boetsie

Originally posted by dan View Post

Congrats!

Before I get into the paper, can I ask if this tool supports 'hierarchical scaffolding' in the way that Bambus (supposedly) does? i.e. If I want to add in 'scaffolding' information based on gene synteny from some related organisms, can I add that in but with a lower priority than the true PE/MP data?

Does it detect repeats from the graph structure like Bambus does now?

I'm curious because Bambus promises a lot of nice functionality, which is why I keep hammering away at it. However, I'm starting to wonder if it's time to jump ship to a tool that is more robust (if perhaps less feature rich).

Cheers,
Dan.

**dan** · 12-20-2010, 12:48 PM

Thanks for the clear reply Boetsie, really great to hear that you do do repeat filtering based on graph structure, and allowing the user to pick the order of the libraries seems like a nice strategy.

I've been using Newbler to align 454's PE data to contigs. Newbler automatically handles the specifics of the 454 style PE reads so, although it isn't the best aligner for 454, it is very easy to use the results, which are just tab delimited... You can read about the format of the Newbler PE data here!

Newbler can be persuaded to output ace-like format too, but it doesn't do SAM/BAM IIRC.

I was looking at the code, and it should be easy enough to feed in the data to SSPACE ;-)

**sjackman** · 12-31-2010, 02:45 PM

Hi Boetsie,

Does SSPACE use the SAM output format of Bowtie? If not, could it?

Cheers,
Shaun

**boetsie** · 01-01-2011, 09:05 AM

Hi Shaun,

no it does not, it uses the standard output from bowtie. With modifications to the script, it should be possible to use the SAM format.

Cheers,
Boetsie

**corthay** · 01-12-2011, 04:02 PM

BAC / Fosmid end

Hi boetsie,

Can I use additional BAC/Fosmid ends for scaffolding the pre-assebmled contigs
or scaffolds with SSPACE? If it's possible, is there any parameter for this purpose?

Thanks,
Corthay

**boetsie** · 01-14-2011, 12:33 AM

Originally posted by corthay View Post

Hi boetsie,

Can I use additional BAC/Fosmid ends for scaffolding the pre-assebmled contigs
or scaffolds with SSPACE? If it's possible, is there any parameter for this purpose?

Thanks,
Corthay

Hi Corthay,

i'm not very familiar with BAC/fosmid ends, so there is no parameter for this purpose. However, if;
- these are paired sequences
- the sequences' lengths are below 1024 (maximum input of Bowtie)
- the pairs have either orientation of --> <-- (typical paired-end) or <-- --> (typical mate pair)

I see no problems why you should not give it a try if it satisfies the above points.

Kind regards,
Boetsie

**dan** · 01-14-2011, 12:57 AM

What would be great is a simple tab delimited format for providing paired sequence alignments, rather than going via Bowtie... I had a quick look at the code, but unfortunately I couldn't work out where to add such functionality easily. I'll have another look at some point if nobody else does.

**corthay** · 01-16-2011, 04:48 PM

Hi Boetsie,

Thanks for the response.

I've just specified "k=2" as clone coverage of BAC ends is almost 5x.
As a result, scaffolds N50 is a bit improved and the number of scaffolds is reduced. Thanks for the development of useful tool.

Corthay.

Originally posted by boetsie View Post

Hi Corthay,

i'm not very familiar with BAC/fosmid ends, so there is no parameter for this purpose. However, if;
- these are paired sequences
- the sequences' lengths are below 1024 (maximum input of Bowtie)
- the pairs have either orientation of --> <-- (typical paired-end) or <-- --> (typical mate pair)

I see no problems why you should not give it a try if it satisfies the above points.

Kind regards,
Boetsie

**boetsie** · 01-17-2011, 01:40 AM

Originally posted by dan View Post

What would be great is a simple tab delimited format for providing paired sequence alignments, rather than going via Bowtie... I had a quick look at the code, but unfortunately I couldn't work out where to add such functionality easily. I'll have another look at some point if nobody else does.

Hi Dan,

i know what you mean, but than multiple library input can't be used since we do an hierarchical clustering (first generate scaffolds using one library, than produce scaffolds by aligning next library on first scaffolds and produce new scaffolds etc...). So for each library we align the reads to the new scaffolds. Therefore, no predefined paired sequence alignments could be provided, except if only one library is used. In addition, if we have such an input we would be very similar to Bambus. Our purpose is to have an easy to use scaffolder without providing complex input formats, but with a simple fasta input.

Next week, i'll try to provide another alignment tool (e.g. Newbler) to map long reads to the contigs/scaffolds.

Kind regards,
Boetsie

**boetsie** · 01-17-2011, 01:41 AM

Originally posted by corthay View Post

Hi Boetsie,

Thanks for the response.

I've just specified "k=2" as clone coverage of BAC ends is almost 5x.
As a result, scaffolds N50 is a bit improved and the number of scaffolds is reduced. Thanks for the development of useful tool.

Corthay.

Hi Corthay,

great that it worked and that it improved your assembly a bit!

Kind regards,
Boetsie

Topics	Statistics	Last Post
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks by seqadmin Started by seqadmin, Yesterday, 05:31 AM	0 responses 10 views 0 likes	Last Post by seqadmin Yesterday, 05:31 AM
Small Blood Stem Cell Subset Linked to Immune System Aging by seqadmin Started by seqadmin, 10-24-2024, 06:58 AM	0 responses 20 views 0 likes	Last Post by seqadmin 10-24-2024, 06:58 AM
New AI Model Designs Synthetic DNA Switches for Targeted Gene Expression in Specific Cell Types by seqadmin Started by seqadmin, 10-23-2024, 08:43 AM	0 responses 48 views 0 likes	Last Post by seqadmin 10-23-2024, 08:43 AM
Microbes in Urban Spaces Adapt to Disinfectants and Scarce Resources by seqadmin Started by seqadmin, 10-17-2024, 07:29 AM	0 responses 58 views 0 likes	Last Post by seqadmin 10-17-2024, 07:29 AM

Seqanswers Leaderboard Ad

Announcement

SSPACE: a new stand-alone scaffolding tool for small and large genomes

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News