
**boetsie** · 07-23-2013, 08:56 AM

These are the original contigs (the order follows the order of the contigs in the FASTA file). An 'f' indicates that the contig has forward orientation in the final scaffold; an 'r' indicates reverse orientation.

**Yue Xu** · 10-18-2013, 04:08 AM

**boetsie** · 10-18-2013, 05:41 AM

The negative gap indicates a potential overlap between the two contigs. However, a 615bp overlap between the contigs seems unlikely, which suggests that the insert size you've provided in the library file is not correct.

To illustrate how this is estimated:

Say you have two contigs, contig1 of 1000bp and contig2 of 2000bp. One read of a pair aligns at position 900 on contig1 and the other at position 100 on contig2.

If you set the insert size to 210bp, the estimated gap is:
Provided insert size - ((size of contig1)-(position of read1 on contig1)) + (position of read2 on contig2). In this case it is:
210 - (1000-900) + 100 = 10

So the gap is 10bp. If we change the insert size to 2000, it is:

2000 - (1000-900) + 100 = 1800

If we change the insert size to 100, it is:

100 - (1000-900) + 100 = -100

As you can see, the estimated gap depends directly on the insert size provided by the user.
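
The same arithmetic can be sketched in a few lines of Python. This is only an illustration of the calculation, not SSPACE's actual code: the function and parameter names are made up for this example, and the subtraction is applied to the sum of both read offsets, which is what the results 10, 1800 and -100 imply.

```python
def estimated_gap(insert_size, contig1_len, read1_pos, read2_pos):
    """Estimate the gap between two contigs from a single read pair.

    read1_pos is the position of one read on contig1 and read2_pos the
    position of its mate on contig2. All names here are illustrative
    assumptions, not identifiers taken from SSPACE.
    """
    # Part of the insert lying on contig1: from the read to the contig end.
    on_contig1 = contig1_len - read1_pos
    # Part of the insert lying on contig2: from the contig start to the mate.
    on_contig2 = read2_pos
    # Whatever the insert does not spend on the contigs is assigned to the gap.
    return insert_size - (on_contig1 + on_contig2)


# The three insert sizes from the example, with the same read positions.
for insert_size in (210, 2000, 100):
    print(insert_size, estimated_gap(insert_size, 1000, 900, 100))
```

Because the read positions stay fixed, every base added to or removed from the insert size shifts the estimated gap by exactly one base.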

In your case I see a number of large negative gaps, which is highly unusual. Since the estimated gap grows with the insert size, you should probably raise your insert size by about 600 bases.

Regards,
Boetsie

**Yue Xu** · 10-18-2013, 07:39 AM

Hi, thank you for your detailed reply. Thanks to it, I now understand how the gap between contigs is calculated in SSPACE. Thank you very much.
But about the formula you wrote:
Provided insert size - ***(***((size of contig1)-(position of read1 on contig1)) + (position of read2 on contig2)***)***
Is it missing a pair of brackets, which I have marked in bold and italic?

Yours sincerely,
Yue Xu


SSPACE help
