Seqanswers Leaderboard Ad

**The_Roads** · 07-16-2009, 01:12 PM

no not yet. i'm going to have run through with them as soon as i can. thanks for the info

**polivares** · 07-22-2009, 07:31 AM

Hi all,
I am still confused with the difference between pair ends and mate pairs. Is it just that you call mates a pair of sequences (ends) from the same molecule?

Tnx

**ECO** · 07-22-2009, 07:44 AM

Originally posted by polivares View Post

Hi all,
I am still confused with the difference between pair ends and mate pairs. Is it just that you call mates a pair of sequences (ends) from the same molecule?

Tnx

Hey...Nice last name!

**polivares** · 07-22-2009, 07:53 AM

Originally posted by ECO View Post

Hey...Nice last name!

That means?

**ECO** · 07-22-2009, 08:28 AM

Originally posted by polivares View Post

That means?

Olivares is my surname as well.

Haven't met many before.

**yvan.wenger** · 12-02-2009, 03:24 AM

paired-end and mate pairs

Paired-end and mate pairs. These two denominations refer to slightly different library preparations... correct?

Do we usually have the choice between the two? What are the practical differences between them? Anyone compared possible biases for example?

Best,

Yvan

**ScottC** · 12-02-2009, 01:32 PM

Illumina refers to "paired end" as the original library preparation method they use, where you sequence each end of the same molecule. Because of the way the cluster generation technology works, it is limited to an inter-pair distance of ~300bp ( 200-600bp).

Illumina refers to "mate pairs" as sequences derived from their newer library prep method which is designed to provide paired sequences separated by a greater distance (between about 2 and 10kb). This method still actually only sequences the ends of ~400bp molecules, but this template is derived from both ends of a 2-10kb fragment that has had the middle section cut out and the 'internal' ends ligated in the middle. Basically, you take your 2-10kb random fragments, biotinylate the end, circularise them, shear the circles to ~400bp, capture biotinylated molecules, and then sequence those (they go into what is essentially a standard 'paired end' sample prep procedure).

**thondeboer** · 03-07-2010, 10:16 PM

Will original insert size of library be in SAM file header for Mate-paired sequences?

Would the original insert size for the library be listed in any resulting sequence alignment files (SAM/BAM for instance) for mate-paired reads (typicaly 2-5 kBp) or would it list the insert size of the paired-end reads (Typically ~500 bp). This would be important for the downstream Structural Variant analysis I can imagine...

I know that SAM/BAM is not (yet) produced by Illumina or Solid systems natively, but do any translators take care of this?

Thanks!

**nilshomer** · 03-07-2010, 10:30 PM

Originally posted by thondeboer View Post

Would the original insert size for the library be listed in any resulting sequence alignment files (SAM/BAM for instance) for mate-paired reads (typicaly 2-5 kBp) or would it list the insert size of the paired-end reads (Typically ~500 bp). This would be important for the downstream Structural Variant analysis I can imagine...

I know that SAM/BAM is not (yet) produced by Illumina or Solid systems natively, but do any translators take care of this?

Thanks!

Take a look at the SAM specifcation. The insert size of an alignment (ISIZE) is defined as the distance between the 5' ends of the reads after mapping. The PI field in the RG tag in the header allows reads with the same expected insert size to be identified.

So to answer your question, if a read has its two ends 2-5Kbp or 500bp apart, ISIZE should be set accordingly. Optionally, the predicted size can be inferred from PI/RG in the header (when available).

**flobpf** · 05-03-2010, 06:45 PM

I think paired end refers to sequencing from the ENDS of a DNA fragment. If you're doing 75-bp paired end read using an insert size of 300bp, then the machine will sequence 1-75 and 300-225 (for simplicity, omitting the adapters)

Mate pair requires a completely different protocol and is typically over longer distances such as 2-5kb. If you want to sequence a 5kb mate pair library, then 5kb fragments of DNA are isolated on the gel, the ends are biotinylated, the fragment is circularized and sheared. So now when you select using streptavidin, you'll get the fragment that has the ENDS of the original 5kb fragment. This fragment is then sequenced.

Mate pair is more relevant in genome assembly, especially for covering repetitive sequences. Paired end can be used for anything - RNA, DNA.

You can get more information on the Illumina site:

Page Not Found

http://www.illumina.com/technology/paired_end_sequencing_assay.ilmn

Page Not Found

http://www.illumina.com/technology/mate_pair_sequencing_assay.ilmn

I'm not sure how 454 or other methods define these terms

**krobison** · 05-04-2010, 05:24 AM

Mate pair can also cover longer distances & is therefore more sensitive for detecting rearrangements / structural variation.

A third entry in this general category are what Pacific Biosciences calls strobe reads and Helicos calls dark fill. Essentially what happens is that sequence is read then there is a gap of probabilistic length then more sequence is read. This can be repeated many times to give a series of sequence islands separated by gaps with a constrained length distribution.

**plichel** · 05-12-2010, 02:03 PM

I am not very familiar with all the biotechnology, so want to ask:
Is it possible to resolve which pair of reads come from the one, unique molecule ? (For instance, that the sequencer tracks this information somehow.) I mean usually there are a lot of reads and in diploid systems it might happen there are two 'versions' of the reads.

**hege** · 06-07-2010, 01:34 AM

I understand that a normal read pair can be aligned with either F-B or B-F orientation. What I'm not too sure about the importance of read1 and read2, does it mean anything if the mapped position of read1 is greater than read2?

**drio** · 06-07-2010, 11:42 AM

Originally posted by hege View Post

I understand that a normal read pair can be aligned with either F-B or B-F orientation. What I'm not too sure about the importance of read1 and read2, does it mean anything if the mapped position of read1 is greater than read2?

Assuming the alignment is correct that may indicate a structural variation happened at that location (you want to check what other alignments on that location tell you).

**krobison** · 06-08-2010, 06:19 AM

Originally posted by plichel View Post

I am not very familiar with all the biotechnology, so want to ask:
Is it possible to resolve which pair of reads come from the one, unique molecule ? (For instance, that the sequencer tracks this information somehow.)

Yes. In any of these schemes the sequencer keeps track of the physical location in the flowcell of the first read for each polony (spot of DNA) and then

For ligation technologies (Polonator & SOLiD), the reverse reads are really the same technology. For Illumina, there are some clever molecular acrobatics to replace the originally sequenced molecule with it's reverse complement to do paired end sequencing. For mate pairs, I believe they just strip the old extended primer & anneal a new primer. Helicos proposed just adding unlabeled bases for a time period (hence the term "dark fill") and PacBio shuts off their laser and keeps extending (hence "strobe sequencing" is an apropos term).

Complete Genomics used a 4-part read structure & may have gone to even higher number of mates.

Keeping track of the location of polonies or individual DNA molecules is one of the core tricks to all these technologies.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 35 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 29 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News