what is a paired-end read?

swbarnes2 replied

10-15-2011, 11:39 PM
Originally posted by raonyguimaraes View Post

Hello,

Suppose I have two reads from an exome in fastq. How to determine if they are pair-end or single-end or mate-pair ?

Paired end or mate pair reads have to have a means of knowing which two reads go together. Usually, this is by the name. In Illumina at least, reads are normally named by their coordinates on the flowcell. So if you don't have two reads with the same coordiantes, you've got single end.

Paired ends run towards each other, and are about 100-500 bp apart. Mate pairs run away from each other, and tend to be a few kb apart, but I belive sometimes they are contaminated with ordinary paired end data.
Leave a comment:
arkal replied

10-14-2011, 10:12 PM
Originally posted by raonyguimaraes View Post

Hello,

Suppose I have two reads from an exome in fastq. How to determine if they are pair-end or single-end or mate-pair ?

i could be wrong but i think u can get ur answer by decoding the name of the fastq read... i.e the string following the > symbol.

-A
Leave a comment:
raonyguimaraes replied

08-18-2011, 10:15 AM
Hello,

Suppose I have two reads from an exome in fastq. How to determine if they are pair-end or single-end or mate-pair ?
Leave a comment:
ScottC replied

06-28-2011, 03:02 PM
Originally posted by edilana.gomes View Post

Hi!!
Can anyone clarify the difference between mate pair, pair end and single end reads?

Thanks.

Hi,

Mate-pairs and paired-end reads have been covered. Single end reads are just that... a single read from one end of each sheared DNA fragment.

Scott.
Leave a comment:
edilana.gomes replied

06-28-2011, 02:06 PM
What is the difference between mate pairs, pair end and single end?

Hi!!
Can anyone clarify the difference between mate pair, pair end and single end reads?

Thanks.
Leave a comment:
karve replied

02-18-2011, 01:30 PM
This helped but not enough ! I think I get it but then questions pop up - this mates in a pair, paired read stuff pops up when I look a the specification for the SAM format output from the bowtie program. There are flags values that are returned to indicate if the aligned read is one of pair and/or first one in a pair and/or second one in a pair and so on..

But how does it, meaning bowtie, know ? Am I right in thinking that is specified in the input record ? I can see the .fastq input format has a place holder for this, so I'm guessing that other input formats eg. sra also have it ? And if they don't then there's no way for bowtie ( or other algorithms ) to derive it ?

Seeing as this is all equipment specific I'm going to need to look at some videos that describe the front parts of the this entire operation. Any ideas where ? I was hoping to just pick it up and work with it from the point of sequences of characters - an IT perspective - but guess not.
Leave a comment:
fedor5002 replied

01-17-2011, 07:38 AM
About single-end read

For a double-stranded DNA molecule, are the single-end reads generated by sequencing from both 3' ends of the two strands of DNA?
Leave a comment:
coswaters replied

09-28-2010, 12:47 AM
scaffold and contig helps a lot~ thanks a lot~
Leave a comment:
jdrum00 replied

06-08-2010, 08:36 AM
Thanks to everyone who contributes to these basic threads. I know they may get tedious for the old-timers, but they're vital for newer folks. Even if the same info is technically available at reference sites, it's often easier to understand when the answer is to a specific, basic question, rather than organized as vendor documentation or generalized teaching material!
Leave a comment:
krobison replied

06-08-2010, 06:19 AM
Originally posted by plichel View Post

I am not very familiar with all the biotechnology, so want to ask:
Is it possible to resolve which pair of reads come from the one, unique molecule ? (For instance, that the sequencer tracks this information somehow.)

Yes. In any of these schemes the sequencer keeps track of the physical location in the flowcell of the first read for each polony (spot of DNA) and then

For ligation technologies (Polonator & SOLiD), the reverse reads are really the same technology. For Illumina, there are some clever molecular acrobatics to replace the originally sequenced molecule with it's reverse complement to do paired end sequencing. For mate pairs, I believe they just strip the old extended primer & anneal a new primer. Helicos proposed just adding unlabeled bases for a time period (hence the term "dark fill") and PacBio shuts off their laser and keeps extending (hence "strobe sequencing" is an apropos term).

Complete Genomics used a 4-part read structure & may have gone to even higher number of mates.

Keeping track of the location of polonies or individual DNA molecules is one of the core tricks to all these technologies.
Leave a comment:
drio replied

06-07-2010, 11:42 AM
Originally posted by hege View Post

I understand that a normal read pair can be aligned with either F-B or B-F orientation. What I'm not too sure about the importance of read1 and read2, does it mean anything if the mapped position of read1 is greater than read2?

Assuming the alignment is correct that may indicate a structural variation happened at that location (you want to check what other alignments on that location tell you).
Leave a comment:
hege replied

06-07-2010, 01:34 AM
I understand that a normal read pair can be aligned with either F-B or B-F orientation. What I'm not too sure about the importance of read1 and read2, does it mean anything if the mapped position of read1 is greater than read2?
Leave a comment:
plichel replied

05-12-2010, 02:03 PM
I am not very familiar with all the biotechnology, so want to ask:
Is it possible to resolve which pair of reads come from the one, unique molecule ? (For instance, that the sequencer tracks this information somehow.) I mean usually there are a lot of reads and in diploid systems it might happen there are two 'versions' of the reads.
Leave a comment:
krobison replied

05-04-2010, 05:24 AM
Mate pair can also cover longer distances & is therefore more sensitive for detecting rearrangements / structural variation.

A third entry in this general category are what Pacific Biosciences calls strobe reads and Helicos calls dark fill. Essentially what happens is that sequence is read then there is a gap of probabilistic length then more sequence is read. This can be repeated many times to give a series of sequence islands separated by gaps with a constrained length distribution.
Leave a comment:
flobpf replied

05-03-2010, 06:45 PM
I think paired end refers to sequencing from the ENDS of a DNA fragment. If you're doing 75-bp paired end read using an insert size of 300bp, then the machine will sequence 1-75 and 300-225 (for simplicity, omitting the adapters)

Mate pair requires a completely different protocol and is typically over longer distances such as 2-5kb. If you want to sequence a 5kb mate pair library, then 5kb fragments of DNA are isolated on the gel, the ends are biotinylated, the fragment is circularized and sheared. So now when you select using streptavidin, you'll get the fragment that has the ENDS of the original 5kb fragment. This fragment is then sequenced.

Mate pair is more relevant in genome assembly, especially for covering repetitive sequences. Paired end can be used for anything - RNA, DNA.

You can get more information on the Illumina site:

Page Not Found

http://www.illumina.com/technology/paired_end_sequencing_assay.ilmn

Page Not Found

http://www.illumina.com/technology/mate_pair_sequencing_assay.ilmn

I'm not sure how 454 or other methods define these terms
Leave a comment:

Previous 1 2 3 4 5 6 template Next

Current Approaches to Protein Sequencing

by seqadmin

Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
- Channel: Articles
04-04-2024, 04:25 PM
Strategies for Sequencing Challenging Samples

by seqadmin

Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
- Channel: Articles
03-22-2024, 06:39 AM

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 31 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Leave a comment:

Latest Articles

ad_right_rmr

News