if the merging is optional,
how to do overlap reads without merging the two files into one file??
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Originally posted by mido1951 View Postyes I know.
I'm talking of phase before getting contigs.
you know to make an assembly of reads that it must overlap.
In the case of paired end reads, how to find overlaps between the two files?
we F1.fq that contains reads RX.1 and F2.fq that contains reads RX.2.
how to find the overlap between these reads? is what we should do a merging between RX.1 and RX.2?
So the merging is necessary?
thanks
bbmerge.sh in1=F1.fq in2=F2.fq out=merged.fq outu=unmerged.fq
Leave a comment:
-
@Rick/@OTU: In order to avoid a re-hash of things that have been already discussed in other threads I am posting two below.
Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc
Bridged amplification & clustering followed by sequencing by synthesis. (Genome Analyzer / HiSeq / MiSeq)
@mido1951 is either mis-understanding some basic concepts about sequencing/assembly or I am not able to understand what @mid1951 wants to know.
Leave a comment:
-
yes I know.
I'm talking of phase before getting contigs.
you know to make an assembly of reads that it must overlap.
In the case of paired end reads, how to find overlaps between the two files?
we F1.fq that contains reads RX.1 and F2.fq that contains reads RX.2.
how to find the overlap between these reads? is what we should do a merging between RX.1 and RX.2?
So the merging is necessary?
thanks
Leave a comment:
-
Use Panda or Flash (or probably a number of other programs) to do merging.
Leave a comment:
-
You realize that you are not doing this manually, right? You usually use a program, which creates overlapping/merging reads and gives you the contig.
Leave a comment:
-
Code:for example we have two fragments. S1: ATCGTTGAGCAGACT and S2: TGAGCAGACTTAAGTAGTTTT they have an overlap zone"TGAGCAGACT". so the consensus is:ATCGTTGAGCAGACTTAAGTAGTTTT we sequenced this two fragments S1 and S2 in paired end reads R1 and R2. S1:R1: ATCGTTGAGCA S1:R2: AGTCTGCTCAA (illumina read the fragment in the other way so I think that to write in reverse complement. correct me if I'm wrong) S2:R1:TGAGCAGACTT S2:R2:AAAACTACTTA (reverse complement)I think this way is built the R2. how to make assembly in this case?
in both files (paired end), is what we must do the merging of the two files?
Leave a comment:
-
Sorry, you are right. I meant something different.
Your previous post depicts the assembly process correctly.
What I don't understand is what exactly is your question then?
Leave a comment:
-
for example we have two fragments.
S1: ATCGTTGAGCAGACT and S2: TGAGCAGACTTAAGTAGTTTT they have an overlap zone"TGAGCAGACT". so the consensus is:ATCGTTGAGCAGACTTAAGTAGTTTT
we sequenced this two fragments S1 and S2 in paired end reads R1 and R2.
S1:R1: ATCGTTGAGCA
S1:R2: AGTCTGCTCAA (illumina read the fragment in the other way so I think that to write in reverse complement. correct me if I'm wrong)
S2:R1:TGAGCAGACTT
S2:R2:AAAACTACTTA (reverse complement)I think this way is built the R2.
how to make assembly in this case?
Leave a comment:
-
I know the operation of assembly programs.
but I basically want to understand the overlap between the paired end reads (the two files).
i have single cell not meta genome.
have you an example?
Leave a comment:
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Yesterday, 11:49 AM
|
0 responses
15 views
0 likes
|
Last Post
by seqadmin
Yesterday, 11:49 AM
|
||
Started by seqadmin, 04-24-2024, 08:47 AM
|
0 responses
16 views
0 likes
|
Last Post
by seqadmin
04-24-2024, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
61 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
60 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
Leave a comment: