I can't seem to find an answer to this simple question, although this must be a fairly common issue. I have Illumina paired-end reads from an RNA-Seq run, and have run a script to filter adapter dimers. This operation has resulted in my having different numbers of reads in my _R1 and _R2 files. Does Bowtie identify mate pairs in these files simply using their order, or does it use the read IDs? If it goes in order, then Bowtie will fail to match the correct pairs from my filtered files. If it uses the IDs, then Bowtie should be ok, unless it crashes when a read doesn't have a mate. Does anyone know how this works? Should I just run this data as single-end to avoid these issues?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Dear Volklor,
Paired inputs for Bowtie2:
Pairs are often stored in a pair of files, one file containing the mate 1s and the other containing the mates 2s. The first mate in the file for mate 1 forms a pair with the first mate in the file for mate 2, the second with the second, and so on. When aligning pairs with Bowtie 2, specify the file with the mate 1s mates using the -1 argument and the file with the mate 2s using the -2 argument. This causes Bowtie 2 to take the paired nature of the reads into account when aligning them.
(http://bowtie-bio.sourceforge.net/bowtie2/manual.shtml)
Please write a simple perl script to order your reads.
Best wishes,
RahulRahul Sharma,
Ph.D
Frankfurt am Main, Germany
-
Thanks for your reply, Rahul. I assume that Bowtie 1 (the version I'm using) works the same way as Bowtie 2. In my case, it is not that read order is the only issue; it's that certain reads don't have mates because they've been filtered out. I think my best bet will be to run this data as single-end.
Comment
-
Hi Volklor,
If you still have the original files somewhere it might be worth running a trimming program that is aware of paired-ends, such as Trimmomatic. We have also written a wrapper around Cutadapt that can do this (trim galore), even though it was initially destined for some other stuff. It would be a shame to let the paired-end information go to waste, wouldn't it?
Comment
Latest Articles
Collapse
-
by seqadmin
Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.
Nucleic Acid Quality Control
Preparing for NGS starts with isolating the...-
Channel: Articles
02-10-2025, 01:58 PM -
-
by seqadmin
In recent years, precision medicine has become a major focus for researchers and healthcare professionals. This approach offers personalized treatment and wellness plans by utilizing insights from each person's unique biology and lifestyle to deliver more effective care. Its advancement relies on innovative technologies that enable a deeper understanding of individual variability. In a joint documentary with our colleagues at Biocompare, we examined the foundational principles of precision...-
Channel: Articles
01-27-2025, 07:46 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Genetic Mapping of Plasmodium knowlesi Identifies Essential Genes and Drug Resistance Mechanisms
by seqadmin
Started by seqadmin, 02-07-2025, 09:30 AM
|
0 responses
72 views
0 likes
|
Last Post
by seqadmin
02-07-2025, 09:30 AM
|
||
Started by seqadmin, 02-05-2025, 10:34 AM
|
0 responses
113 views
0 likes
|
Last Post
by seqadmin
02-05-2025, 10:34 AM
|
||
Started by seqadmin, 02-03-2025, 09:07 AM
|
0 responses
90 views
0 likes
|
Last Post
by seqadmin
02-03-2025, 09:07 AM
|
||
Started by seqadmin, 01-31-2025, 08:31 AM
|
0 responses
49 views
0 likes
|
Last Post
by seqadmin
01-31-2025, 08:31 AM
|
Comment