Not sure if this is what you want, but a few months back I wrote a very simple perl script (probably not very good, I'm a biologist not a programmer, so my scripts tend to be brute force approaches) that will take a sam file and spit out the names of the reads that map more than once and how many times they mapped.
Now I can't even remember what I was using it for, but maybe it will help you get to where you need.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
First find the number of reads that map at least once. To do this, run bowtie with "-k 1" option. Lets call this number K1.
Then run bowtie with the "-k 2" option. This will report up to two alignments per read. Lets call this number K2.
Subtracting the K1 from K2 gives the number of reads that mapped two or more times. Lets call this number K2p.
To find the number of reads that mapped exactly once, subtract K2p from K1.
To find the number of reads that did not map at all, just subtract K1 from the total number of reads in your sample.
Leave a comment:
-
Bowtie call to get unique, multi-hits and nonmatching reads
Hello there,
I want to estimate (1) the number of reads that map uniquely (in one place only), (2) the number of reads that map in multiple places and (3) the number of reads that do not map at all.
Is this the correct bowtie call?
-m 1 --un nomatch_reads.txt -- max multihits_reads.txt
Thanks!Tags: None
Latest Articles
Collapse
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
-
by seqadmin
Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...-
Channel: Articles
03-22-2024, 06:39 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
31 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
33 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
||
Started by seqadmin, 04-10-2024, 09:21 AM
|
0 responses
28 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 09:21 AM
|
||
Started by seqadmin, 04-04-2024, 09:00 AM
|
0 responses
53 views
0 likes
|
Last Post
by seqadmin
04-04-2024, 09:00 AM
|
Leave a comment: