I am analyzing an RNASeq data set. I have screened reads in Trimmomatic and aligned them using bwa mem. The reads are mostly 100bp long, but a minority are as short as 30bp. Reads are submitted in two fastq files and are paired end. The input files each have a total of 61,383,869 reads.
I get the following message in the bwa standard output:
[M::main_mem] read 109456 sequences (10000139 bp)...
[M::mem_pestat] # candidate unique pairs for (FF, FR, RF, RR): (45, 19766, 45, 52)
Can anyone tell me precisely what the above means? Why did bwa read only 109,456 sequences from the millions in the fastq files?
As only 19,908 "unique pairs" in total were reported, does this mean that the rest of the read data were all PCR duplicates?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
Latest Articles
Collapse
-
by seqadmin
Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.
Long-Read Sequencing
Long-read sequencing has seen remarkable advancements,...-
Channel: Articles
12-02-2024, 01:49 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 12-02-2024, 09:29 AM
|
0 responses
158 views
0 likes
|
Last Post
by seqadmin
12-02-2024, 09:29 AM
|
||
Started by seqadmin, 12-02-2024, 09:06 AM
|
0 responses
56 views
0 likes
|
Last Post
by seqadmin
12-02-2024, 09:06 AM
|
||
Started by seqadmin, 12-02-2024, 08:03 AM
|
0 responses
48 views
0 likes
|
Last Post
by seqadmin
12-02-2024, 08:03 AM
|
||
Started by seqadmin, 11-22-2024, 07:36 AM
|
0 responses
76 views
0 likes
|
Last Post
by seqadmin
11-22-2024, 07:36 AM
|