I am analyzing an RNASeq data set. I have screened reads in Trimmomatic and aligned them using bwa mem. The reads are mostly 100bp long, but a minority are as short as 30bp. Reads are submitted in two fastq files and are paired end. The input files each have a total of 61,383,869 reads.
I get the following message in the bwa standard output:
[M::main_mem] read 109456 sequences (10000139 bp)...
[M::mem_pestat] # candidate unique pairs for (FF, FR, RF, RR): (45, 19766, 45, 52)
Can anyone tell me precisely what the above means? Why did bwa read only 109,456 sequences from the millions in the fastq files?
As only 19,908 "unique pairs" in total were reported, does this mean that the rest of the read data were all PCR duplicates?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
Latest Articles
Collapse
-
by seqadmin
Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.
Nobel Prize for MicroRNA Discovery
This week,...-
Channel: Articles
10-07-2024, 08:07 AM -
-
by seqadmin
Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...-
Channel: Articles
09-23-2024, 06:35 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 10-02-2024, 04:51 AM
|
0 responses
104 views
0 likes
|
Last Post
by seqadmin
10-02-2024, 04:51 AM
|
||
Started by seqadmin, 10-01-2024, 07:10 AM
|
0 responses
112 views
0 likes
|
Last Post
by seqadmin
10-01-2024, 07:10 AM
|
||
Started by seqadmin, 09-30-2024, 08:33 AM
|
1 response
116 views
0 likes
|
Last Post
by EmiTom
10-07-2024, 06:46 AM
|
||
Started by seqadmin, 09-26-2024, 12:57 PM
|
0 responses
22 views
0 likes
|
Last Post
by seqadmin
09-26-2024, 12:57 PM
|