Hi! I have a values of the mean per-base read coverage for each gene. My question what is it? How to calculate RPKM using these "the mean per-base read coverage". Thank you.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Going from a single number for each gene (which is what I presume you have in 'mean per-base read coverage') to something that encompasses all reads (the 'M' in RPKM) using the length of each gene (the 'K') seems hard if not impossible.
As for using the values for a comparison, normalizing them, making some broad assumptions and looking for gross changes, then I think that you could do so. I'd back up everything you find with other evidence but for a rough estimate of differential then your values will give you guidance.
Comment
-
I suppose if you assume that all of your reads are the same length and if you know the lengths of your exons then you can determine the number of reads for each exon. From that information you can sum up the number of reads for all exons and sum up the lengths of all exons. From that information you can calculate RPKM for each exon. The assumption of all reads being the same length is the most critical part.
Comment
-
First, get the number of reads for each gene, call this number Rz . This would be Rz = L * A / X where:
L = length of the gene
A = average reads per base
X = length of a read
And lower-case 'z' represents the gene number (e.g., R1 is gene #1, R2 is gene #2, etc.)
Second, sum up the number of reads for all of the genes. Divide by 1,000,000. Call this number 'M'. In other words M = sum(Rz) / 1000000
Then for each gene you can determine the RPKM via Rz / L / M
Comment
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 04-25-2024, 11:49 AM
|
0 responses
19 views
0 likes
|
Last Post
by seqadmin
04-25-2024, 11:49 AM
|
||
Started by seqadmin, 04-24-2024, 08:47 AM
|
0 responses
17 views
0 likes
|
Last Post
by seqadmin
04-24-2024, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
62 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
60 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
Comment