As the title says? What's the minimum coverage required for alignment of approx ~50Mbp to a reference?
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
To echo swbarnes2, you need to specify your problem both in terms of experiment design & goals.
In particular, what does the 50Mbp have to do with it? Is this the genome size? Do you have a set of clones which should add up to 50Mbp? Are you trying to capture 50Mbp? The latter two would require baking in some overage for vector or off-target sequences.
Comment
-
In general, your total coverage target should depend on your experimental setup. As the other posters have mentioned, if you're sequencing a 50mb genome, then all of your reads should come from that genome. If you're doing 50mb exome sequencing from a much larger genome, then a lot of your reads will fall outside the target regions because the capture isn't 100% efficient, and you'll also have issues with some areas capturing or amplifying better than others.
Also, the coverage requirement will depend on what you want to get out of sequencing. If every sample is individually important (e.g. for clinical studies), and you want to see every SNP and indel, even heterozygous ones, in every sample then you'll need high coverage, like 100+ (say 3-4 samples per lane of a HiSeq 1000/2000, or a 1500/2500 running in high-output mode).
If you're more interested in population-level data where the exact genotype for a single sample isn't crucial, or you only care about homozygous mutations, you can probably go lower. If you're confused, give us some more info about what your goals for this sequencing project are and I'll try to suggest a reasonable coverage target.
Comment
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 08:06 AM
|
0 responses
11 views
0 likes
|
Last Post
by seqadmin
Today, 08:06 AM
|
||
Started by seqadmin, 04-30-2024, 12:17 PM
|
0 responses
13 views
0 likes
|
Last Post
by seqadmin
04-30-2024, 12:17 PM
|
||
Started by seqadmin, 04-29-2024, 10:49 AM
|
0 responses
19 views
0 likes
|
Last Post
by seqadmin
04-29-2024, 10:49 AM
|
||
Started by seqadmin, 04-25-2024, 11:49 AM
|
0 responses
26 views
0 likes
|
Last Post
by seqadmin
04-25-2024, 11:49 AM
|
Comment