Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Hi, check this paper: Zhang et al. Modeling ChIP sequencing in silico with applications. PLoS Comput Biol (2008) vol. 4 (8) pp. e1000158
Indeed the Input is not random nor is flat. There are regions (open chromatin, fragile sites) that may be preferentially enriched. One should try to use naked DNA to isolate sequence bias (I believe there's a paper on this... I can't get the reference right now).
d
Leave a comment:
-
I am used to getting only 30% unique reads when doing ChIP-Seq on Arabidopsis genome. The number goes up to around 80% when all alignments are counted.
I would be very interested if dawe could elaborate on the read distribution, because our Input DNA does not span the genome evenly and particularly shows enrichment in exons. We have been wondering why this would be so ?
Leave a comment:
-
Hello Kathrin,
Originally posted by kathrin View PostAfter mapping the reads to a reference genome by using Bowtie and additionally MAQs, around 70% of all reads were uniquely mapped to the reference, which should be a quiet good rate (I guess).
Originally posted by kathrin View PostIn literature I read that the expected number of reads matching the same position and strand can be modeled by the Poisson distribution. Is this assumption also true for ChIP-samples, where we enrich specific chromosomal locations and get rid of those, where the TF doesn't bind? Wouldn't we expect to find more more duplicated reads in ChIP-samples than in the Input samples
I see your point but consider also that genomic DNA (the Input) may contain some genomic features (open chromatin) that may be constitutively "enriched", plus a number of repeated sequences you may not find in the IP. Also, TF binding sites + background noise cover a slice of your genome which is probably "wide" enough to sparse your duplicates.
You may try do zap duplicates from your alignment using picard (assuming you have BAM files)
Originally posted by kathrin View PostDoes anyone know a good peak caller that parameters concerning duplicated reads can be adjusted by the user?
HTH
d
Leave a comment:
-
ChIP-seq duplicate reads/ Poisson distribution
Hi everybody
I started analyzing my first ChIP-seq data set, it contains one ChIP-sample and one input sample. After mapping the reads to a reference genome by using Bowtie and additionally MAQs, around 70% of all reads were uniquely mapped to the reference, which should be a quiet good rate (I guess).
For the input sample ~20 mio reads were left, and for the ChIP-sample ~17 mio reads.
However, I found ~2 mio duplicated reads (matching the same chromosomal location) in the input sample and ~16 mio duplicated reads in the ChIP-sample, which might be due to amplification errors or library preparation.
In literature I read that the expected number of reads matching the same position and strand can be modeled by the Poisson distribution. Is this assumption also true for ChIP-samples, where we enrich specific chromosomal locations and get rid of those, where the TF doesn't bind? Wouldn't we expect to find more more duplicated reads in ChIP-samples than in the Input samples
To identify Peaks I used MACS that removes duplicated reads before calling the peaks. Does anyone know a good peak caller that parameters concerning duplicated reads can be adjusted by the user? I want to try to set a customized threshold for the number of duplicated reads depending on my duplicated read distribution and check the sequences of my peak regions. Fortunately the TFBS motif of my TF is already known, so I can verify my results.
It would be great to get some comments or ideas, as I am an absolute beginner in NGS analyses...
Besides that, thank you for the great forum, it's a great help
Thanks a lot in advance
KathrinTags: None
Latest Articles
Collapse
-
by seqadmin
Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.
Nucleic Acid Quality Control
Preparing for NGS starts with isolating the...-
Channel: Articles
02-10-2025, 01:58 PM -
-
by seqadmin
In recent years, precision medicine has become a major focus for researchers and healthcare professionals. This approach offers personalized treatment and wellness plans by utilizing insights from each person's unique biology and lifestyle to deliver more effective care. Its advancement relies on innovative technologies that enable a deeper understanding of individual variability. In a joint documentary with our colleagues at Biocompare, we examined the foundational principles of precision...-
Channel: Articles
01-27-2025, 07:46 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Genetic Mapping of Plasmodium knowlesi Identifies Essential Genes and Drug Resistance Mechanisms
by seqadmin
Started by seqadmin, 02-07-2025, 09:30 AM
|
0 responses
68 views
0 likes
|
Last Post
by seqadmin
02-07-2025, 09:30 AM
|
||
Started by seqadmin, 02-05-2025, 10:34 AM
|
0 responses
107 views
0 likes
|
Last Post
by seqadmin
02-05-2025, 10:34 AM
|
||
Started by seqadmin, 02-03-2025, 09:07 AM
|
0 responses
83 views
0 likes
|
Last Post
by seqadmin
02-03-2025, 09:07 AM
|
||
Started by seqadmin, 01-31-2025, 08:31 AM
|
0 responses
47 views
0 likes
|
Last Post
by seqadmin
01-31-2025, 08:31 AM
|
Leave a comment: