Unconfigured Ad

**mudshark** · 02-25-2012, 04:30 AM

in fact it is a cross-correlation not an autocorrelation.

as regards your question: i have seen this before and I don't think it is a problem in the first place. It probably depends on the 'true' fragment size of your target bound DNA, the signal-to-noise ratio and the abundance of target sites. i.e. if your signal to noise is low and the target sites are just a few you will get the average fragment size determined by the size selection step. if you have a good signal to noise and the target protein protects 35 bp of DNA you might get a cross correlation of 35bp.

**Chipper** · 02-25-2012, 11:37 AM

It's the other way around - good signal to noise gives av fragment size, else the correlation is dominated by a peak of exactly the read length. Not sure why though, but has nothing to do with protein DNA protection.

**cwhelan** · 02-27-2012, 12:48 PM

This very insightful and helpful post by Anshul Kundaje on the MACS mailing list has a really good theory involving the mappability of the genome for why you see this pattern in non-enriched ChIP-seq data sets:

http://groups.google.com/group/macs-announcement/msg/d6595465a1f9b212

**skip56558** · 02-28-2012, 02:38 PM

Thank you all for your responses!

I've looked at the data again, and the best cross-correlation profiles are from the best antibodies, so your explanations make sense.

I only have one lingering question: is the data from the not-as-good cross-correlation profiles still usable? That is, do we need to repeat those entire experiments, or will MACS be able to identify the real peaks?

Many thanks!

skip56558

**cwhelan** · 02-28-2012, 02:47 PM

In my experience I have not found realistic-looking or useable peaks in these types of data sets, unfortunately. I usually try to examine some of the peaks in a browser - you can tell pretty quickly if they look like real ChIP-seq peaks, which are very enriched compared to the background, or just like slightly higher regions in a noisy background. Another way to check is to run your peaks through an annotation tool like CEAS and look for enrichment in promoter regions.

My experience is with ChIP-seq for transcription factor binding sites, so that advice might not apply for other types of experiments like histone modifications, though.

**BAMseek** · 02-28-2012, 03:24 PM

Originally posted by cwhelan View Post

This very insightful and helpful post by Anshul Kundaje on the MACS mailing list has a really good theory involving the mappability of the genome for why you see this pattern in non-enriched ChIP-seq data sets:

http://groups.google.com/group/macs-...595465a1f9b212

Here is some more information from the same author: Phantom Peaks

I've also noticed the same thing - that there are usually two peaks: one at the read length and one at the average fragment length. I have found that the strength of the fragment length peak compared to the read length peak is usually a good indicator of the signal-to-noise quality and one's ability to detect peaks in the data.

I've always been under the impression that those peaks at the read length might be caused by PCR duplication, but the above link also has a good idea about biases in mappability.

Justin

Topics	Statistics	Last Post
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity by SEQadmin2 Started by SEQadmin2, Yesterday, 11:08 AM	0 responses 6 views 0 reactions	Last Post by SEQadmin2 Yesterday, 11:08 AM
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, 06-30-2026, 05:37 AM	0 responses 11 views 0 reactions	Last Post by SEQadmin2 06-30-2026, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 19 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 53 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM

Unconfigured Ad

autocorrelation pattern in ChIP-seq alignments

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News