Clustering of ChIPseq data

mizar106

Junior Member

Join Date: Mar 2014

Posts: 6

09-15-2015, 02:01 AM

Hi guys!
I have a question about analysing different types of ChIP-seq data. I want to combine them by clustering, to look for meaningful epigenetic patterns in the dataset. Briefly, I generated a matrix in which the rows are 200 bp genomic bins (several million rows) and the columns are epigenetic marks. I apply PAM clustering (clara from the 'cluster' R package) to the matrix; fortunately it seems to work, and it is quite fast.

The problem is how to determine the optimal number of clusters. I tried several approaches from different R packages (silhouette, pamk, gap statistic and so on), but none of them worked on the full matrix because they require too much memory in R.

So my idea is to extract a subset of, let's say, 10,000-50,000 rows from the full matrix and use it to infer the optimal number of clusters. Do you think that would be valid? In that case, of course, I would have to find a good criterion for choosing the subset. For the moment I haven't found any other way to set the optimal k. I would be very grateful if somebody could help me. Thanks a lot.
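To make the subsampling idea concrete, here is a minimal sketch in plain Python rather than R. Everything in it is an assumption for illustration: the toy matrix (three synthetic groups over four "marks"), the uniform random subsample of 150 rows, the Manhattan distance, and the naive alternating k-medoids with farthest-first seeding, which stands in for clara/pam and is not the same algorithm. It only shows the shape of the procedure: cluster the subset for several values of k and keep the k with the highest average silhouette width.

```python
import random

def dist(a, b):
    """Manhattan distance between two rows (one value per mark)."""
    return sum(abs(x - y) for x, y in zip(a, b))

def nearest(row, medoids):
    """Index of the medoid closest to `row`."""
    return min(range(len(medoids)), key=lambda j: dist(row, medoids[j]))

def k_medoids(rows, k, rng, max_iter=20):
    """Naive alternating k-medoids with farthest-first seeding (not PAM's swap phase)."""
    medoids = [rng.choice(rows)]
    while len(medoids) < k:
        # Next seed: the row farthest from its closest existing medoid.
        medoids.append(max(rows, key=lambda r: min(dist(r, m) for m in medoids)))
    for _ in range(max_iter):
        labels = [nearest(r, medoids) for r in rows]
        updated = []
        for j in range(k):
            members = [r for r, lab in zip(rows, labels) if lab == j]
            # New medoid: the member with the smallest total distance to its cluster.
            updated.append(min(members, key=lambda c: sum(dist(c, m) for m in members))
                           if members else medoids[j])
        if updated == medoids:
            break
        medoids = updated
    return [nearest(r, medoids) for r in rows]

def mean_silhouette(rows, labels):
    """Average silhouette width over all rows; singleton clusters contribute 0."""
    clusters = {}
    for r, lab in zip(rows, labels):
        clusters.setdefault(lab, []).append(r)
    total = 0.0
    for r, lab in zip(rows, labels):
        own = clusters[lab]
        if len(own) < 2 or len(clusters) < 2:
            continue
        a = sum(dist(r, o) for o in own) / (len(own) - 1)   # mean distance within own cluster
        b = min(sum(dist(r, o) for o in c) / len(c)         # mean distance to nearest other cluster
                for other, c in clusters.items() if other != lab)
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / len(rows)

# Toy stand-in for the bins-by-marks matrix: 3 synthetic groups, 4 marks (assumed data).
rng = random.Random(0)
full = [[rng.gauss(c, 0.5) for _ in range(4)] for c in (0.0, 5.0, 10.0) for _ in range(1000)]

# Uniform random subsample of rows; the subset size here is arbitrary.
subset = rng.sample(full, 150)

# Average silhouette width on the subset for each candidate k.
scores = {k: mean_silhouette(subset, k_medoids(subset, k, rng)) for k in range(2, 6)}
best_k = max(scores, key=scores.get)
print(scores, best_k)
```

On real data it would probably be worth repeating this over several independent subsamples and checking that the chosen k is stable, since a single uniform draw may under-represent rare bin types. In R, the analogous step would presumably be to run pam on each subset for a range of k and compare the average silhouette widths it reports.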
fran