Hi, everyone. I meet a problem in drawing Venn diagram to describe the overlapping of peaks between two groups. I defined one base overlapping between one peak from A and another peak from B as overlapping. Now, the problem comes. Sometimes, one peak in A group may overlapping with two near peaks in B group. Then , the number in A group overlapped with B group is not as same as that in B group. How to solve this? Thank you very much!
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X

I guess I do not understand the problem. There will be X number of elements in set A that have overlaps with set B. There will be Y number of elements in set B that overlap with set A. X and Y do not need to be the same number in order to create a Venn diagram.
Now if you want a more complex relationship; e.g., something along the lines of element 'I' in set A has a relationship to elements 'J' and 'K' in set B ... well then a Venn diagram is not what you are looking for.

Lee_sh,
My suggestion is to exclude the "ambiguous" peaks from your lists first, then check 1) the Venn Diagram without the ambiguous peaks; 2) the number of ambiguous peaks. If it represents a small percentage, you may ignore. If it is substantial, you need to fine tune your separation and comparison algorithm.
Comment

Originally posted by westerman View PostI guess I do not understand the problem. There will be X number of elements in set A that have overlaps with set B. There will be Y number of elements in set B that overlap with set A. X and Y do not need to be the same number in order to create a Venn diagram.
I'd have to respectfully disagree with that statement. The overlapping portions of a Venn diagram contain the elements which sets A and B have in common. Since it is single set of elements who size is a finite integer N; set A contains all N elements and set B contains all N elements.
Lee_sh's problem is that a Venn diagram is not an appropriate representation for the data as defined. It would be appropriate to display peaks which are common to the two sets, with the stipulation that a peak in set A can only be considered in common with a single peak of set B and vice versa.
Comment

Originally posted by westerman View PostI guess I do not understand the problem. There will be X number of elements in set A that have overlaps with set B. There will be Y number of elements in set B that overlap with set A. X and Y do not need to be the same number in order to create a Venn diagram.
Now if you want a more complex relationship; e.g., something along the lines of element 'I' in set A has a relationship to elements 'J' and 'K' in set B ... well then a Venn diagram is not what you are looking for.
Comment

Originally posted by DZhang View PostLee_sh,
My suggestion is to exclude the "ambiguous" peaks from your lists first, then check 1) the Venn Diagram without the ambiguous peaks; 2) the number of ambiguous peaks. If it represents a small percentage, you may ignore. If it is substantial, you need to fine tune your separation and comparison algorithm.
Comment

Originally posted by kmcarr View PostRick,
I'd have to respectfully disagree with that statement. The overlapping portions of a Venn diagram contain the elements which sets A and B have in common. Since it is single set of elements who size is a finite integer N; set A contains all N elements and set B contains all N elements.
Lee_sh's problem is that a Venn diagram is not an appropriate representation for the data as defined. It would be appropriate to display peaks which are common to the two sets, with the stipulation that a peak in set A can only be considered in common with a single peak of set B and vice versa.
Comment

Originally posted by kmcarr View PostRick,
I'd have to respectfully disagree with that statement. The overlapping portions of a Venn diagram contain the elements which sets A and B have in common. Since it is single set of elements who size is a finite integer N; set A contains all N elements and set B contains all N elements.
Lee_sh's problem is that a Venn diagram is not an appropriate representation for the data as defined...
Comment

Originally posted by lee_sh View PostYes. Strictly, Venn diagram may be not suitable for this question if the elements in two sets can not meet the criteria as you discribed. However, the phenomena of one peak overlapping with more is common in ChIPseq especially histone modifications.
Any better method to describe such relationship between two sets?
Comment

We tend to look at this in two ways. One is to look at the percentage of peaks in set A that overlap ones in B, and viceversa (percentage in B that overlap A). The other way is to merge the peaks, which gives you a common set of (wider) peaks. In your example where two peaks in one set overlap single peak in the other set, you would be left with a single peak that covers all three of them. Then you can do a proper Venn diagram.
There are function in our Bioconductor package, DiffBind, that make it easy to do both of these: dba.plotVenn will do the merging and plotting in one step, and dba.overlap will give you the eachway overlap rates of unmerged peaks (and extract the overlapping and nonoverlapping sets as well).
Comment
Latest Articles
Collapse

by seqadmin
The first FDAapproved CRISPRbased therapy marked the transition of therapeutic gene editing from a dream to reality^{1}. CRISPR technologies have streamlined gene editing, and CRISPR screens have become an important approach for identifying genes involved in disease processes^{2}. This technique introduces targeted mutations across numerous genes, enabling largescale identification of gene functions, interactions, and pathways^{3}. Identifying the full range...
Channel: Articles
08272024, 04:44 AM 
ad_right_rmr
Collapse
News
Collapse
Topics  Statistics  Last Post  

Started by seqadmin, Yesterday, 08:02 AM

0 responses
10 views
0 likes

Last Post
by seqadmin
Yesterday, 08:02 AM


Started by seqadmin, 09032024, 08:30 AM

0 responses
13 views
0 likes

Last Post
by seqadmin
09032024, 08:30 AM


Started by seqadmin, 08272024, 04:40 AM

0 responses
21 views
0 likes

Last Post
by seqadmin
08272024, 04:40 AM


New SingleMolecule Sequencing Platform Introduces Advanced Features for HighThroughput Genomics
by seqadmin
Started by seqadmin, 08222024, 05:00 AM

0 responses
361 views
0 likes

Last Post
by seqadmin
08222024, 05:00 AM

Comment