Hello,
I am trying to use breakdancer for the first time, and I am wondering about rational strategies for trying to filter down to the most likely candidates. I am hampered by not completely understanding some of the output. So I would appreciate any advice people have. But barring that, I would appreciate help on understanding the read support information that is given (more below), since it is not well explained.
More specifically, for the moment I have low-pass (10x coverage) WGS data on 5 samples (1 normal + 4 other samples from the same patient) and I ran breakdancer on all -- a pooled analysis. I have SNP array data on these patients, so I'd like to use that as a check, if possible.
An obvious candidate is to look at the number of reads supporting and reject very small or very large. Another thing I have been looking at is only those where the positions are fairly far apart. I've also seen masking repeat regions as a suggestion.
In addition, I would like to make use of the orientation information ("The orientation is a string that records the number of reads mapped to the plus (+) or the minus (-) strand in the anchoring regions."). For example, if the number of reads supporting the match is much smaller than the reads in orientation or if the +/- mapping is not roughly equal perhaps these might be problems (both difficult to gauge in low-pass since the numbers are small, but I still like to understand if these are reasonable ideas for filters). But I don't really know enough about these numbers to know if these are sensible, for example does the reads mapped mean all reads, or just the abherrant reads identified by the program? Does anyone have any insight into how these numbers are determined?
Thank you very much.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
Latest Articles
Collapse
-
by seqadmin
Spatial biology is an exciting field that encompasses a wide range of techniques and technologies aimed at mapping the organization and interactions of various biomolecules in their native environments. As this area of research progresses, new tools and methodologies are being introduced, accompanied by efforts to establish benchmarking standards and drive technological innovation.
3D Genomics
While spatial biology often involves studying proteins and RNAs in their...-
Channel: Articles
01-01-2025, 07:30 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Yesterday, 07:35 AM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
Yesterday, 07:35 AM
|
||
Started by seqadmin, 01-23-2025, 09:43 AM
|
0 responses
14 views
0 likes
|
Last Post
by seqadmin
01-23-2025, 09:43 AM
|
||
Started by seqadmin, 01-23-2025, 08:36 AM
|
0 responses
18 views
0 likes
|
Last Post
by seqadmin
01-23-2025, 08:36 AM
|
||
Started by seqadmin, 01-17-2025, 09:38 AM
|
0 responses
37 views
0 likes
|
Last Post
by seqadmin
01-17-2025, 09:38 AM
|