Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Optimal bandwidth for kernel CNV estimation

    I just started working on bandwidth selection for KC-smart and/or similar kernel regression methods for the purpose of identifying recurrent copy number aberrations in NGS libraries from genomic tumor DNA. If there are other people here working on related research and/or potential users of such a bandwidth selection algorithm, maybe we could share some thoughts.

    We use KC-smart for the detection of recurrent copy number abberations across multiple tumor libraries. In short, it works like this:

    - Each library (i.e. sample, i.e. tumor) gives us appr. 5 mio reads. Since the fragments to be read are made with sonocation rather than restriction enzymes, they tend to be at unique locations. Subject to correction for GC bias and such, the density of the reads are proportional to the copy number.

    - The reads are binned into windows of size say 50 kb so that each window has a copy number estimate

    - That copy number estimate is then used by KC smart which is a kernel regression algorithm originally made for aCGH. It produces locally weighted regression coefficients related to research questions such as "does this region have a higher copy number in library class A than in library class B?".

    What I want to do is to make a bandwidth selection algorithm. I want to use the read locations directly without binning them. Bandwidth selection for kernel regression is a little different than for kernel density estimation. Also, I might consider
    - building the GC correction into the bandwidth selection
    - dynamic bandwidth selection, i.e. larger bandwidth in low-copy number regions
    - Shrinking estimates towards nearest integer copy number (the sample may be homogenous with respect to the CN of some regions)
    - handling ambigiously mapped reads

Latest Articles

Collapse

  • seqadmin
    Strategies for Sequencing Challenging Samples
    by seqadmin


    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
    03-22-2024, 06:39 AM
  • seqadmin
    Techniques and Challenges in Conservation Genomics
    by seqadmin



    The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

    Avian Conservation
    Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
    03-08-2024, 10:41 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 06:37 PM
0 responses
7 views
0 likes
Last Post seqadmin  
Started by seqadmin, Yesterday, 06:07 PM
0 responses
7 views
0 likes
Last Post seqadmin  
Started by seqadmin, 03-22-2024, 10:03 AM
0 responses
49 views
0 likes
Last Post seqadmin  
Started by seqadmin, 03-21-2024, 07:32 AM
0 responses
66 views
0 likes
Last Post seqadmin  
Working...
X