Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Targeted Genome Assembly for region poorly represented in reference genome?

    Hello,

    I am not sure if this is the best place to put this, but any help is appreciated.

    I am working on identifying candidate genes for a mutation in a subtelomeric region of Zebrafish. The Zv9 Assembly is not great, and is particularly bad in this region. The region contains an improperly placed clone with WGS contigs surrounding it, which contain genes known to be deleted in one of the mutant alleles but are not causative. The reference in this region is so poor that my best bet is to attempt to re-build it. I am already in contact with the people at Sanger in regards to this, but I also have quite a bit of differential RNAseq data as well as WGS data of both a different mutant allele (that causes the same phenotype but was generated via ENU mutagenesis and so is likely a point mutation).

    Does anyone have any ideas for the best way to assemble this region in a targeted fashion? I have access to a 32GB ram fairly powerful computer, but this obviously is not enough to take the large amount of WGS data I have and assemble with velvet. Even then, I feel that the velvet assembly would only give me at best 5kb contigs that won't be much more effective than what is already available. I have considered trying to align sequences to the region as it is known with bowtie then assembling those reads only, but the reference is so poor that I don't think this will be effective either (multiple genes we know to be deleted are not represented in Zv9 in any fashion, or are only partially represented, or are represented split far apart with opposite strandedness).

    Thanks in advance for any assistance or advice.

  • #2
    Yuck! You certainly have an ugly situation.

    I think if I were in your shoes I would try as many different approaches as possible to identify paired end reads where at least one of the pairs can be mapped to the region (existing assembly, mapping to your RNA-Seq data that you think is in the region, etc), then try assembling that with Velvet or another assembler. Then use that assembly to identify additional paired end reads mapping in & repeat. Keep cycling until things don't seem to be getting better.

    Is there a publicly available BAC or cosmid library for zebrafish? I doubt you'll get anything like what you want without some long-range sequence information, and pulling out a big clone for the region could be a bunch of work but is one of the more obvious ways to go about it. Alternatively, have you tried making mate pair libraries for the whole genome?

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Latest Developments in Precision Medicine
      by seqadmin



      Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

      Somatic Genomics
      “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
      05-24-2024, 01:16 PM
    • seqadmin
      Recent Advances in Sequencing Analysis Tools
      by seqadmin


      The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
      05-06-2024, 07:48 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Yesterday, 06:55 AM
    0 responses
    12 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 05-30-2024, 03:16 PM
    0 responses
    24 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 05-29-2024, 01:32 PM
    0 responses
    29 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 05-24-2024, 07:15 AM
    0 responses
    215 views
    0 likes
    Last Post seqadmin  
    Working...
    X