Assembling pooled BACs from 454 data

  • Assembling pooled BACs from 454 data

    We have a 454 Titanium run of ~50 pooled BACs: not bar-coded, not paired-end, from two clonal lines. The genome is mostly unsequenced and undoubtedly repetitive, and the BACs could overlap.

    I am having trouble assembling the BACs. Newbler runs but then hangs in the 'deconvoluting step'. The TIGR EST clustering pipeline -- hey, I figured this was like an EST program only with bigger "ESTs" -- is throwing most of the reads into one contig even after masking out vector, adapters, etc. Of course ideally one would like to see 50 or so contigs which could then be assembled.

    Does anyone have any papers to read or ideas on how to extract these BACs from the 350 Mbase dataset? I guess that basically I need a good clustering method. After that the assembly itself should be simple.

    Thanks,
    -- Rick

  • #2
    How about dividing sequences into small groups?

    After assembling each group and gathering the contigs, you can assemble all of the contigs together one more time.
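    The splitting step could be sketched roughly as below. This is only an illustration under assumptions not stated in the thread: the reads are already exported as FASTA, groups are formed by random round-robin assignment, and the function names `read_fasta` and `split_into_groups` are made up for this example. Each group would then be handed to whichever assembler is in use.

```python
import random


def read_fasta(lines):
    """Yield (header, sequence) tuples from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new record starts; emit the previous one first.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def split_into_groups(records, n_groups, seed=0):
    """Shuffle the records and deal them round-robin into n_groups lists.

    Random assignment is an assumption made for this sketch; the thread
    does not say how the groups should be chosen.
    """
    if n_groups < 1:
        raise ValueError("n_groups must be at least 1")
    records = list(records)
    random.Random(seed).shuffle(records)
    groups = [[] for _ in range(n_groups)]
    for i, record in enumerate(records):
        groups[i % n_groups].append(record)
    return groups
```

    With a real file, something like `split_into_groups(read_fasta(open("reads.fna")), 20)` would give 20 lists of reads to write out and assemble separately (the file name and group count are placeholders). Note that random splitting lowers the coverage of every BAC within each group, so the group count has to stay small enough for the reads to still assemble.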

    Using more stringent criteria, such as a higher percent identity and a longer minimum overlap, is another approach.
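    To make the effect of those two thresholds concrete, here is a toy sketch of an overlap test between two reads. It is not how Newbler or the TIGR pipeline compute overlaps: it assumes ungapped suffix-to-prefix comparison, which ignores the homopolymer indels typical of 454 reads, and the function name and default thresholds are invented for illustration.

```python
def suffix_prefix_overlap(a, b, min_overlap=40, min_identity=0.95):
    """Return the longest ungapped overlap between the end of `a` and the
    start of `b` that meets both thresholds, or 0 if there is none."""
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    longest_possible = min(len(a), len(b))
    # Try the longest candidate overlap first and shrink from there.
    for length in range(longest_possible, min_overlap - 1, -1):
        tail, head = a[-length:], b[:length]
        matches = sum(x == y for x, y in zip(tail, head))
        if matches / length >= min_identity:
            return length
    return 0


a = "AAAACCCCGGGG"
b = "CCCCGGGGTTTT"
strict = suffix_prefix_overlap(a, b, min_overlap=4, min_identity=1.0)
loose = suffix_prefix_overlap(a, b, min_overlap=4, min_identity=0.6)
```

    In this small example the strict setting accepts only the exact 8-base overlap, while the loose identity threshold accepts a longer, partly mismatched 9-base one. In a repetitive genome, permissive thresholds like that are one way unrelated reads end up merged into a single contig.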

    • #3
      Originally posted by mgenome
      How about dividing sequences into small groups?

      After assembling each group and gathering the contigs, you can assemble all of the contigs together one more time.
      A good idea and one that I will try. If nothing else I might get to the repetitive parts of the BACs.


      Using more stringent criteria, such as a higher percent identity and a longer minimum overlap, is another approach.
      Yes. I ran several such jobs last night, only to come back to work this morning and find that I was over my disk quota and that my programs had crashed in mysterious ways. Who would have guessed that 250 GB would not be enough space?
