Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Plant whole genome hybrid assembly

    Hi,

    We have generated data using Illumina HiSeq 2000 and 454 for a plant species.

    We completed the hybrid assembly using Velvet and got assembled contigs.

    Now we were trying to map the Illumina reads on the assembled contigs but only 25-30% of the reads are getting mapped.

    We were expecting more % mapping !!!!

    Does any one has any experience with this ?

  • #2
    Are you using all assembled contigs to map against? Or did you only choose contigs greater than a certain size?

    Comment


    • #3
      We used contigs greater than 300 bases...

      Comment


      • #4
        What length Illumina data do you have? What fold coverage given the expected genome size? What is the total contig length & N50 for your assembly?

        I would try assembling the Illumina data alone & see what you get. Also, try Ray as your assembler -- it tends to build longer contigs, and my experience to date is that the longer contigs are valid.

        Have you scanned different kmer values for velvet, either manually or using VelvetOptimiser?

        Comment


        • #5
          We have generated 45GB illumina data (100PE), 1 full plate 454 FLX and 1 plate 454 FLXplus for approx 700MB genome...we finally got ~400,000 contigs covering 630MB genome with N50 of 7.8KB...we did try different kmer lengths by manual calculations...the problem is that now when we try to map the illumina reads to the contigs ( ~ 400,000) to get heterozygous calls only 25-30% of the reads get mapped which we feel is a little strange !!! Is this expected ?

          Comment


          • #6
            A lot of your reads will map to your singletons (<300bp contigs) so you should try and include those in your analysis to see if the mapping improves. Is the genome repetitive? Have you had a look at the read coverage of the 25-30% that mapped? Is it uniform?

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM
            • seqadmin
              Strategies for Sequencing Challenging Samples
              by seqadmin


              Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
              03-22-2024, 06:39 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            15 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            21 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            16 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            45 views
            0 likes
            Last Post seqadmin  
            Working...
            X