Header Leaderboard Ad

Collapse

Denovo Hybrid Assembly using 454/illumina

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Denovo Hybrid Assembly using 454/illumina

    Hi,

    I have managed to get a reasonable assembly of a bacterial genome using newbler based on 454 NG sequence data.

    I have also available paired end reads for the same bacterial genome and I am wondering is there a way to improve the contigs and scaffolding.

    I read about AMOS-hybrid, http://www.biomedcentral.com/1471-2164/11/242, but it requires mates file, which I do not have available instead I have one sequence file for each of the paired ends.

    I also thought to try mira but it also requires mates files. Should I request the vender for mates file or there are other software available that can use contigs from my newbler run and raw paired end files and their qualities?

    I hope other people may have gone through hybrid assembly issue already and may be of help.

    Regards,

    Intikhab

  • #2
    I would refer you to the BAMBUS manual where you find a description of the mates format:

    http://sourceforge.net/apps/mediawik...he_.mates_file

    You may also find an example in the test folder of AMOS-Hybrid.
    library fiveK 4000 6000 (r).*
    pair (.*)\.1 (.*)\.2
    The first row describes the library information and the second row contains mate-pair relationships. You may create your own mate-file based on information your own library and read structure.

    Comment


    • #3
      Originally posted by intikhab View Post
      Hi,

      I have managed to get a reasonable assembly of a bacterial genome using newbler based on 454 NG sequence data.

      I have also available paired end reads for the same bacterial genome and I am wondering is there a way to improve the contigs and scaffolding.
      If you wanted to use Mira, there are instructions on how to directly use Mira with Bambus:

      http://mira-assembler.sourceforge.ne...IRA_BAMBUS.pdf

      Comment


      • #4
        About the mates file, if I have the following description of the paired ends reads:

        >NG-5247_HLP_3_1_1058_5238/1
        ATGATGCATCTGAGACGATNGGTGTAAACGATCGT

        >NG-5247_HLP_3_1_1058_5238/2
        ACAAGATATATACGTATTATTAGGGTCAATAATGG

        How should the mates file look a like?:

        library NG-5247 150 200 (^NG).*
        pair (.*)\/1 (.*)\/2


        One related question. When we have two separate files for paired end reads, how can one provide these to mira or AMOS-Hybrid, these programs require one file. Does, jut combining the two e.g. using cat a b >c should do?

        Intikhab

        Originally posted by andreas.sjodin View Post
        I would refer you to the BAMBUS manual where you find a description of the mates format:

        http://sourceforge.net/apps/mediawik...he_.mates_file

        You may also find an example in the test folder of AMOS-Hybrid.


        The first row describes the library information and the second row contains mate-pair relationships. You may create your own mate-file based on information your own library and read structure.

        Comment


        • #5
          I suggest checking MIRA again, I think that you are in error in thinking that it requires mates files. I have used MIRA with good results doing hybrid assemblies of single end read 454 and illumina data. See the thread: "Combining 454FLX and SOLiD runs for de novo genome assembly", I outlined the process that I used there.
          SBB

          Comment


          • #6
            Here I am using AMOS-Hybrid, where mates file is required. For mira I know we dont need mates file, what we need there is a caf file from an initial assembly and sequence file from the second technology.

            Can anybody correct me on the mates file, mentioned above. When I use this, the error log shows a large number of reads got excluded.

            I want to know specifically the regular expression for the library and for the forward and reverse reads, I have mentioned the description of forward and reverse reads above.

            Regards,

            Intikhab

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Improved Targeted Sequencing: A Comprehensive Guide to Amplicon Sequencing
              by seqadmin



              Amplicon sequencing is a targeted approach that allows researchers to investigate specific regions of the genome. This technique is routinely used in applications such as variant identification, clinical research, and infectious disease surveillance. The amplicon sequencing process begins by designing primers that flank the regions of interest. The DNA sequences are then amplified through PCR (typically multiplex PCR) to produce amplicons complementary to the targets. RNA targets...
              03-21-2023, 01:49 PM
            • seqadmin
              Targeted Sequencing: Choosing Between Hybridization Capture and Amplicon Sequencing
              by seqadmin




              Targeted sequencing is an effective way to sequence and analyze specific genomic regions of interest. This method enables researchers to focus their efforts on their desired targets, as opposed to other methods like whole genome sequencing that involve the sequencing of total DNA. Utilizing targeted sequencing is an attractive option for many researchers because it is often faster, more cost-effective, and only generates applicable data. While there are many approaches...
              03-10-2023, 05:31 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Today, 11:44 AM
            0 responses
            8 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-24-2023, 02:45 PM
            0 responses
            18 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-22-2023, 12:26 PM
            0 responses
            18 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-17-2023, 12:32 PM
            0 responses
            18 views
            0 likes
            Last Post seqadmin  
            Working...
            X