Header Leaderboard Ad

Collapse

Assembly of Large Genomes using Cloud Computing by Contrail

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Assembly of Large Genomes using Cloud Computing by Contrail

    I have found a new software (the code source is not available yet) for large genomes assembly(http://sourceforge.net/projects/contrail-bio/). It relies on Hadoop to iteratively transform an on-disk representation of the assembly graph, allowing an in depth analysis even for large genomes, which can reduce the memory requirement. Contrails also use de Brujin graph strategy to do the short reads assembly.
    for more see the wiki: http://sourceforge.net/apps/mediawik...title=Contrail

  • #2
    does anyone know when the first release of Contrail is scheduled?

    The quality of software coming from the Salzberg and Pop labs has been very high. Despite the inelegant name, I am really looking forward to seeing how Contrail compares with Velvet, Abyss, SOAPdenovo, etc..
    Last edited by Zigster; 01-11-2010, 10:34 AM.
    --
    Jeremy Leipzig
    Bioinformatics Programmer
    --
    My blog
    Twitter

    Comment


    • #3
      I am also wondering if this assembler is written in entirely in Java. Isn't that a Hadoop requirement?
      --
      Jeremy Leipzig
      Bioinformatics Programmer
      --
      My blog
      Twitter

      Comment


      • #4
        I am also looking forward to the first release of Contrail....

        Comment


        • #5
          The source code has been released but it does not look like one could just run it. No documentation, no hints. Any word on when a usable version might be available?

          Comment


          • #6
            Originally posted by jjv5 View Post
            The source code has been released but it does not look like one could just run it. No documentation, no hints. Any word on when a usable version might be available?
            Meanwhile ->

            Michael Schatz (Cold Spring Harbor Laboratory)
            "Cloud Computing and the DNA Data Race: Theory and Practice."
            http://schatzlab.cshl.edu/presentati....Computing.pdf

            Comment


            • #7
              has anyone gotten this up and running? does it read raw sequence files? so many questions!
              Petri Dish Talk

              Comment


              • #8
                Hi all,

                I managed to get contrail up and running.

                Here is how to run the program on the test case provided by Schatz.

                http://www.homolog.us/blogs/2011/09/...t-uses-hadoop/
                http://homolog.us

                Comment


                • #9
                  Such an amazing post! This is an interesting thread. I wonder how effective this cloud computing is.






                  _____________________________________________________________________________________________________
                  "Defect-free software does not exist."
                  ~ Wietse Venema ~
                  Hosting Dallas

                  Comment


                  • #10
                    Effectiveness of which one are you asking about - cloud computing in general or hadoop?

                    Cloud computing basically allows you to rent computer time from some company managing the hardware. To an user, it is nothing different from using a local supercomputing facility at the university or other place.

                    Hadoop is a different paradigm and has been useful for large data. You can even set it locally and do not need cloud computing for it. I have few posts on hadoop for bioinformatics.

                    http://www.homolog.us/blogs/category/hadoop/
                    http://homolog.us

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Improved Targeted Sequencing: A Comprehensive Guide to Amplicon Sequencing
                      by seqadmin



                      Amplicon sequencing is a targeted approach that allows researchers to investigate specific regions of the genome. This technique is routinely used in applications such as variant identification, clinical research, and infectious disease surveillance. The amplicon sequencing process begins by designing primers that flank the regions of interest. The DNA sequences are then amplified through PCR (typically multiplex PCR) to produce amplicons complementary to the targets. RNA targets...
                      03-21-2023, 01:49 PM
                    • seqadmin
                      Targeted Sequencing: Choosing Between Hybridization Capture and Amplicon Sequencing
                      by seqadmin




                      Targeted sequencing is an effective way to sequence and analyze specific genomic regions of interest. This method enables researchers to focus their efforts on their desired targets, as opposed to other methods like whole genome sequencing that involve the sequencing of total DNA. Utilizing targeted sequencing is an attractive option for many researchers because it is often faster, more cost-effective, and only generates applicable data. While there are many approaches...
                      03-10-2023, 05:31 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 03-24-2023, 02:45 PM
                    0 responses
                    16 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 03-22-2023, 12:26 PM
                    0 responses
                    17 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 03-17-2023, 12:32 PM
                    0 responses
                    17 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 03-15-2023, 12:42 PM
                    0 responses
                    24 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X