Header Leaderboard Ad

Collapse

cloud computing sever for DNA sequence assembly

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • cloud computing sever for DNA sequence assembly

    Dear all,

    Do you know some famous cloud computing sever for DNA sequence assembly?

    Because my sequence dataset is too large: 450 million for one end, 900 million for pair end.

    I have checked the cloud computing on Amazon. However, for high memory instances, they just provide 68.4GB, it is not enough.

    In addition, I want these sever have already installed some assembly software like Trinity, AbySS, velvet.

    Do you know such sever?

    Thanks!

    Jingjing

  • #2
    I've been looking for more memory as well.
    As far as pre installed software, you could always boot an image of biolinux.

    Comment


    • #3
      You might want to look here, if you are still searching:

      Comment


      • #4
        Another option is to use Ray, a de novo assembler that can spread the job across multiple nodes on a cluster.

        For setting up such a cluster on Amazon, STAR::Cluster can be quite useful.

        Comment


        • #5
          Again, plugging one's own project, Gossamer may be able to handle that volume of data on your existing hardware. Depending on your existing hardware, of course.
          sub f{($f)=@_;print"$f(q{$f});";}f(q{sub f{($f)=@_;print"$f(q{$f});";}f});

          Comment


          • #6
            @Pseudonym - does Gossamer work on a 2008-vintage Intel Core 2 Quad 8200 with 8 GB of RAM (running Bio-Linux 6) to de novo assemble Ion Torrent Ion-318 shotgun data (480 Mbp)? The data is from an oral bacterium, probably 2.0-2.2 Mbp in size. Your paper is most interesting.

            Comment


            • #7
              Originally posted by hengnck View Post
              @Pseudonym - does Gossamer work on a 2008-vintage Intel Core 2 Quad 8200 with 8 GB of RAM (running Bio-Linux 6) to de novo assemble Ion Torrent Ion-318 shotgun data (480 Mbp)? The data is from an oral bacterium, probably 2.0-2.2 Mbp in size. Your paper is most interesting.
              Thanks!

              Memory won't be a problem at all; 8 GB is more than enough to assemble anything smaller than 250Mbp or so. The only catch I can see is that as far as we know, nobody has ever tried Gossamer on Ion Torrent data, and we don't know what its error characteristics are, so it may need a little fine-tuning to get good results.

              Feel free to PM me if you need any help.
              sub f{($f)=@_;print"$f(q{$f});";}f(q{sub f{($f)=@_;print"$f(q{$f});";}f});

              Comment


              • #8
                Thanks, Pseudonym. I'll install and try Gossamer out when the MIRA assembly finishes... it's only been 6 *days* continuously chugging away - two passes and another 2.36 million reads to go. I'll give you feedback on how Gossamer deals with Ion Torrent data.

                Comment


                • #9
                  For more memory for cloud computing server, you can ask and learn more about here erp on cloud

                  Comment


                  • #10
                    Originally posted by ramcob View Post
                    For more memory for cloud computing server, you can ask and learn more about here erp on cloud
                    You may want to give a try to the SSD instances (hi1.4xlarge) and with that maybe a swap file on these will help for software requiring tons of memory.

                    Otherwise, I suggest you use Ray on cc2.8xlarge instances -- these have 16 cores, 32 threads, 60 GiB ram and 10 Gigabit Ethernet.


                    -Sébastien

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      How RNA-Seq is Transforming Cancer Studies
                      by seqadmin



                      Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
                      09-07-2023, 11:15 PM
                    • seqadmin
                      Methods for Investigating the Transcriptome
                      by seqadmin




                      Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

                      Whole Transcriptome RNA-seq
                      Whole transcriptome sequencing...
                      08-31-2023, 11:07 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Yesterday, 09:05 AM
                    0 responses
                    12 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 09-21-2023, 06:18 AM
                    0 responses
                    10 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 09-20-2023, 09:17 AM
                    0 responses
                    12 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 09-19-2023, 09:23 AM
                    0 responses
                    26 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X