Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Giorgio C
    Member
    • Oct 2010
    • 89

    Computer requirements to do Bioinformatic analysis

    Hy all,

    my department want to buy a new computer to do bioinformatic analysis for next generation sequencing. Can you suggest me something good with some paramount requirements ?? (CPU; RAM....etc.etc.).

    Thanks you very much
  • jordi
    Member
    • Apr 2009
    • 49

    #2
    Wow! I think you'll have plenty of answers to that question. As usual one could say: it depends! Let me know that NCBI had budgets constraints in order to maintain the SRA repository... I mean, if your department is planning to do a lot of 454 runs or multiple Illumina lines, you should have a lot of free space. Instead of this, a single analysis could be as long as 6 GB.
    Moreover, if you are going to do a parallel process (grid blast: http://www.springerlink.com/content/2rlxkun1cxalxrjd/) you should need a Portable Batch System in order to schedule several jobs between processors.
    Regard the RAM needs, I've run a Newbler assembly program in a 4GB RAM computer.
    Hope this helps.

    Comment

    • ulz_peter
      Senior Member
      • Feb 2010
      • 219

      #3
      Hi Giorgio,

      Like Jordi said: It depends: If your focus is on targetted resequencing, there are some cheap solutions (maybe a quad core for parallel alignment and some 16Gb RAM or so). Depending on the amount of data you are aiming to produce you should consider thinking about a long-term storage in the TB region.

      If you're planning to do de-novo sequencing you'll need a lot more RAM (depending on the genome size you're gonna sequence).

      So to give you a concrete advise we would need some more details on what you're planning to do...

      Comment

      • severin
        Genome Informatics Facility
        • Sep 2009
        • 105

        #4
        Example System

        In my facility, we have a 7 node system where each node consists of 32 cpu and 256G of RAM. For most problems, 16 cores and 128G RAM is sufficient to work on the biological problem of interest and get an answer in a couple of hours to a couple of days. For a genome assembly larger than bacteria, expect to utilize at least 32 core and 128G RAM for 2 days to run one assembly with one set of parameters. If you want to try multiple parameters then you will need twice as much. If you are talking about getting this for your department with 4-5 people using it none stop you will want at least as many nodes as their are people and that would be at the lower end. This all assumes you are working on datasets of the size of a two condition (stress vs controll etc) or perhaps a small time/tissue series of 5-10 points all with 2-3 replicates. If you start thinking bigger (1000 genome project, 10000 vertebrate project, screening populations of individuals) you will need considerably more computational power.

        As previous posts have mentioned you can do most smaller problems on a smaller system. Bacteria can be assembled on a laptop.

        Comment

        • Giorgio C
          Member
          • Oct 2010
          • 89

          #5
          Hy Guys thank you all for your answers !!!

          In the details i need this new Computer to work on only 454 data in the context of transcitpomic, metagenomic, chip-seq, amplicon sequencing ( for the most). Now i'm working with an "home PC" > intel dual core with 4Gb of Ram;
          It 's good but in some cases it crashed down out of memory ( For example doing a big local blast or a multiple alignment).

          Can you suggest me some brands to looking for with the best requirements for my purpose ??

          Thanks like always for your precious help !!!

          Giorgio

          Comment

          • yumtaoist
            Member
            • Dec 2011
            • 10

            #6
            If you don't care about efficiency, a computer with four CPUs is enough. But the memory is very important, 16GB RAM is necessary. What's more, you can set another 16GB hard disk as SWAP, I think this will solove the crash problem.

            Actually, the hardware you needed is fully connected to the software you are using, some software have pointed out the hardware it required.

            Sometimes, renting the server of a supercomputer center is also a good idea. You will know your requirement after a period time of trial.

            Additionally, the space of hard disk is also very important. Generally speaking, 1TB is basically, 2TB or more is better.
            Last edited by yumtaoist; 01-03-2012, 07:05 AM.

            Comment

            • Giorgio C
              Member
              • Oct 2010
              • 89

              #7
              Thank you yumtaoist !!!

              Comment

              • adaptivegenome
                Super Moderator
                • Nov 2009
                • 436

                #8
                Originally posted by yumtaoist View Post
                If you don't care about efficiency, a computer with four CPUs is enough. But the memory is very important, 16GB RAM is necessary. What's more, you can set another 16GB hard disk as SWAP, I think this will solove the crash problem.

                Actually, the hardware you needed is fully connected to the software you are using, some software have pointed out the hardware it required.

                Sometimes, renting the server of a supercomputer center is also a good idea. You will know your requirement after a period time of trial.

                Additionally, the space of hard disk is also very important. Generally speaking, 1TB is basically, 2TB or more is better.
                I agree. People often jump into developing computer infrastructure when it might be more cost-effective and efficient to use someone's else existing system. If you develop a long term need you can always acquire the hardware at a later point.

                Comment

                • GenoMax
                  Senior Member
                  • Feb 2008
                  • 7142

                  #9
                  Do not forget to provision some way of doing regular backup of the data you are going to be working on.

                  When you are using a local computer (BTW: I second geneticform's reco in post #8 to see if you can hop on to some shared infrastructure but if that may not be an option in your case) it is easy to overlook this important need. It will potentially save you hours of frustration down the road.

                  A backup solution can be as simple as an external drive (or two if you want to be thorough) with regular backup via scripts/programs. It sounds like you are not going to deal with more than a couple of TB of data so this should be sufficient.

                  Comment

                  Latest Articles

                  Collapse

                  • SEQadmin2
                    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                    by SEQadmin2


                    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                    Here are nine questions we think about, in roughly the order they matter, before...
                    06-18-2026, 07:11 AM
                  • SEQadmin2
                    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                    by SEQadmin2


                    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                    ...
                    06-02-2026, 10:05 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by SEQadmin2, 06-26-2026, 11:10 AM
                  0 responses
                  15 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-17-2026, 06:09 AM
                  0 responses
                  49 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-09-2026, 11:58 AM
                  0 responses
                  107 views
                  0 reactions
                  Last Post SEQadmin2  
                  Started by SEQadmin2, 06-05-2026, 10:09 AM
                  0 responses
                  125 views
                  0 reactions
                  Last Post SEQadmin2  
                  Working...