Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • swan_r
    Junior Member
    • Apr 2012
    • 5

    AWS for bioinformatics core

    Hello,
    Our department had started bioinformatics core recently and I am the only 1 bioinformatician there, we are looking for different options to run the analysis. We are planning to take AWS account, I have couple of questions of questions in my mind
    1. Will there be any firewall problems when accessing the AWS from university.
    2. If so, is it easier issue to resolve
    3. How much space we need to run 200 RNA-Seq samples from AWS

    Anyone has experience about this, I would really appreciate any feedback on this. Thank you.
  • brianytsui
    Junior Member
    • Aug 2018
    • 5

    #2
    1. Will there be any firewall problems when accessing the AWS from university.
    There is no firewall by default on AWS. You can directly ssh from your computer to an EC2 instance. If you want better security, you can create something called a VPC with limited IP ranges.

    3. How much space we need to run 200 RNA-Seq samples from AWS
    If money is not an issue, EFS would be the most natural solution, as it grows and shrinks depending on the amount of data u put in, which means you don't have to think about the problem of scalability. I usually keep everything in the sorted bam format to keep the data small without losing anything. Most sorted RNAseq bams I have seen are less than 10GB in size, of course, it depends on a lot on the sequencing depth.

    I just wrote a blog post about this kinda issue actually, hope it might be helpful :
    The recurrent question in the data-intensive workplace often revolves around which computing infrastructure to use. In the past four years as a bioinformatics Ph.D. student, I have both received an…

    Comment

    Latest Articles

    Collapse

    • SEQadmin2
      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
      by SEQadmin2


      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

      Here are nine questions we think about, in roughly the order they matter, before...
      06-18-2026, 07:11 AM
    • SEQadmin2
      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
      by SEQadmin2


      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
      ...
      06-02-2026, 10:05 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by SEQadmin2, Today, 11:10 AM
    0 responses
    6 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-17-2026, 06:09 AM
    0 responses
    42 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-09-2026, 11:58 AM
    0 responses
    102 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-05-2026, 10:09 AM
    0 responses
    124 views
    0 reactions
    Last Post SEQadmin2  
    Working...