Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Which file system is the best for NGS???

    Does it really make a difference???

    Or people just use the standard ones like NTFS, ext3???

  • #2
    Do people use Lustre? It seems to be designed for cluster computing.

    Comment


    • #3
      Originally posted by ymc View Post
      Does it really make a difference???

      Or people just use the standard ones like NTFS, ext3???
      It does. If you are not running in a clustered environment (or if the cluster is small) I would suggest ZFS only, a recent build possibly. data deduplication, data compression, easy maintenance and a lot of other stuff. There may be problems in porting apps on OS which support ZFS.

      Comment


      • #4
        Originally posted by ymc View Post
        Does it really make a difference???
        Or people just use the standard ones like NTFS, ext3???
        Well, if you are using NTFS, that implies you are using Windows, which is probably a mistake

        As for Unix, we use XFS and sometimes EXT3 on our RAID volumes. And that means all our volumes. You wouldn't not use RAID would you?

        Ultimately the filesystem won't be your bottleneck, it will be the underlying disk subsystem. If you can't get the data sets into RAM fast enough, you won't be using your CPUs at maximum. The filesystem won't help too much. Most of the files are large and read sequentially.

        Comment


        • #5
          Originally posted by Torst View Post
          Well, if you are using NTFS, that implies you are using Windows, which is probably a mistake

          As for Unix, we use XFS and sometimes EXT3 on our RAID volumes. And that means all our volumes. You wouldn't not use RAID would you?

          Ultimately the filesystem won't be your bottleneck, it will be the underlying disk subsystem. If you can't get the data sets into RAM fast enough, you won't be using your CPUs at maximum. The filesystem won't help too much. Most of the files are large and read sequentially.
          I do understand the bottleneck is most likely CPU and RAM for NGS. But since we are going to invest so much in the hardware, we might as well try to squeeze out every ounce of performance.

          I am going to use RAID, most likely RAID10. I heard that the stripe size of a RAID interacts with the block size of a file system. How does this work? Anyone knows?

          Comment


          • #6
            Stay away from Fat32! You won't be able to save any file over 4GB.

            Comment


            • #7
              You can use NTFS with Ubuntu but not Red Hat.

              Comment


              • #8
                Is ZFS optimized for big sequential read and write????

                Comment


                • #9
                  Originally posted by ymc View Post
                  I do understand the bottleneck is most likely CPU and RAM for NGS. But since we are going to invest so much in the hardware, we might as well try to squeeze out every ounce of performance.
                  My opinion is that spending 90% of your time getting an extra 10% is not worth it. Spend that effort on working smart - choosing the best tools, best methods, best protocols, smart pipelines.

                  I am going to use RAID, most likely RAID10. I heard that the stripe size of a RAID interacts with the block size of a file system. How does this work? Anyone knows?
                  In theory it is simple - you want to align logical volume accesses with the underlying physical volume accesses. In practice it is difficult, especially with 3rd-party hardware raid solutions, as the block layout is often hidden.

                  If you use Linux software raid, you have more control, and mkfs even queries the underlying md device to optimize the settings (I think). Read here for more: http://feedblog.org/2008/06/18/howto...trides-in-xfs/

                  Comment


                  • #10
                    Originally posted by ymc View Post
                    Is ZFS optimized for big sequential read and write????
                    The good thing is that you can configure it to have good performances for almost any kind of files :-)

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      New Genomics Tools and Methods Shared at AGBT 2025
                      by seqadmin


                      This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                      The Headliner
                      The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                      03-03-2025, 01:39 PM
                    • seqadmin
                      Investigating the Gut Microbiome Through Diet and Spatial Biology
                      by seqadmin




                      The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
                      02-24-2025, 06:31 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 03-03-2025, 01:15 PM
                    0 responses
                    180 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 02-28-2025, 12:58 PM
                    0 responses
                    275 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 02-24-2025, 02:48 PM
                    0 responses
                    663 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 02-21-2025, 02:46 PM
                    0 responses
                    268 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X