  • How to speed up BFAST run time?

    Hi guys,

    BFAST was warmly recommended to me, so I decided to try it on my SOLiD human sequencing data.
    I got to the 'bfast match' stage, but it takes ages to run. Going from a single thread to multiple did not affect the running time significantly (what threading library is used here?).

    I'll be grateful if someone can share their experience on how to speed this process up.

    Thanks,
    Amit

  • #2
    With shorter reads, you would speed up your alignments by creating more than one index. With reads longer than 100 bp, you could get away with one index on a human-sized genome. The authors use 10 indexes for the human genome in their example.

    How many did you create?


    • #3
      I used a single index.

      How do you create multiple indexes?
      And what do you do with them? Do you then run multiple alignments in parallel?

      Amit


      • #4
        Take a look at the supplementary material that accompanies the original BFAST publication. It explains all of the parameters in detail. It is quite complex and requires some exploration to find parameters that suit your genome of interest. Luckily for you, the authors provide 10 binary keys that you can use to build 10 indexes for the human genome. If you follow the methods from the publication, you should get decent results in minimal time (they optimized their protocol against the human genome).
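
As a rough illustration of what several binary keys (masks) buy you, here is a toy sketch in Python. It is not BFAST's code: the reference string, the two 8-character masks, and the function names are all made up for the example. A mask keeps the bases under a `1` and ignores the bases under a `0`, so a read with a mismatch under a `0` still finds its location. One mask only tolerates errors at its own `0` positions; a second mask with different `0` positions covers other error positions.

```python
def masked_key(seq, mask):
    """Keep only the bases at positions where the mask has a '1'."""
    return "".join(base for base, bit in zip(seq, mask) if bit == "1")


def build_index(reference, mask):
    """Map each masked key to the reference offsets where it occurs."""
    index = {}
    span = len(mask)
    for pos in range(len(reference) - span + 1):
        key = masked_key(reference[pos:pos + span], mask)
        index.setdefault(key, []).append(pos)
    return index


def lookup(read, index, mask):
    """Return candidate offsets for the start of the read under one mask."""
    return index.get(masked_key(read[:len(mask)], mask), [])


# Toy data: NOT the published BFAST masks, which are much longer.
reference = "ACGTTGCAAGGCTTAACCGGTATC"
masks = ["11110011", "11001111"]
indexes = [build_index(reference, m) for m in masks]

# Take the read from offset 4 and introduce one mismatch at read position 2.
read = reference[4:12]
read_mm = read[:2] + "T" + read[3:]

# One list of candidate offsets per mask.
hits = [lookup(read_mm, idx, m) for idx, m in zip(indexes, masks)]
print(hits)
```

The first mask has a `1` over the mismatched base, so its lookup comes back empty; the second mask has a `0` there and still reports offset 4. Each extra index costs memory, which is the trade-off mentioned in this thread.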


        • #5
          P.S.
          Multiple indexes are run separately, but in total they will require more memory.


          • #6
            Thanks for the help, but I don't completely understand.
            How can multiple binary keys speed up the search?


            • #7
              The publication explains this.


              • #8
                Do you work with the human genome?
                If you do, can you share the commands you use when you run it and how long it takes?

                Thanks


                • #9
                  I don't work on the human genome. You can get all the info from the paper to duplicate their run.
                  From what I read, for 10 indexes of the human genome they used:
                  a key size of 22
                  Hash width of 14
                  K cals (k) of 8
                  M = 1280
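
To make "creating more than one index" concrete, here is a small Python sketch that only builds the command lines, one `bfast index` call per mask. Treat everything in it as an assumption to verify: the flags (`-f` reference FASTA, `-m` mask, `-w` hash width, `-i` index number, `-A 1` for color space) are as I recall them from the BFAST manual, so check them against the help output of your installed version. The file name `reference.fa` and the mask are placeholders; the real masks are the ones listed in the paper's supplement.

```python
def index_commands(fasta, masks, hash_width=14, color_space=True):
    """Build one 'bfast index' argument list per mask (nothing is executed)."""
    commands = []
    for number, mask in enumerate(masks, start=1):
        argv = [
            "bfast", "index",
            "-f", fasta,             # reference FASTA (assumed flag)
            "-m", mask,              # binary key / mask (assumed flag)
            "-w", str(hash_width),   # hash width (assumed flag)
            "-i", str(number),       # index number, 1-based (assumed flag)
        ]
        if color_space:
            argv += ["-A", "1"]      # color space for SOLiD data (assumed flag)
        commands.append(argv)
    return commands


# Placeholder mask with 22 ones; substitute the published masks here.
masks = ["1" * 22]
commands = index_commands("reference.fa", masks)
for argv in commands:
    print(" ".join(argv))
```

Passing all 10 published masks in `masks` would yield 10 commands numbered 1 to 10, each producing its own index.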


                  • #10
                    Sorry, I read the supplements but I still don't understand.

                    Do you use bfast regularly?
                    How did you index your reference?


                    • #11
                      No, I don't. Email the BFAST mailing list for help.
