  • How to speed up BFAST run time?

    Hi guys,

    BFAST was strongly recommended to me, so I decided to try it on my SOLiD human sequencing data.
    I got to the 'bfast match' stage, but it takes ages to run. Going from a single thread to multiple threads did not change the running time significantly (which threading library is used here?).

    I'll be grateful if someone can share their experience on how to speed this process up.

    Thanks,
    Amit

  • #2
    With shorter reads you would speed up your alignments by creating more than one index. With reads longer than 100 bp you could get away with one index on a human-sized genome. The authors use 10 indexes for the human genome in their example.

    How many did you create?


    • #3
      I used a single index.

      How do you create multiple indexes?
      And what do you do with them? Do you then run multiple alignments in parallel?

      Amit


      • #4
        Take a look at the supplementary material that goes with the original BFAST publication. It explains all of the parameters in detail. It is quite complex, and you will need to explore which parameters suit your genome of interest. Luckily for you, the authors provide 10 binary keys that you can use to build 10 indexes of the human genome. If you follow the methods from the publication you should get decent results in minimal time (they optimized their protocol against a human genome).
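As a rough illustration of what a "binary key" (mask) does, here is a minimal Python sketch. It is not BFAST's code: the function names, the toy 10-base sequences, and the three masks are all made up for this example, and it uses nucleotide letters rather than SOLiD color space. The idea shown is general to spaced-seed lookup: a mask keeps the positions marked 1 and ignores those marked 0, so a read with a mismatch can still produce the same key as the reference under any mask that ignores the mismatched position. Using several different masks raises the chance that at least one of them does.

```python
def apply_mask(seq: str, mask: str) -> str:
    """Return the key for `seq`: the bases at positions where `mask` is '1'."""
    if len(seq) < len(mask):
        raise ValueError("sequence is shorter than the mask")
    return "".join(base for base, bit in zip(seq, mask) if bit == "1")


def key_size(mask: str) -> int:
    """Key size is taken here to be the number of '1' positions in the mask."""
    return mask.count("1")


def masks_that_hit(ref_window: str, read: str, masks: list[str]) -> list[str]:
    """Return the masks under which the read and the reference give the same key."""
    return [m for m in masks if apply_mask(ref_window, m) == apply_mask(read, m)]


# Hypothetical toy data: the read differs from the reference at position 4 (0-based).
ref_window = "ACGTACGTAC"
read = "ACGTTCGTAC"

# Hypothetical masks: the first has no gaps, the second ignores position 4,
# the third ignores positions 1 and 8.
masks = ["1111111111", "1111011111", "1011111101"]

hits = masks_that_hit(ref_window, read, masks)
print(hits)
```

Only the mask that skips the mismatched position yields a matching key, which is why an index built from a single mask can miss reads that a set of indexes built from different masks would still find.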


        • #5
          P.S. Multiple indexes are run separately, but in total they will require more memory.


          • #6
            Thanks for the help, but I don't completely understand.
            How can multiple binary keys speed up the search?


            • #7
              The publication explains this.


              • #8
                Do you work with the human genome?
                If you do, can you share the commands you use when you run it and how long it takes?

                Thanks


                • #9
                  I don't work on the human genome. You can get all the info from the paper to duplicate their run.
                  From what I read, for 10 indexes of the human genome they used:
                  a key size of 22
                  a hash width of 14
                  K cals (k) of 8
                  M = 1280
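To get a feel for why more indexes cost more memory, here is a back-of-the-envelope Python sketch. It rests on two assumptions that are not stated in this thread: that the hash width is the number of leading key bases used to address a hash table over a 4-letter alphabet, and a hypothetical 4 bytes per table entry. It counts only the hash table, not the stored genome positions, so treat the result as a rough lower bound rather than BFAST's real footprint.

```python
def hash_table_entries(hash_width: int, alphabet_size: int = 4) -> int:
    """Number of distinct prefixes of length `hash_width` over the alphabet."""
    return alphabet_size ** hash_width


def hash_table_bytes(hash_width: int, bytes_per_entry: int = 4, num_indexes: int = 1) -> int:
    """Total bytes for `num_indexes` hash tables (bytes_per_entry is an assumed value)."""
    return hash_table_entries(hash_width) * bytes_per_entry * num_indexes


entries = hash_table_entries(14)                      # hash width quoted in the thread
one_index_gib = hash_table_bytes(14) / 2**30          # one index
ten_indexes_gib = hash_table_bytes(14, num_indexes=10) / 2**30  # ten indexes

print(f"{entries:,} entries per hash table")
print(f"{one_index_gib:.1f} GiB for one index, {ten_indexes_gib:.1f} GiB for ten")
```

Under these assumptions the hash tables alone scale linearly with the number of indexes, which is consistent with the earlier remark that multiple indexes need more memory in total.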


                  • #10
                    Sorry, I read the supplements but I still don't understand.

                    Do you use bfast regularly?
                    How did you index your reference?


                    • #11
                      No, I don't. Email the BFAST mailing list for help.
