Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • nilshomer
    Nils Homer
    • Nov 2008
    • 1283

    #16
    Try using the "-U" option.

    Comment

    • NanYu
      Member
      • Apr 2011
      • 21

      #17
      with the -U option added, it finished within 1 min!
      but it means BFAST will not consider the results for "mated-pair". How much it will affect the final result (compared with not using -U option)?
      If I use the -U option, may I use the SAMTools or other software to further filter the result before SNP discovery?
      Thanks!

      Comment

      • nilshomer
        Nils Homer
        • Nov 2008
        • 1283

        #18
        Originally posted by NanYu View Post
        with the -U option added, it finished within 1 min!
        but it means BFAST will not consider the results for "mated-pair".
        It means that it will not try the mate-pair rescue strategy, but it will be annotated as mate pair in the SAM file (check the flag field).

        Originally posted by NanYu View Post
        How much it will affect the final result (compared with not using -U option)?
        It should affect it by 42. On a serious note, I don't know how much it will affect your results. Please post back if you notice an interesting difference.

        Originally posted by NanYu View Post
        If I use the -U option, may I use the SAMTools or other software to further filter the result before SNP discovery?
        Thanks!
        I am not sure why you are concerned. Did you look at the data and see something odd?

        Comment

        • Chipper
          Senior Member
          • Mar 2008
          • 323

          #19
          The problem seems to be the estmation of distances and standar deviations. If you set these parameters it will be much faster.

          Comment

          • NanYu
            Member
            • Apr 2011
            • 21

            #20
            Originally posted by nilshomer View Post
            I am not sure why you are concerned. Did you look at the data and see something odd?

            I'm wondering in the postprocess step, if BFAST will do something like: choose the best for F and second best for R within a pair because that is within the expected distance, yet the best for R is in another chromsome.
            I know it is a rare scenario and maybe I should not worry about it.

            Comment

            • NanYu
              Member
              • Apr 2011
              • 21

              #21
              Originally posted by Chipper View Post
              The problem seems to be the estmation of distances and standar deviations. If you set these parameters it will be much faster.

              you mean I should provide a value for "-S"?
              We are expecting ~1.5kb, so should I set "-S 3000" (allow the distance to be a bit bigger).

              Thanks!

              Comment

              • nilshomer
                Nils Homer
                • Nov 2008
                • 1283

                #22
                Yes, that is a possibility with any pairing step.

                Comment

                • NanYu
                  Member
                  • Apr 2011
                  • 21

                  #23
                  Originally posted by nilshomer View Post
                  Yes, that is a possibility with any pairing step.
                  Thanks!

                  As mentioned above, if we are expecting ~1.5kb (insertion size, from the lab protocol) , do you think if I set "-S 3000" (very loose criteria) is an OK value for "-S"?

                  I saw the "-v 160 -s 20 " in #3 but could not find them in the BFAST manual (postprocess section). Could you please let me know what they are?

                  I really appreciate your help in this forum.

                  Comment

                  • nilshomer
                    Nils Homer
                    • Nov 2008
                    • 1283

                    #24
                    It may take a very long time to run. Note that "-S" is the standard deviation, not the mean (etc.).

                    The best way, as a scientist, to find out these answers is to try it out for yourself! I cannot predict the behavior

                    Comment

                    • NanYu
                      Member
                      • Apr 2011
                      • 21

                      #25
                      Originally posted by nilshomer View Post
                      It may take a very long time to run. Note that "-S" is the standard deviation, not the mean (etc.).

                      The best way, as a scientist, to find out these answers is to try it out for yourself! I cannot predict the behavior
                      Thanks!
                      It did takes much longer time to run (compared to use "-U") option.

                      There is no option for me to tell BFAST at the postprocess step the expected insertion size (or range of the insertion size) for pair-end sequences.

                      Maybe what I should do is to use "-U" and use a perl script to look for column #8 in the sam file and separate the results according to the value of #8 (and other information). By doing so, it means this information (expected insertion size) is wasted when using BFAST to map the paired-end sequences.

                      Comment

                      • nilshomer
                        Nils Homer
                        • Nov 2008
                        • 1283

                        #26
                        See "-v" and "-s". Make sure you have latest BFAST.

                        Nils

                        Comment

                        • NanYu
                          Member
                          • Apr 2011
                          • 21

                          #27
                          Originally posted by nilshomer View Post
                          See "-v" and "-s". Make sure you have latest BFAST.

                          Nils
                          Hi, Nils:

                          The version I have is 0.6.5a and the bfast-book.pdf I am reading also shows "version 0.6.5a" but I could not find -v option and -s is common parameter and has nothing to do with insertion size.

                          "-s INTEGER, --startReadNum=INTEGER
                          Specifies the first read in which to process. This may be useful when distributing a large data set across a cluster."

                          Maybe I didn't get things right?

                          Comment

                          • nilshomer
                            Nils Homer
                            • Nov 2008
                            • 1283

                            #28
                            Good point, the manual and binaries are not in sync. Try "bfast postprocess -h".

                            Comment

                            • NanYu
                              Member
                              • Apr 2011
                              • 21

                              #29
                              Originally posted by nilshomer View Post
                              Good point, the manual and binaries are not in sync. Try "bfast postprocess -h".
                              Saw them now.

                              Thanks for your help! I will try them now.

                              -v insertSizeAvg Specifies the mean insert size to use when rescuing
                              -s insertSizeStdDev Specifies the standard deviation of the insert size to use when rescuing

                              Comment

                              • guo
                                Junior Member
                                • Jun 2011
                                • 8

                                #30
                                Hi. everyone.

                                Should we transform the fastq file from solid2fastq (Bfast) before mapping with bwa?
                                I mean from "0123123" to "ACTGACT", something likewise plus some other format issues?

                                Before, I proced without any transformation, then got malformed .sai file and some "segmentation fault" which I haven't specified yet.

                                Any suggestions? Thank you

                                Comment

                                Latest Articles

                                Collapse

                                • SEQadmin2
                                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                  by SEQadmin2


                                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                  ...
                                  06-02-2026, 10:05 AM
                                • SEQadmin2
                                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                                  by SEQadmin2


                                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                                  Introduction

                                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                                  05-22-2026, 06:42 AM
                                • SEQadmin2
                                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                                  by SEQadmin2

                                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                                  05-06-2026, 09:04 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by SEQadmin2, Today, 08:59 AM
                                0 responses
                                8 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 12:03 PM
                                0 responses
                                21 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 11:40 AM
                                0 responses
                                15 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-28-2026, 11:40 AM
                                0 responses
                                29 views
                                0 reactions
                                Last Post SEQadmin2  
                                Working...