Try using the "-U" option.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
with the -U option added, it finished within 1 min!
but it means BFAST will not consider the results for "mated-pair". How much it will affect the final result (compared with not using -U option)?
If I use the -U option, may I use the SAMTools or other software to further filter the result before SNP discovery?
Thanks!
Comment
-
Originally posted by NanYu View Postwith the -U option added, it finished within 1 min!
but it means BFAST will not consider the results for "mated-pair".
Originally posted by NanYu View PostHow much it will affect the final result (compared with not using -U option)?
Originally posted by NanYu View PostIf I use the -U option, may I use the SAMTools or other software to further filter the result before SNP discovery?
Thanks!
Comment
-
Originally posted by nilshomer View PostI am not sure why you are concerned. Did you look at the data and see something odd?
I'm wondering in the postprocess step, if BFAST will do something like: choose the best for F and second best for R within a pair because that is within the expected distance, yet the best for R is in another chromsome.
I know it is a rare scenario and maybe I should not worry about it.
Comment
-
Originally posted by Chipper View PostThe problem seems to be the estmation of distances and standar deviations. If you set these parameters it will be much faster.
you mean I should provide a value for "-S"?
We are expecting ~1.5kb, so should I set "-S 3000" (allow the distance to be a bit bigger).
Thanks!
Comment
-
Originally posted by nilshomer View PostYes, that is a possibility with any pairing step.
As mentioned above, if we are expecting ~1.5kb (insertion size, from the lab protocol) , do you think if I set "-S 3000" (very loose criteria) is an OK value for "-S"?
I saw the "-v 160 -s 20 " in #3 but could not find them in the BFAST manual (postprocess section). Could you please let me know what they are?
I really appreciate your help in this forum.
Comment
-
Originally posted by nilshomer View PostIt may take a very long time to run. Note that "-S" is the standard deviation, not the mean (etc.).
The best way, as a scientist, to find out these answers is to try it out for yourself! I cannot predict the behavior
It did takes much longer time to run (compared to use "-U") option.
There is no option for me to tell BFAST at the postprocess step the expected insertion size (or range of the insertion size) for pair-end sequences.
Maybe what I should do is to use "-U" and use a perl script to look for column #8 in the sam file and separate the results according to the value of #8 (and other information). By doing so, it means this information (expected insertion size) is wasted when using BFAST to map the paired-end sequences.
Comment
-
Originally posted by nilshomer View PostSee "-v" and "-s". Make sure you have latest BFAST.
Nils
The version I have is 0.6.5a and the bfast-book.pdf I am reading also shows "version 0.6.5a" but I could not find -v option and -s is common parameter and has nothing to do with insertion size.
"-s INTEGER, --startReadNum=INTEGER
Specifies the first read in which to process. This may be useful when distributing a large data set across a cluster."
Maybe I didn't get things right?
Comment
-
Originally posted by nilshomer View PostGood point, the manual and binaries are not in sync. Try "bfast postprocess -h".
Thanks for your help! I will try them now.
-v insertSizeAvg Specifies the mean insert size to use when rescuing
-s insertSizeStdDev Specifies the standard deviation of the insert size to use when rescuing
Comment
-
Hi. everyone.
Should we transform the fastq file from solid2fastq (Bfast) before mapping with bwa?
I mean from "0123123" to "ACTGACT", something likewise plus some other format issues?
Before, I proced without any transformation, then got malformed .sai file and some "segmentation fault" which I haven't specified yet.
Any suggestions? Thank you
Comment
Latest Articles
Collapse
-
by seqadmin
Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.
[Article Coming Soon!]...-
Channel: Articles
Today, 08:07 AM -
-
by seqadmin
Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...-
Channel: Articles
09-23-2024, 06:35 AM -
-
by seqadmin
During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.
Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...-
Channel: Articles
09-09-2024, 10:59 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 10-02-2024, 04:51 AM
|
0 responses
13 views
0 likes
|
Last Post
by seqadmin
10-02-2024, 04:51 AM
|
||
Started by seqadmin, 10-01-2024, 07:10 AM
|
0 responses
23 views
0 likes
|
Last Post
by seqadmin
10-01-2024, 07:10 AM
|
||
Started by seqadmin, 09-30-2024, 08:33 AM
|
1 response
29 views
0 likes
|
Last Post
by EmiTom
Today, 06:46 AM
|
||
Started by seqadmin, 09-26-2024, 12:57 PM
|
0 responses
19 views
0 likes
|
Last Post
by seqadmin
09-26-2024, 12:57 PM
|
Comment