bowtie options
Hi collective brains,
I am getting confused with some Bowtie's options. I did read the manual carefully and also read most of the reads in this thread, but still confused.
* What is difference between -m and -M? From the manual, it seems to me that -M is equivalent to -m --best --strata?
* What option I should use to filter out matches from repeated reads?
Thanks,
D.
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
kevpar, you need to start Bowtie from a terminal.
In windows hit the windows-key+R and type "cmd" then hit run, in Ubuntu start a terminal via Programs>Accessories>Terminal.
If you have installed Bowtie in your path you can simply type "Bowtie" and hit enter, otherwise go to the path where you installed Bowtie first.
Running Bowtie with no arguments will give you a manual page describing which options are available. An easier way is to examine the online Bowtie manual at http://bowtie-bio.sourceforge.net/manual.shtml .
Leave a comment:
-
Originally posted by mskonan View PostHi All,
I got the same error "Reads file contained a pattern with more than 1024 quality values." with Bowtie 0.12.3
My data have 76bp / the same length in quality:
ILLUMINA-1A5BF1 1 8 61 12450 2086 0 1 TGCTGCGCTGTGATTTCTCGCTGGCAGACTTGGGTTGGCTTTGCTGAGGGGACGTGAGACATTGTATCAGGGGCCA bbbbbbbbbbbbbbbbbbbbbbbbbbbcbcbbbbbbbbbbbbbbbbbbb`bbbIbbbb_bbbbabbb]bbbbbbbb 1
After I convert them to fastq format (76/76) like this:
@ILLUMINA-1A5BF1:8:1:1303:18887#0/1
TAGGAGGGTGACCTGAAGAGTGGAAGGAAGAGTCAGGAATACTCAGAAGAACCTGTGCATATAGGCCAGGCCCGAC
+ILLUMINA-1A5BF1:8:1:1303:18887#0/1
aaaa_aaaaaaaaaa]aaYaaaaaaaa_`a_a_a_aaXaa_a`[_aa_`N`aa_`]a]`aXHVV]a`^X]YQHYVa
I got the error.
I guess that the count difference between sequence and quality might not matter for the error.
It'd be greatly appreciated if someone can help me.
Cheers,
KJ
We actually found that we missed some reads in our initial filtering for removing reads with multiple uncalled bases (denoted with "."). It seems that if the read has multiple uncalled bases this is a problem for bowtie and it gives the "Reads file contained a pattern with more than 1024 quality values" error. Once these are removed the program works fine with the same file command.
rich
Leave a comment:
-
Ben,
I have just downloaded Bowtie and can't get it to run. A window opens for bowtie.exe, but then quickly closes down again. This occurs in both Ubuntu and Windows. I suspect I am missing something simple, but would appreciate your help.
Leave a comment:
-
Originally posted by mskonan View PostHi All,
I got the same error "Reads file contained a pattern with more than 1024 quality values." with Bowtie 0.12.3
My data have 76bp / the same length in quality:
ILLUMINA-1A5BF1 1 8 61 12450 2086 0 1 TGCTGCGCTGTGATTTCTCGCTGGCAGACTTGGGTTGGCTTTGCTGAGGGGACGTGAGACATTGTATCAGGGGCCA bbbbbbbbbbbbbbbbbbbbbbbbbbbcbcbbbbbbbbbbbbbbbbbbb`bbbIbbbb_bbbbabbb]bbbbbbbb 1
After I convert them to fastq format (76/76) like this:
@ILLUMINA-1A5BF1:8:1:1303:18887#0/1
TAGGAGGGTGACCTGAAGAGTGGAAGGAAGAGTCAGGAATACTCAGAAGAACCTGTGCATATAGGCCAGGCCCGAC
+ILLUMINA-1A5BF1:8:1:1303:18887#0/1
aaaa_aaaaaaaaaa]aaYaaaaaaaa_`a_a_a_aaXaa_a`[_aa_`N`aa_`]a]`aXHVV]a`^X]YQHYVa
I got the error.
I guess that the count difference between sequence and quality might not matter for the error.
It'd be greatly appreciated if someone can help me.
Cheers,
KJ
Leave a comment:
-
The difference between sequence and quality might not matter for the error
Originally posted by Ben Langmead View PostHi Rich,
Another user just contacted me via email and described something similar. When I ran their reads through bowtie, I realized that part of the problem is that Bowtie is printing the wrong error message. In their case, the error message should have been something more like "Too many quality values for read..." because they had a fastq entry where the quality string was 2 characters longer than the sequence string. Do you notice any inconsistencies like that in your input?
I'll fix the error-message bug.
Thanks,
Ben
I got the same error "Reads file contained a pattern with more than 1024 quality values." with Bowtie 0.12.3
My data have 76bp / the same length in quality:
ILLUMINA-1A5BF1 1 8 61 12450 2086 0 1 TGCTGCGCTGTGATTTCTCGCTGGCAGACTTGGGTTGGCTTTGCTGAGGGGACGTGAGACATTGTATCAGGGGCCA bbbbbbbbbbbbbbbbbbbbbbbbbbbcbcbbbbbbbbbbbbbbbbbbb`bbbIbbbb_bbbbabbb]bbbbbbbb 1
After I convert them to fastq format (76/76) like this:
@ILLUMINA-1A5BF1:8:1:1303:18887#0/1
TAGGAGGGTGACCTGAAGAGTGGAAGGAAGAGTCAGGAATACTCAGAAGAACCTGTGCATATAGGCCAGGCCCGAC
+ILLUMINA-1A5BF1:8:1:1303:18887#0/1
aaaa_aaaaaaaaaa]aaYaaaaaaaa_`a_a_a_aaXaa_a`[_aa_`N`aa_`]a]`aXHVV]a`^X]YQHYVa
I got the error.
I guess that the count difference between sequence and quality might not matter for the error.
It'd be greatly appreciated if someone can help me.
Cheers,
KJLast edited by mskonan; 03-11-2010, 06:32 PM.
Leave a comment:
-
Originally posted by Ben Langmead View PostHi Rich,
Another user just contacted me via email and described something similar. When I ran their reads through bowtie, I realized that part of the problem is that Bowtie is printing the wrong error message. In their case, the error message should have been something more like "Too many quality values for read..." because they had a fastq entry where the quality string was 2 characters longer than the sequence string. Do you notice any inconsistencies like that in your input?
I'll fix the error-message bug.
Thanks,
Ben
That seems to be a likely problem. We took the first 20 or so paired reads and verified the sequence and quality value lengths, and that ran well, with the same command line. We'll go through the FASTQ files and try and find the quality string causing us problems. Thanks to everyone for the helpful suggestions.
rich
Leave a comment:
-
Originally posted by RichEast View PostWe're using Bowtie version 0.12.3, with the command line (running on a command prompt in windows) "Bowtie -n 2 -q --solexa1.3-quals -S Pbindex -1QN_read1 -2QN_read2 QNalign.sam" The FASTQ files are run off a Illunima GA II pipeline 1.4. Thanks.
rich
Another user just contacted me via email and described something similar. When I ran their reads through bowtie, I realized that part of the problem is that Bowtie is printing the wrong error message. In their case, the error message should have been something more like "Too many quality values for read..." because they had a fastq entry where the quality string was 2 characters longer than the sequence string. Do you notice any inconsistencies like that in your input?
I'll fix the error-message bug.
Thanks,
Ben
Leave a comment:
-
Originally posted by RichEast View PostWe're using Bowtie version 0.12.3, with the command line (running on a command prompt in windows) "Bowtie -n 2 -q --solexa1.3-quals -S Pbindex -1QN_read1 -2QN_read2 QNalign.sam" The FASTQ files are run off a Illunima GA II pipeline 1.4. Thanks.
rich
Leave a comment:
-
Originally posted by Ben Langmead View PostCan you please post the Bowtie version you're using, and the command you used to run it?
Thanks,
Ben
rich
Leave a comment:
-
Originally posted by RichEast View PostHello everyone,
We have been using MAQ for our Solexa assembly needs, but we're moving to another program for downstream analysis, and Bowtie seems much easier for upstream assembly. Unfortunately, this means learning another assembly program. I was trying to assemble some data that we have previously assembled and analyzed using MAQ using Bowtie and I'm running into an error I don't really understand. It states "Reads file contained a pattern with more than 1024 quality values." I'm using the -n alignment mode to assemble the paired alignments (and including the input option --solexa-quals), but have also tried in -v alignment mode (which I thought ignored quality values). We didn't have any issues assembling this data with MAQ, so I think I'm just missing something being new to Bowtie. Any help anyone can provide would be greatly appreciated.
Thanks
Leave a comment:
-
Can you please post the Bowtie version you're using, and the command you used to run it?
Thanks,
Ben
Leave a comment:
-
Bowtie quality values error
Hello everyone,
We have been using MAQ for our Solexa assembly needs, but we're moving to another program for downstream analysis, and Bowtie seems much easier for upstream assembly. Unfortunately, this means learning another assembly program. I was trying to assemble some data that we have previously assembled and analyzed using MAQ using Bowtie and I'm running into an error I don't really understand. It states "Reads file contained a pattern with more than 1024 quality values." I'm using the -n alignment mode to assemble the paired alignments (and including the input option --solexa-quals), but have also tried in -v alignment mode (which I thought ignored quality values). We didn't have any issues assembling this data with MAQ, so I think I'm just missing something being new to Bowtie. Any help anyone can provide would be greatly appreciated.
Thanks
Leave a comment:
-
Originally posted by Ben Langmead View PostHi,
Yes, the problem was that versions < 0.12.2 were failing to check for a too-high input for -n and -v. The manual and the usage message both said max=3, but bowtie erroneously didn't enforce it.
Note that the -n option only constrains the number of mismatches in the seed, not in the entire alignment. The key is to set -n, -l and -e to reasonable numbers given your data. Since your reads are 75bp, I would suggest trying a few different settings, perhaps starting with -l 28 (the default) -n 2 and -e 180 and then adjusting all 3 until your getting your desired mix of speed and sensitivity.
Thanks,
Ben
Thank you,
Sean
Leave a comment:
-
I am fairly new to the field of next-gen sequencing but find Bowtie to be fairly user friendlybut I do have a question regarding its use. What is the difference in reporting between the default bowtie and the use of the -a, --strata, and --best flags? I understand that with the flags all of the alignments are reported in a best to work format but what does the default bowtie report? For human sequencing data is there a best set of parameters to use in order to gain enough sensitivity in coverage while keeping the file sizes to a manageable number?
thanks in advance for any help.
Leave a comment:
Latest Articles
Collapse
-
by seqadmin
The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...-
Channel: Articles
05-06-2024, 07:48 AM -
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 06:35 AM
|
0 responses
12 views
0 likes
|
Last Post
by seqadmin
Today, 06:35 AM
|
||
Started by seqadmin, Yesterday, 02:46 PM
|
0 responses
18 views
0 likes
|
Last Post
by seqadmin
Yesterday, 02:46 PM
|
||
Started by seqadmin, 05-07-2024, 06:57 AM
|
0 responses
17 views
0 likes
|
Last Post
by seqadmin
05-07-2024, 06:57 AM
|
||
Started by seqadmin, 05-06-2024, 07:17 AM
|
0 responses
19 views
0 likes
|
Last Post
by seqadmin
05-06-2024, 07:17 AM
|
Leave a comment: