
**maubp** · 09-23-2010, 08:27 AM

I'd guess there is a missing new line in this bit:

@ILLUMINA-C3C24B_0047:1:1:1052:1111#0

Either that or your readnames start with @ which is not allowed in SAM format?
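One way to test this guess is to scan the SAM file for lines that begin with @ but do not look like header lines. The sketch below is only a heuristic: it assumes header lines start with @ plus a two-letter record type (such as @HD or @SQ) followed by a tab, and it treats any other line starting with @ as a suspect read line. The function name `suspicious_sam_lines` is made up for illustration.

```python
import re

# Header record types look like "@HD", "@SQ", "@RG", "@PG", "@CO".
HEADER_TAG = re.compile(r"@[A-Za-z]{2}")

def suspicious_sam_lines(lines):
    """Yield (line_number, first_field) for non-header lines starting with '@'."""
    for number, line in enumerate(lines, start=1):
        if not line.startswith("@"):
            continue  # ordinary alignment line, nothing to report
        first_field = line.rstrip("\n").split("\t")[0]
        if HEADER_TAG.fullmatch(first_field):
            continue  # looks like a genuine header line
        # Anything else starting with '@' is probably a read name.
        yield number, first_field
```

To run it on a file, something like `for n, name in suspicious_sam_lines(open("paired_end.map")): print(n, name)` should list the offending lines.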

**scami** · 09-23-2010, 10:23 PM

Hi there

Thanks for your reply. I took a closer look at the SAM format, and the number of newlines should be correct. You are right that the @ symbol cannot be used in the read lines. At the moment I am removing them, but I don't understand why bowtie does not do this on its own when it writes SAM output. I mean, it takes FASTQ files as input, which of course have the @ symbol, and then when it writes the SAM file it does not remove that symbol. That sounds a bit weird.

**maubp** · 09-24-2010, 01:32 AM

What version of bowtie are you using? It sounds like a bug if all the read names in the SAM output have a @ at the start.

**scami** · 09-24-2010, 01:47 AM

I am using version 0.12.5. Maybe I did something wrong with the command (I am just beginning to play around with this software, and with this topic, actually).

The command I used is the following:
./bowtie-0.12.5/bowtie -v 2 -k 5 --best --fr -p 8 -I 100 -S --solexa1.3-quals ./indexes/riferimento_pinot --12 exp_47_s_1.fastq_bowtie_pe,exp_47_s_2.fastq_bowtie_pe,exp_47_s_3.fastq_bowtie_pe paired_end.map

Is there anything wrong?

Thanks a lot for your help.

**maubp** · 09-24-2010, 02:59 AM

Well, bowtie 0.12.5 is several months out of date (the current release is 0.12.7), but the release notes don't mention any SAM output bug fixes:

Bowtie: An ultrafast, memory-efficient short read aligner

http://bowtie-bio.sourceforge.net/index.shtml

Could you post the first few reads of your FASTQ files?

P.S. Use the [ code ] and [ /code ] tags to display it nicely on the forum (but without the spaces I have put round the square brackets). If you use the advanced editor then this can be accessed via the # toolbar button. Your original post tried <code> ... <\code> and [ code ] ... [ \code ] which are both wrong - use the other slash.

**scami** · 09-26-2010, 10:27 PM

Hi,

I used a script to convert my FASTQ files so that they could be used with bowtie. I had paired mates in one file, and I used the --12 flag in bowtie. According to the manual, the input file should be in a TAB-separated text format:

Code:

<r>   Comma-separated list of files containing a mix of unpaired and paired-end reads in Tab-delimited format. Tab-delimited format is a 1-read-per-line format where unpaired reads consist of a read name, sequence and quality string each separated by tabs. A paired-end read consists of a read name, sequence of the #1 mate, quality values of the #1 mate, sequence of the #2 mate, and quality values of the #2 mate separated by tabs. Quality values can be expressed using any of the scales supported in FASTQ files. Reads may be a mix of different lengths and paired-end and unpaired reads may be intermingled in the same file. If - is specified, bowtie will read the Tab-delimited reads from the "standard in" filehandle.

Therefore I used a script to convert the FASTQ file, which generated the following output:

Code:

@ILLUMINA-C3C24B_0047:1:1:1052:12086#0/1	TTCCGCGTCCTGACCTCCCCNGTTCAAGTAAGGCAACAACTACATATCCATCCTCTGCGTTAATCCATGTtaant	bbbbbbbbbbbbbbbbbbbaDaaaa```a`bb`bbbbbbbbbbbbbbbbb`bbbbbb_bbbb]bb_cbaaBBBBB	aaatttnggggtnagcaagtaacatacctaaagttgaaacataggnnancnancgagccacannnnannngnnnn	_____]E]]]NNEOO[[ZYV_____________\_________BBBBBBBBBBBBBBBBBBBBBBBBBBBBBBBB

.... all fields are TAB separated. Considering your observations, I think I should modify the script so that it does not output the '@' symbol in the bowtie input file. What do you think?

Thanks a lot

**maubp** · 09-27-2010, 12:03 AM

Originally posted by scami

Therefore I used a script to convert the FASTQ file, which generated the following output:

...

.... all fields are TAB separated. Considering your observations, I think I should modify the script so that it does not output the '@' symbol in the bowtie input file. What do you think?

Yes, remove the @ from your tabular output - you are telling Bowtie the read names all start with an @ character, so it is putting this in the SAM output (which is invalid).

Ideally I think you should also report this as a bug in Bowtie - arguably it should check the readnames don't start with @ when writing SAM format.

In FASTQ files, the @ is just a record marker - not part of the read name.
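For anyone writing a similar converter, here is a minimal sketch of the fix discussed above. It is not scami's actual script (which was not posted); it assumes plain 4-line FASTQ records with mate #1 and mate #2 supplied in the same order, and the function names `read_fastq` and `paired_to_tab` are made up for illustration. It writes the five tab-separated fields described in the manual excerpt, with the @ record marker removed from the read name.

```python
from itertools import islice

def read_fastq(lines):
    """Yield (name, sequence, qualities) from 4-line FASTQ records."""
    it = iter(lines)
    while True:
        record = [line.rstrip("\n") for line in islice(it, 4)]
        if not record:
            return  # end of input
        if len(record) < 4:
            raise ValueError("truncated FASTQ record")
        header, seq, plus, qual = record
        if not header.startswith("@") or not plus.startswith("+"):
            raise ValueError("malformed FASTQ record: %r" % header)
        # The leading '@' is only the record marker, so drop it here.
        yield header[1:], seq, qual

def paired_to_tab(mate1_lines, mate2_lines):
    """Yield one tab-delimited line per read pair, named after mate #1."""
    # Note: zip stops at the shorter input, so unequal files are not detected.
    pairs = zip(read_fastq(mate1_lines), read_fastq(mate2_lines))
    for (name, seq1, qual1), (_, seq2, qual2) in pairs:
        yield "\t".join([name, seq1, qual1, seq2, qual2])
```

Each yielded string can be written out followed by a newline, and the resulting file passed to bowtie with --12 as in the command earlier in the thread.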


Bowtie + samtools problem
