Unconfigured Ad

**maasha** · 01-06-2011, 12:14 AM

This can be done with Biopieces (www.biopieces.org):

Code:

read_fastq -i test.fq | write_454 -o test.fna -q test.fna.qual -x

Cheers,

Martin

**maubp** · 01-06-2011, 02:41 AM

In Biopython the simplest way to do it is like this:

Code:

from Bio import SeqIO
SeqIO.convert("example.fastq", "fastq", "example.fasta", "fasta")
SeqIO.convert("example.fastq", "fastq", "example.qual", "qual")

You can be more cunning if you want to avoid making two passes through the FASTQ, but the above should be pretty fast anyway.

See also http://dx.doi.org/10.1093/nar/gkp1137 - I'd have suggested using EMBOSS seqret which can do FASTQ to FASTA, but I don't think it supports the QUAL format.

**ewilbanks** · 01-06-2011, 09:59 AM

thank you!! The Biopython script did the trick-- even for a python newbie!

**ewilbanks** · 01-06-2011, 10:19 AM

Do you know how to use Biopython to do the reverse? Fasta +qual = fastq?

**maasha** · 01-06-2011, 10:33 AM

Well, Biopieces can do that as well:

Code:

read_454 -i test.fna -q test.qual | write_fastq -o test.fq -x

In fact, Biopieces can also trim sequences based on quality scores by using trim_seq:

Code:

read_454 -i test.fna -q test.qual | trim_seq | write_fastq -o test.fq -x

Martin

**maubp** · 01-06-2011, 02:23 PM

Originally posted by ewilbanks View Post

Do you know how to use Biopython to do the reverse? Fasta +qual = fastq?

Since you asked, yes, most easily done with the PairedFastaQualIterator function in the Bio.SeqIO.QualityIO module:

Code:

from Bio import SeqIO
from Bio.SeqIO.QualityIO import PairedFastaQualIterator
rec_iter = PairedFastaQualIterator(open("Quality/example.fasta"),
                                   open("Quality/example.qual"))
SeqIO.write(rec_iter, "Quality/temp.fastq", "fastq")

This isn't quite as easy as the reverse since we need to take two input files and read over them in sync - and the high level functions in Bio.SeqIO are all intended for just one file. This example is based on the example in the documentation here:

Page Redirection

http://www.biopython.org/DIST/docs/api/Bio.SeqIO.QualityIO-module.html#PairedFastaQualIterator

**ewilbanks** · 01-06-2011, 02:30 PM

Thanks everyone!

@maasha, I'll have to check it out! Does trim_seq accept sanger format qualities or only Solexa?

**maasha** · 01-07-2011, 03:02 AM

trim_seq works on Illumina type qualities.

read_fastq and read_454 convert to Illumina type qualities per default. Phred scores are automagically detected and converted. If you have Solexa scores there is a switch.

write_fastq output Illumina type qualities.

write_454 automagically convertes to decimal scores.

Cheers,

Martin

Topics	Statistics	Last Post
UC San Diego Bioengineers Map Gene Function in Human Stem Cells by SEQadmin2 Started by SEQadmin2, 07-13-2026, 10:26 AM	0 responses 22 views 0 reactions	Last Post by SEQadmin2 07-13-2026, 10:26 AM
New Analysis Splits Leukemia Into 16 Epigenomic Subgroups by SEQadmin2 Started by SEQadmin2, 07-09-2026, 10:04 AM	0 responses 32 views 0 reactions	Last Post by SEQadmin2 07-09-2026, 10:04 AM
Genome-Wide CRISPR Screen Uncovers Unlikely Psoriasis Target by SEQadmin2 Started by SEQadmin2, 07-08-2026, 10:08 AM	0 responses 20 views 0 reactions	Last Post by SEQadmin2 07-08-2026, 10:08 AM
Engineered Protein Motor Takes Its First Steps Along DNA Track by SEQadmin2 Started by SEQadmin2, 07-07-2026, 11:05 AM	0 responses 34 views 0 reactions	Last Post by SEQadmin2 07-07-2026, 11:05 AM

Unconfigured Ad

Split fastq to fasta and qual file?

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News