Split fastq to fasta and qual file?


  • maasha
    replied
    trim_seq works on Illumina-type qualities.

    read_fastq and read_454 convert to Illumina-type qualities by default. Phred scores are automagically detected and converted. If you have Solexa scores, there is a switch for that.

    write_fastq outputs Illumina-type qualities.

    write_454 automagically converts to decimal scores.



    Cheers,


    Martin
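
    For readers unsure what these conversions involve, here is a minimal Python sketch. It is not how Biopieces does it internally (the thread does not show that). It assumes the commonly documented ASCII offsets (33 for Sanger FASTQ, 64 for the older Illumina and Solexa FASTQ variants) and the standard Solexa-to-Phred formula; the function names are made up for illustration.

```python
import math

def decode_quality(qual_string, offset=33):
    """Turn a FASTQ quality string into a list of decimal scores."""
    return [ord(ch) - offset for ch in qual_string]

def encode_quality(scores, offset=33):
    """Turn a list of decimal scores back into a FASTQ quality string."""
    return "".join(chr(score + offset) for score in scores)

def solexa_to_phred(q_solexa):
    """Convert a Solexa score to the nearest integer Phred score."""
    return round(10 * math.log10(10 ** (q_solexa / 10) + 1))

# Sanger string -> decimal scores (as in a QUAL file) -> offset-64 string.
scores = decode_quality("II5!", offset=33)
print(scores)                             # [40, 40, 20, 0]
print(encode_quality(scores, offset=64))  # hhT@
```

    Solexa and Phred scores nearly agree at high quality and differ mostly for low scores, which is why a separate switch for Solexa input makes sense.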



  • ewilbanks
    replied
    Thanks everyone!

    @maasha, I'll have to check it out! Does trim_seq accept Sanger-format qualities or only Solexa?



  • maubp
    replied
    Originally posted by ewilbanks:
    Do you know how to use Biopython to do the reverse? Fasta +qual = fastq?

    Since you asked, yes. This is most easily done with the PairedFastaQualIterator function in the Bio.SeqIO.QualityIO module:

    Code:
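    from Bio import SeqIO
    from Bio.SeqIO.QualityIO import PairedFastaQualIterator

    # Open both inputs together so they are read in sync and closed afterwards.
    with open("Quality/example.fasta") as fasta, open("Quality/example.qual") as qual:
        rec_iter = PairedFastaQualIterator(fasta, qual)
        # Write each paired record out as Sanger FASTQ.
        SeqIO.write(rec_iter, "Quality/temp.fastq", "fastq")
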
    This isn't quite as easy as the reverse, since we need to take two input files and read over them in sync, and the high-level functions in Bio.SeqIO are all intended for just one file. This example is based on the example in the documentation.
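
    To make "reading over them in sync" concrete, here is a plain-Python sketch of the same pairing idea without Biopython. It is not Biopython's implementation. The function names are hypothetical, and it assumes both files list the same reads in the same order and that the output should use Sanger (offset 33) encoding.

```python
import io
from itertools import zip_longest

def read_records(handle):
    """Yield (name, lines) for each ">"-headed record in a FASTA or QUAL stream."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, chunks
            name, chunks = line[1:], []
        elif line:
            chunks.append(line)
    if name is not None:
        yield name, chunks

def fasta_qual_to_fastq(fasta, qual, out, offset=33):
    """Pair FASTA and QUAL records one by one and write FASTQ; return the count."""
    count = 0
    for fa, ql in zip_longest(read_records(fasta), read_records(qual)):
        if fa is None or ql is None:
            raise ValueError("FASTA and QUAL have different numbers of records")
        (fa_name, seq_lines), (ql_name, score_lines) = fa, ql
        # Compare only the identifier (first word) of each header.
        if fa_name.split()[:1] != ql_name.split()[:1]:
            raise ValueError(f"record mismatch: {fa_name!r} vs {ql_name!r}")
        seq = "".join(seq_lines)
        scores = [int(s) for s in " ".join(score_lines).split()]
        if len(seq) != len(scores):
            raise ValueError(f"length mismatch in record {fa_name!r}")
        quality = "".join(chr(s + offset) for s in scores)
        out.write(f"@{fa_name}\n{seq}\n+\n{quality}\n")
        count += 1
    return count

# In-memory demonstration; real use would pass open file handles instead.
out = io.StringIO()
fasta_qual_to_fastq(io.StringIO(">read1\nACGT\n"),
                    io.StringIO(">read1\n40 40 20 0\n"), out)
print(out.getvalue(), end="")  # @read1 / ACGT / + / II5!
```

    The checks on record names and lengths are the reason the two files have to be walked together rather than converted one after the other.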



  • maasha
    replied
    Well, Biopieces can do that as well:

    Code:
    read_454 -i test.fna -q test.qual | write_fastq -o test.fq -x

    In fact, Biopieces can also trim sequences based on quality scores by using trim_seq:


    Code:
    read_454 -i test.fna -q test.qual | trim_seq | write_fastq -o test.fq -x


    Martin
    Last edited by maasha; 01-06-2011, 10:36 AM.
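
    As a rough illustration of what trimming on quality scores means, here is a generic Python sketch that clips low-scoring bases from both ends of a read. This is not the algorithm used by trim_seq or by LUCY; the function name and the threshold of 20 are arbitrary choices for the example.

```python
def trim_ends(seq, scores, min_score=20):
    """Clip bases scoring below min_score from both ends of a read.

    Returns the trimmed (sequence, scores) pair. Low scores in the
    interior of the read are left alone.
    """
    start, end = 0, len(scores)
    while start < end and scores[start] < min_score:
        start += 1
    while end > start and scores[end - 1] < min_score:
        end -= 1
    return seq[start:end], scores[start:end]

print(trim_ends("ACGTAC", [5, 30, 30, 10, 30, 2]))
# ('CGTA', [30, 30, 10, 30])
```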



  • ewilbanks
    replied
    Do you know how to use Biopython to do the reverse? Fasta +qual = fastq?



  • ewilbanks
    replied
    Thank you!! The Biopython script did the trick, even for a Python newbie!



  • maubp
    replied
    In Biopython the simplest way to do it is like this:

    Code:
    from Bio import SeqIO
    # Each call makes one pass over the FASTQ file and returns the record count.
    SeqIO.convert("example.fastq", "fastq", "example.fasta", "fasta")
    SeqIO.convert("example.fastq", "fastq", "example.qual", "qual")

    You can be more cunning if you want to avoid making two passes through the FASTQ, but the above should be pretty fast anyway.

    See also http://dx.doi.org/10.1093/nar/gkp1137 - I'd have suggested using EMBOSS seqret which can do FASTQ to FASTA, but I don't think it supports the QUAL format.
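
    For anyone curious what avoiding the second pass could look like, here is a plain-Python sketch that writes the FASTA and QUAL output while reading the FASTQ once. It deliberately avoids Biopython so it runs anywhere. It is an assumption-laden illustration, not maubp's code: the function name is hypothetical, and it expects four-line FASTQ records with Sanger (offset 33) qualities.

```python
import io

def split_fastq(fastq, fasta_out, qual_out, offset=33):
    """Copy each FASTQ record to FASTA and QUAL in a single pass.

    Assumes four-line records. Returns the number of records written.
    """
    count = 0
    while True:
        header = fastq.readline().rstrip("\r\n")
        if not header:
            break  # end of input (or a blank line)
        seq = fastq.readline().rstrip("\r\n")
        fastq.readline()  # the "+" separator line carries no new information
        qual = fastq.readline().rstrip("\r\n")
        if not header.startswith("@") or len(seq) != len(qual):
            raise ValueError(f"malformed FASTQ record near {header!r}")
        name = header[1:]
        # QUAL files store space-separated decimal scores.
        scores = " ".join(str(ord(ch) - offset) for ch in qual)
        fasta_out.write(f">{name}\n{seq}\n")
        qual_out.write(f">{name}\n{scores}\n")
        count += 1
    return count

# In-memory demonstration; real use would pass open file handles instead.
fasta, qual = io.StringIO(), io.StringIO()
split_fastq(io.StringIO("@read1\nACGT\n+\nII5!\n"), fasta, qual)
print(fasta.getvalue(), end="")  # >read1 / ACGT
print(qual.getvalue(), end="")   # >read1 / 40 40 20 0
```

    With real files, the three arguments would be handles from open("example.fastq"), open("example.fasta", "w") and open("example.qual", "w").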



  • maasha
    replied
    This can be done with Biopieces (www.biopieces.org):

    Code:
    read_fastq -i test.fq | write_454 -o test.fna -q test.fna.qual -x

    Cheers,


    Martin



  • ewilbanks
    started a topic Split fastq to fasta and qual file?


    Hi all,

    Does anyone have or know about good scripts to split a Sanger-format FASTQ file into the corresponding FASTA and QUAL files? I have a dataset that I'd like to quality trim with LUCY, but I can't figure out how to split it apart! I've tried using the app on the Galaxy page, but it's producing weird errors that I don't understand. Any help much appreciated!!

    -Lizzy
