Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • yifangt
    replied
    I had the exactly same problem! It seems the problem is more of -Q33 option as I tried it.
    Two things here:

    1) which is the wrong quality score, i.e. ASCII char when you have the -Q33 on?

    2) our sequence machine was run with the MS-Windows platform and the data was deposited on Linux storage disk. There might be a issue with this cross platform situation, but how to solve it?

    Thanks a lot!

    Leave a comment:


  • TonyBrooks
    replied
    There should be no space between the Q and the 33.

    You also need to specify how you want to trim by setting variables for -f and -l e.g.

    fastx_trimmer -Q33 -f 1 -l 50 -i input.fastq -o output.fastq

    AFAIK Fastq's also need to be uncompressed for input but can be compressed on output using the -z flag.

    Use fastx_trimmer -h for help

    Leave a comment:


  • JackieBadger
    replied
    Just use TRIMMOMATIC.
    Works better with more options

    Leave a comment:


  • dGho
    replied
    same problem

    I am using fastq_quality_ trimmer for the first time and I am having this same problem also on Illumina generated data. Does anyone know what the solution to this is?

    Leave a comment:


  • shuang
    replied
    I use Ubuntu. The in.txt is a fastq file, which was downloaded directly from the sequencing facility website.

    17kb is the expected fully assembled length.

    Leave a comment:


  • GenoMax
    replied
    shuang: Can you tell us what OS you are using to run the fastx_trimmer?

    Did you open/edit the sequence file ("in.txt") in a non-unix OS (e.g. windows/OS X)?

    What do you mean by sample length is 17kb?

    Leave a comment:


  • shuang
    replied
    When I use -Q 33, it didn't go much further, neither. Below is the command and the error message.
    fastx_trimmer -i in.txt -Q 33 -o all.fastq


    fastx_trimmer: Error: invalid quality score data on line 448 (quality_tok = "??????B?D?DDDBB?FFFF;FFBB>FFHHDD?EGH>EFEAECEDHHHHHGHFE:FDGA-CEDHHDBF>AAACF=AAEBGFGF?CDDEHH+=CCCCBDAFF?DF.6BFF66...=B<*645,,5A,5,,53>?;>A,A5==*..8A?*:C"

    Leave a comment:


  • kmcarr
    replied
    Originally posted by shuang View Post
    I'm having problems in running commands of fastx tool. For example, when I ran fastx_trimmer,
    fastx_trimmer -i in.txt -o out.fastq

    I received this error
    fastx_trimmer: Invalid quality score value (char '#' ord 35 quality value -29) on line 4

    Even I tried to add "-Q 64", still the same error.
    fastx_trimmer -i in.txt -Q 64 -o out.fastq


    Why is that??
    Illumina stopped using the Phred+64 offset sometime ago; they now use Phred+33. The fastx toolkit assumes Phred+64 by default so your two examples were exactly the same. As GenoMax said you need to use -Q33.

    Leave a comment:


  • shuang
    replied
    Sorry, I forgot to mention my sample is Illumina. The sample length is 17kb.

    Leave a comment:


  • GenoMax
    replied
    If your data is in sanger format then -Q 33 would be the option to add.

    Leave a comment:


  • shuang
    started a topic fastq_quality_trimmer error

    fastq_quality_trimmer error

    I'm having problems in running commands of fastx tool. For example, when I ran fastx_trimmer,
    fastx_trimmer -i in.txt -o out.fastq

    I received this error
    fastx_trimmer: Invalid quality score value (char '#' ord 35 quality value -29) on line 4

    Even I tried to add "-Q 64", still the same error.
    fastx_trimmer -i in.txt -Q 64 -o out.fastq


    Why is that??

Latest Articles

Collapse

  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin




    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
    04-22-2024, 07:01 AM
  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 11:49 AM
0 responses
15 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-24-2024, 08:47 AM
0 responses
16 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
61 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
60 views
0 likes
Last Post seqadmin  
Working...
X