Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Lv Ray
    Member
    • Jun 2014
    • 42

    a problem about SHRiMP2 solving AB SOLID

    Hi, everyone. I want to do the mapping about AB SOLID data. They are paired-end sequencing. The format of .fastq files show below:
    pair1:
    @SRR586064.1ugc_357_358_MatePair_2x50bp_solid0032_20100528_MP_ugc_357_854_13_97/1
    T30..01121.12.1032100213131122200031222022101302313
    +
    !AB!!?:>@<!@B!;AB?AA@@<2<@?@@<?AB>A?9?:;@@>;-=>>7=@
    pair2:
    @SRR586064.1ugc_357_358_MatePair_2x50bp_solid0032_20100528_MP_ugc_357_854_13_97/2
    G10330000122222033220201000000220002000000000000000
    +
    !@BBABBBB@>?@@(.))35.%-.3((%1+%((82-'.*-*/3*14*'696
    I try use the tool:SHRiMP2, but it is not friendly to solve my datasets. I run the command:
    gmapper-cs -1 pair1.fastq -2 pair2.fastq -L ./test/Sscro --pair-mode opp-in -Q --trim-first --qv-offset 33 >pair12.sam 2>pair12.log
    And I got the info in the end of file .log:
    note: detected fastq format in input file [pair1.fastq]
    - Processing read files [pair1.fastq , pair2.fastq]
    note: quality value format not set explicitly; using PHRED+33
    done r/hr r/core-hr
    There has been a problem reading in the read "SRR586064.1", the quality length exceeds the sequence length!
    Are you using the right executable? gmapper-cs for color space? and gmapper-ls for letter space?


    I have tried some parameters of SHRiMP2, but failed. And I can't find any similar example from SHRiMP2 website. Does anybody get the same problem and have you solve it ? Or maybe I should use another tool, any suggestion?
  • Brian Bushnell
    Super Moderator
    • Jan 2014
    • 2709

    #2
    Looks like the problem is that there are more qualities than colors. There are 50 colors (the leading 'T' or 'G' are fixed) and 51 qualities, but the first quality (!) is a placeholder. I can't remember if that's normal but I haven't worked with Solid in a while. It should be fine if you eliminate the first quality value for every quality line, and leave everything else unchanged.

    Comment

    • Lv Ray
      Member
      • Jun 2014
      • 42

      #3
      Originally posted by Brian Bushnell View Post
      Looks like the problem is that there are more qualities than colors. There are 50 colors (the leading 'T' or 'G' are fixed) and 51 qualities, but the first quality (!) is a placeholder. I can't remember if that's normal but I haven't worked with Solid in a while. It should be fine if you eliminate the first quality value for every quality line, and leave everything else unchanged.
      Thanks, Brian Bushnell. I will try your suggestion.

      Comment

      • Lv Ray
        Member
        • Jun 2014
        • 42

        #4
        Originally posted by Brian Bushnell View Post
        Looks like the problem is that there are more qualities than colors. There are 50 colors (the leading 'T' or 'G' are fixed) and 51 qualities, but the first quality (!) is a placeholder. I can't remember if that's normal but I haven't worked with Solid in a while. It should be fine if you eliminate the first quality value for every quality line, and leave everything else unchanged.
        Brian Bushnell, Thank you for your suggestion, and I have got the main point to solve it. Maybe now the core problem is to try some parameters of SHRiMP2.

        Comment

        Latest Articles

        Collapse

        • GATTACAT
          Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by GATTACAT
          Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
          07-01-2026, 11:43 AM
        • SEQadmin2
          Nine Things a Sample Prep Scientist Thinks About Before Sequencing
          by SEQadmin2


          I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

          Here are nine questions we think about, in roughly the order they matter, before...
          06-18-2026, 07:11 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 07-02-2026, 11:08 AM
        0 responses
        9 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-30-2026, 05:37 AM
        0 responses
        13 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-26-2026, 11:10 AM
        0 responses
        20 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-17-2026, 06:09 AM
        0 responses
        54 views
        0 reactions
        Last Post SEQadmin2  
        Working...