Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • mstagliamonte
    Member
    • Feb 2013
    • 33

    MG-RAST - header format problem?

    Hi everyone,

    after successfully using MG-RAST with assembled data, I am trying to use it with raw fastq illumina sequences. I originally had two fastq files (forward and reverse paired ends); I merged them and uploaded the file.

    I now receive an error message:
    Warning: The unique id count does not match the sequence count. You will not be able to use this file for submission.

    Basically the unique id count is half the number of sequences.
    My reads are ordered as forward and reverse with the following format:

    @HWUSI-EAS1700R:25:FC:6:1:12466:1106 1:Y:0:TTAGGC

    and

    @HWUSI-EAS1700R:25:FC:6:1:12466:1106 2:Y:0:TTAGGC

    My guess is that I may need to modify the header. Any suggestion?

    Thanks
    Max

    - Edit: I should be able to modify the header by myself (I know a little bit of Python), but I am not sure if that is the problem and what my header should be.
    Thanks again
    Max
    Last edited by mstagliamonte; 06-18-2013, 06:12 AM. Reason: Clarify my request
  • Ciaran
    Junior Member
    • Sep 2011
    • 9

    #2
    Only the first part of the header is being used to identify the read,
    Just replace the space with a "_" or other character.


    instead of;
    @HWUSI-EAS1700R:25:FC:6:1:12466:1106 1:Y:0:TTAGGC

    have

    @HWUSI-EAS1700R:25:FC:6:1:12466:1106_1:Y:0:TTAGGC

    Something like

    Code:
    sed 'e/\ /\_/g' seqfile > seqfile_ed

    Comment

    • mstagliamonte
      Member
      • Feb 2013
      • 33

      #3
      Thanks,

      I'll try immediately and let you know.

      Regards,
      Max

      Comment

      • Ciaran
        Junior Member
        • Sep 2011
        • 9

        #4
        sorry,

        Code:
        sed 's/\ /\_/'

        Comment

        • mstagliamonte
          Member
          • Feb 2013
          • 33

          #5
          Hahaha,

          I noticed

          I was not able to fix it, so I've just started running my python script.

          Let's see how it goes

          Comment

          • mstagliamonte
            Member
            • Feb 2013
            • 33

            #6
            Hi, Ciaran,

            many thanks for your advice, it worked.

            Have a nice day
            Max

            Comment

            Latest Articles

            Collapse

            • GATTACAT
              Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
              by GATTACAT
              Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
              07-01-2026, 11:43 AM
            • SEQadmin2
              Nine Things a Sample Prep Scientist Thinks About Before Sequencing
              by SEQadmin2


              I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

              Here are nine questions we think about, in roughly the order they matter, before...
              06-18-2026, 07:11 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, 07-02-2026, 11:08 AM
            0 responses
            9 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-30-2026, 05:37 AM
            0 responses
            12 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-26-2026, 11:10 AM
            0 responses
            20 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-17-2026, 06:09 AM
            0 responses
            54 views
            0 reactions
            Last Post SEQadmin2  
            Working...