Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • ThePresident
    Member
    • Jun 2012
    • 72

    Bowtie error

    Hello everybody,

    I'm new here, and already have a question. Lately, we have performed some bacterial RNA seq on Illumina HiSeq. I got my data and went for alignment using Bowtie2.

    Honestly, I've never touched Linux before; however, I downloaded Ubundu, and started from scratch. One week later, I was able to run test alignments and everything seemed to work perfectly. So, few hours ago, I performed full alignment of my first replicate and things didn't go that good.

    Here is my command line:

    bowtie2 -q -t -p 6 -D 20 -x IndexFile -U FastqFile -S OutputFile

    It took 2 hours to align on 8Go RAM and 6core desktop and it generated an output file of 6Gb. However, an error was associated with the alignment:

    Error: Read HWI-ST766:125:COPRMACXX:6:1306:11865:38045 1:N:0:ATNACG has more quality values than read characters!

    I tried to look for posts about this error here but found nothing. Tried to google it, same thing. Anybody knows what this error is about? Can I still trust my alignment?

    Thanks a lot people!
  • ThePresident
    Member
    • Jun 2012
    • 72

    #2
    Oh yeah, another thing: I'm not sure if I had to trim 5' or 3' ends of my reads. In fact, we performed signle reads 50pb RNA-seq and it wasn't directional (so I guess same adaptors on 5' and 3'). For the alignment, I took directly my sequence files that I obtained from our Seq Department, so I'm not sure if the trimming has been done or I have to do it? Perhaps, thats the problem...

    Comment

    • JChase
      Member
      • Jun 2012
      • 17

      #3
      I'm no expert, but I think the first thing you should do is find out to what degree your sequencing facility processes the reads before they're handed over to you. Regardless of what they do, I think it's probably a good idea to take a look at the quality scores with something like FastQC and trim bad quality or adapter contamination with something like AdaptCut before aligning.

      Now, as for your specific problem, you should be able to take a look at the fastq file that your reads are in and see what's going on with this one particular read that's giving you an error. (Look up the grep command if you're not sure how to do that.) If it's just one read causing a problem, and you can't easily see what the issue is, I would just delete that read.

      Comment

      • ThePresident
        Member
        • Jun 2012
        • 72

        #4
        JChase, thanks a lot for your answer. Indeed, I will check with my sequencing facility about their output file just to be sure how do they handle it. I've already performed a FastQC and it seems okay to me. I'll try to check for trimming my sequences.

        Finaly, I do agree that the easiest way of dealing with this problem is to delete that particular read from the Fastq file. However, the Fastq file is like 3Gb, so it takes hours before it opens with gedit and mostly it crushes before it actually finish opening. Is there a simple way to search within a file without actually opening it?

        In the other way, I tried to ignore this error and just continue with SAM to BAM and sorting, but got another error... I'll post another thread for that one.

        Thanks again!

        Comment

        • JChase
          Member
          • Jun 2012
          • 17

          #5
          I would use grep -n to get the line number (http://www.unix.com/shell-programmin...-filename.html) and then use sed to delete those linenumbers from the file (http://snipplr.com/view/6152/); be sure you delete the line that the read name is on as well as the lines associated with it, because fastq reads are not on a single line. Sed can be slow on big files, because it will go through the entire file, but it's better than trying to edit the file manually.

          Comment

          • manianslab
            Junior Member
            • Jun 2012
            • 9

            #6
            I had to delete some lines from a file once like this. I used the sed command. See this thread: http://www.unix.com/shell-programmin...xtra-line.html

            Comment

            • ThePresident
              Member
              • Jun 2012
              • 72

              #7
              Thank you so much guys. I'm working on it right now, hope it will work!

              Cheers

              Comment

              • ThePresident
                Member
                • Jun 2012
                • 72

                #8
                It worked perfectly. Commands:

                grep -n STRING_TO_SEARCH *file (search for a line)

                and after you go:

                sed -i 2,+3d *file: it will go to the second line and from there delete 3 other lines (in this exemple, it will delete line 3,4 and 5)!

                Thanks again!

                Comment

                Latest Articles

                Collapse

                • SEQadmin2
                  Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                  by SEQadmin2


                  I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                  Here are nine questions we think about, in roughly the order they matter, before...
                  06-18-2026, 07:11 AM
                • SEQadmin2
                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                  by SEQadmin2


                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                  ...
                  06-02-2026, 10:05 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, 06-17-2026, 06:09 AM
                0 responses
                26 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-09-2026, 11:58 AM
                0 responses
                43 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-05-2026, 10:09 AM
                0 responses
                48 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-04-2026, 08:59 AM
                0 responses
                49 views
                0 reactions
                Last Post SEQadmin2  
                Working...