Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Bowtie1's --12 option in Bowtie2

    Hi all,

    I would like to ask whether Bowtie2 can accept the .fastq in "1-read-per-line format (Tab-delimited format)"? (--12 option in Bowtie1)

    Thanks for the reply.

  • #2
    [FWIW, if it's in a single line, then it's not a fastq formatted file]

    I can't see any '--12' option in the help output of bowtie (could you point to the bowtie manual where it talks about this option?), but do notice that both bowtie and bowtie 2 support the 'raw' single-line file input type, and bowtie2 also accepts Illumina's qseq format:
    Code:
    Bowtie(v1):
    Input:
      -q                 query input files are FASTQ .fq/.fastq (default)
      -f                 query input files are (multi-)FASTA .fa/.mfa
    [b]  -r                 query input files are raw one-sequence-per-line[/b]
    Bowtie2:
     Input:
      -q                 query input files are FASTQ .fq/.fastq (default)
    [b]  --qseq             query input files are in Illumina's qseq format[/b]
      -f                 query input files are (multi-)FASTA .fa/.mfa
    [b]  -r                 query input files are raw one-sequence-per-line[/b]
    The raw format mentioned in that help doesn't seem to be what you want. The bowtie manual states "one sequence per line, without quality values or names. All quality values are assumed to be 40 on the Phred quality scale." No tabs to be found.

    Comment


    • #3
      Thanks gringer!

      Sorry for my unclear description.
      --12 is not an option. As mentioned in the Bowtie1 manual:


      It is one of the choices for read file input.

      Usage:
      bowtie [options]* <ebwt> {-1 <m1> -2 <m2> | --12 <r> | <s>} [<hit>]

      <r>
      Comma-separated list of files containing a mix of unpaired and paired-end reads in Tab-delimited format. Tab-delimited format is a 1-read-per-line format where unpaired reads consist of a read name, sequence and quality string each separated by tabs. A paired-end read consists of a read name, sequnce of the #1 mate, quality values of the #1 mate, sequence of the #2 mate, and quality values of the #2 mate separated by tabs.

      Comment


      • #4
        Huh, not sure how I missed that. I'm guessing based on a search through the bowtie2 manual for the word 'tab' that this option doesn't exist in bowtie2.

        FWIW, you should be able to convert a single-end tab-separated file to fastq with a quick awk script:
        Code:
        $ cat test.seq 
        Read1   ACATCAGGT       AFFABABAA
        Read2   ATTACAGAA       DEADBEEFA
        $ awk -F '\t' '{print "@" $1 "\n" $2 "\n+\n" $3}' test.seq
        @Read1
        ACATCAGGT
        +
        AFFABABAA
        @Read2
        ATTACAGAA
        +
        DEADBEEFA
        $ awk -F '\t' '{print "@" $1 "\n" $2 "\n+\n" $3}' test.seq > test.fastq
        Mixed reads would be a bit more complicated, and probably better output to separate files, so I'd degenerate to writing a Perl (or Python) script to do that because I'm not so comfortable with awk.
        Last edited by gringer; 10-20-2013, 08:01 PM.

        Comment


        • #5
          Thanks for gringer's information and advices.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Quality Control Essentials for Next-Generation Sequencing Workflows
            by seqadmin




            Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.

            Nucleic Acid Quality Control
            Preparing for NGS starts with isolating the...
            02-10-2025, 01:58 PM
          • seqadmin
            An Introduction to the Technologies Transforming Precision Medicine
            by seqadmin


            In recent years, precision medicine has become a major focus for researchers and healthcare professionals. This approach offers personalized treatment and wellness plans by utilizing insights from each person's unique biology and lifestyle to deliver more effective care. Its advancement relies on innovative technologies that enable a deeper understanding of individual variability. In a joint documentary with our colleagues at Biocompare, we examined the foundational principles of precision...
            01-27-2025, 07:46 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 02-07-2025, 09:30 AM
          0 responses
          72 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 02-05-2025, 10:34 AM
          0 responses
          113 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 02-03-2025, 09:07 AM
          0 responses
          90 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 01-31-2025, 08:31 AM
          0 responses
          49 views
          0 likes
          Last Post seqadmin  
          Working...
          X