Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • earonesty
    replied
    Bowtie 0.12.7 patch for gzip support:

    Leave a comment:


  • simonandrews
    replied
    Thanks for the suggestions Justin. I hadn't realised there was a paired end format already - I might just make up a wrapper for that.

    I'm sorely tempted to just patch in gzip support to bowtie, but don't want to have the hassle of maintaining a fork in perpetuity.

    Leave a comment:


  • BAMseek
    replied
    Originally posted by simonandrews View Post
    ... but have now moved to bowtie for everything (which is also a pain as you can't run paired end gzipped data through it without uncompressing first).
    Hi Simon,

    If the pain is about unzipping the files into temporary storage first, I think there might be some ways to uncompress and feed into Bowtie on the fly.

    One way that I think works would be with named pipes like this:
    Code:
    rm -f pipe1
    rm -f pipe2
    mkfifo pipe1
    mkfifo pipe2
    gunzip -c reads/e_coli_1000_1.fq.gz > pipe1 & \
    gunzip -c reads/e_coli_1000_2.fq.gz > pipe2 & \
    ./bowtie e_coli -1 pipe1 -2 pipe2
    You could also try uncompressing and merging the paired files on-the-fly into the special tab-delimited format accepted by Bowtie, where the columns are

    read_name[tab]sequence_1[tab]quality_1[tab]sequence_2[tab]quality_2
    and pipe that into Bowtie.

    best,
    Justin

    Leave a comment:


  • cnicolet
    replied
    Thanks Simon! We did find some info on that using the --fastq-cluster-count parameter. But it's a little confusing, the manual says "Specify 0 to ensure creation of a single FASTQ file" then later it says "If you need to generate one unique fastq gzipped file for use in a third-party tool, you can set the --fastq-cluster-count option to -1" But I agree with you about Eland, I think our days of using that are over too.

    Leave a comment:


  • simonandrews
    replied
    There's supposed to be an option in 1.8.2 to generate only a single fastq.gz file which might make things a bit easier.

    I'm afraid we gave up trying to work with Eland since 1.8 due to the extra complexity of the run commands. We were perfectly happy with the performance of the old Eland, but have now moved to bowtie for everything (which is also a pain as you can't run paired end gzipped data through it without uncompressing first).

    Leave a comment:


  • cnicolet
    started a topic Casava1.8.2

    Casava1.8.2

    Hi:
    How are people feeling about CASAVA 1.8.2? We are just starting to use it and so far are underwhelmed. The new formats are cumbersome and the way it delivers multiple files is complicating everything we do downstream. We also benchmarked the new ELAND aligner and it performed exactly the same as the old aligner (which in our hands only delivers about 80% of the mapped reads as bwa), despite claims of superiority. Maybe we're not using it correctly? Any tips?

Latest Articles

Collapse

  • seqadmin
    Quality Control Essentials for Next-Generation Sequencing Workflows
    by seqadmin




    Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.

    Nucleic Acid Quality Control
    Preparing for NGS starts with isolating the...
    02-10-2025, 01:58 PM
  • seqadmin
    An Introduction to the Technologies Transforming Precision Medicine
    by seqadmin


    In recent years, precision medicine has become a major focus for researchers and healthcare professionals. This approach offers personalized treatment and wellness plans by utilizing insights from each person's unique biology and lifestyle to deliver more effective care. Its advancement relies on innovative technologies that enable a deeper understanding of individual variability. In a joint documentary with our colleagues at Biocompare, we examined the foundational principles of precision...
    01-27-2025, 07:46 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 02-07-2025, 09:30 AM
0 responses
66 views
0 likes
Last Post seqadmin  
Started by seqadmin, 02-05-2025, 10:34 AM
0 responses
101 views
0 likes
Last Post seqadmin  
Started by seqadmin, 02-03-2025, 09:07 AM
0 responses
81 views
0 likes
Last Post seqadmin  
Started by seqadmin, 01-31-2025, 08:31 AM
0 responses
45 views
0 likes
Last Post seqadmin  
Working...
X