
**GenoMax** · 09-16-2021, 12:40 PM

You should be able to pass fastq to all BBtools. Use in=stdin.fq when you are doing that.

**Pedro Olivares** · 09-17-2021, 12:28 AM

Great, this is exactly what I needed. I will test and report back, since the bottleneck will now be efficiently subsetting a large collection of reads. Something like tabix should help. Let's see ...

Thanks again, and sorry if there were duplicate posts. I am not sure how the forum works. None of my previous attempts to post (nor even this thread) returned any sort of notification of their status.

I will post the final solution for the record.
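For readers following along, the subsetting step mentioned in this post could be sketched roughly as below. This is only an illustration under stated assumptions: it uses plain `awk` rather than tabix, it assumes strict 4-line FASTQ records, and the file names (`reads.fq`, `keep.txt`) and read names are hypothetical.

```shell
# Hypothetical inputs: a tiny FASTQ file and a list of read names to keep.
workdir=$(mktemp -d)
printf '@r1\nACGT\n+\nIIII\n@r2\nTTTT\n+\nIIII\n@r3\nGGCC\n+\nIIII\n' > "$workdir/reads.fq"
printf 'r1\nr3\n' > "$workdir/keep.txt"

# First file: load the wanted names into an array.
# Second file: on each header line (lines 1, 5, 9, ...) decide whether the
# whole 4-line record is kept, then print lines while `keep` is true.
subset=$(awk '
  NR == FNR { want[$1]; next }
  FNR % 4 == 1 { name = substr($1, 2); keep = (name in want) }
  keep
' "$workdir/keep.txt" "$workdir/reads.fq")

printf '%s\n' "$subset"
rm -r "$workdir"
```

In a real pipeline the `awk` output would be piped straight into the downstream tool (with `in=stdin.fq`, as suggested above) instead of being captured in a variable.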

**Pedro Olivares** · 09-17-2021, 03:08 AM

I think I might be hitting some sort of bug.

When I pass a fastq file by its path as an argument, the whole thing runs fine. But when I `cat` this same file and pipe it to `tadpole.sh` using in=stdin.fq (or in=stdin.fastq), things seem to run fine up to a point, but then the output is empty.

Here is an example of a working case

Here is an example of the non-working case

Perhaps I need to go about this in a different way that I am missing?

This problem is present in both setups I have access to:
- A Guix-installed BBMap, version 38.90 (the examples are from this one)
- A conda-installed BBMap, version 37.62

Thanks for the help.

**GenoMax** · 09-17-2021, 10:58 AM

I may have bad news. All BBMap tools are supposed to be able to accept input from STDIN, but it appears that `tadpole.sh` may be an exception. This is something Brian (the author of BBMap) may know the answer to. Brian no longer participates in forums, so you could try emailing him directly and see if he responds.

Something like

Code:

zcat file.fq.gz | reformat.sh -Xmx4g in=stdin.fq out=stdout.fa

does work.

**Pedro Olivares** · 09-17-2021, 10:28 PM

Oops, let's see what I can do. Thanks for the information, though. Any hint on how I can find his email address? So far none of the obvious places have worked, and I haven't heard back from him on Twitter.

My hope is that there is an easy fix, since stdin input is not completely broken: the reads are actually loaded and processed, but somewhere downstream in the analysis they are lost.

Also, I just found out that when actually building contigs, input from stdin works perfectly fine! The problem only manifests when the option mode=correct is set.

I will try to dig into the Java code, but it's really far from my comfort zone.

Thanks again.

**GenoMax** · 09-18-2021, 06:00 AM

Brian's email address is in the inline help for bbmap programs. Just run `bbmap.sh` and look through the help.

> The problem only manifests when the option mode=correct is set.

It may be by design then. Error correction requires keeping a large amount of sequence in memory. You could try assigning more RAM with the -Xmx option and see if that works.

Another option is named pipes (FIFOs), but it depends on how much effort you are willing to invest.
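To make the named-pipe idea concrete, here is a minimal sketch of the mechanics only. The paths are hypothetical, a literal two-record FASTQ stands in for the real subsetting command, and `wc -l` stands in for the real consumer. Whether `tadpole.sh` with mode=correct accepts a FIFO has not been tested here.

```shell
# Hypothetical scratch location; the .fq suffix is there so that tools which
# guess the format from the file name would treat the pipe as FASTQ.
workdir=$(mktemp -d)
fifo="$workdir/subset.fq"
mkfifo "$fifo"

# Producer: write the read subset into the pipe in the background.
# (A literal two-record FASTQ stands in for the real subsetting command.)
printf '@r1\nACGT\n+\nIIII\n@r2\nTTTT\n+\nIIII\n' > "$fifo" &

# Consumer: opens the pipe by path, as if it were a regular file.
# `wc -l` is a stand-in; the real consumer would be something like
# tadpole.sh in="$fifo" ... (an untested assumption).
n_lines=$(wc -l < "$fifo" | tr -d ' ')

wait              # let the background producer finish
rm -r "$workdir"  # remove the pipe and the scratch directory
echo "$n_lines"
```

Note that a named pipe, like stdin, can only be streamed once. If a tool needs to read its input more than once, a FIFO would presumably not help either, and writing the subset to a temporary file and passing that path would be the fallback.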

Run Tadpole (BBtools) only on sub-sets of reads from input files

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News