BBMap for BitSeq

dietmar13

Senior Member

Join Date: Mar 2010

Posts: 107
#1


04-29-2015, 09:48 PM

Hello,

I want to use BBMap to estimate transcript expression levels from RNA-seq reads. The reads will be mapped against transcripts, not the genome.

The BitSeq manual uses Bowtie with the following options:

Code:

bowtie -q -v 3 -3 0 -p 4 -a -m 100 --sam

up to 3 mismatches allowed (-v 3)
no 3' trimming (-3 0) and no 5' trimming; both are the defaults
all alignments reported (-a), but only for reads with at most 100 possible mapping positions (-m 100)

What should the corresponding BBMap parameters look like?

Code:

bbmap.sh ambiguous=all maxsites2=100 ( secondary=TRUE sssr=0.95 maxsites=100 )

Is there a parameter for the number of mismatches allowed per read?
Would "secondary=TRUE sssr=0.95 maxsites=100" be a good idea for mapping reads to the human transcriptome? Roughly how many mismatches does sssr=0.95 correspond to for 75-base reads: about 2?

Should the reads be trimmed and quality-filtered beforehand, and if so, with which parameters?

Dietmar
Brian Bushnell

Super Moderator

Join Date: Jan 2014

Posts: 2709
#2

04-30-2015, 08:40 AM

I imagine the main reason for specifying 3 mismatches with Bowtie is that 3 is the maximum it allows.

BBMap normally adjusts sensitivity with the minid/minratio parameter, though you can optionally set "subfilter=3" to ban alignments with more than 3 substitutions. I don't see how that would be beneficial to transcriptome mapping, though.

I would suggest:

"bbmap.sh (file parameters) ambig=all maxsites=100 maxindel=100"

...and if you really want, add "subfilter=3". For transcriptome mapping there's no reason to allow the default maxindel=16000. And there is not much need to apply any quality-trimming or filtering unless you add the "subfilter" flag and have low-quality data, though you can do that if you want with "qtrim=rl trimq=10" to trim both ends to Q10. I do, however, always recommend adapter-trimming, particularly when requiring high-identity alignments.
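
For anyone assembling this into a script, here is a minimal sketch of how the suggested command could be put together with the optional flags switched on or off. The file names (transcripts.fa, reads.fq, mapped.sam) and the helper function build_bbmap_cmd are hypothetical placeholders, and ref=/in=/out= are assumed here as the "(file parameters)"; the mapping flags themselves are the ones named above. The script only prints the command so it can be checked before running.

```shell
#!/usr/bin/env bash
# Sketch only: file names and the helper name are hypothetical placeholders.
set -euo pipefail

build_bbmap_cmd() {
    # $1 = transcriptome FASTA, $2 = reads FASTQ, $3 = output SAM
    # $4 = optional max substitutions (empty string skips subfilter)
    # $5 = optional quality-trim threshold (empty string skips trimming)
    local ref="$1" reads="$2" out="$3" max_subs="${4:-}" trimq="${5:-}"

    # Core suggestion: report all ambiguous sites, cap at 100 sites,
    # and limit indels to 100 bp since transcripts contain no introns.
    local cmd=(bbmap.sh "ref=$ref" "in=$reads" "out=$out"
               ambig=all maxsites=100 maxindel=100)

    # Optional: ban alignments with more than max_subs substitutions.
    if [ -n "$max_subs" ]; then
        cmd+=("subfilter=$max_subs")
    fi

    # Optional: quality-trim both read ends to the given Q score.
    if [ -n "$trimq" ]; then
        cmd+=(qtrim=rl "trimq=$trimq")
    fi

    # Print the command instead of executing it.
    printf '%s\n' "${cmd[*]}"
}

# Plain version, then the stricter version with subfilter=3 and Q10 trimming.
build_bbmap_cmd transcripts.fa reads.fq mapped.sam
build_bbmap_cmd transcripts.fa reads.fq mapped.sam 3 10
```

To actually run the mapping, paste the printed line into a shell once the paths point at real files. Adapter trimming is not covered by this sketch and would be done as a separate step beforehand.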