Seqanswers Leaderboard Ad

**volks** · 06-24-2011, 02:39 AM

80bp are not especially long, i would say.
which instrument was used to generate the data?
did you run a quality check, e.g. FastQC?

**kenosaki** · 06-24-2011, 03:12 AM

Thanks volks,
This is an Illumina GAIIx data.
I just want to know if i need to add or change some parameters based on the read length.
Do I use the same command-line for 80 bp reads mapping as I used to use for 35 bp reads? I feel i need to change some parameters based on the read length.
May be I'm wrong.

**DZhang** · 06-25-2011, 07:21 AM

Hi kenosaki,

No need to adjust bowtie parameters for read length.

Douglas

https://www.contigexpress.com

**rorymulv** · 07-01-2011, 12:11 PM

I've found a big improvement from trimming the reads, for aligning some of the longer Illumina reads. Here's a function in zsh that I've used for clipping everything from the right end of a read below a certain read quality.

[ord.awk

# ord.awk --- do ord and chr
# taken from the gawk texinfo manual
# therefore, this may be covered by the GNU Free Documentation License
# the GFDL still allows commercial redistribution, however

# Global identifiers:
# _ord_: numerical values indexed by characters
# _ord_init: function to initialize _ord_
BEGIN { _ord_init() }

function _ord_init( low, high, i, t)
{
low = sprintf("%c", 7) # BEL is ascii 7
if (low == "\a") { # regular ascii
low = 0
high = 127
} else if (sprintf("%c", 128 + 7) == "\a") {
# ascii, mark parity
low = 128
high = 255
} else { # ebcdic(!)
low = 0
high = 255
}

for (i = low; i <= high; i++) {
t = sprintf("%c", i)
_ord_[t] = i
}
}

function ord(str, c)
{
# only first character is of interest
c = substr(str, 1, 1)
return _ord_[c]
}

function chr(c)
{
# force c to be numeric by adding 0
return sprintf("%c", c + 0)
}

#### test code ####
# BEGIN \
# {
# for (;

{
# printf("enter a character: ")
# if (getline var <= 0)
# break
# printf("ord(%s) = %d\n", var, ord(var))
# }
# }

]//

[trimReadsRaw
#! /usr/bin/zsh

# the 'raw' version of this this doesn't subtract 33 from the (raw)
# qualities
# also, it only trims the back end part of the read (not the front)
# cuts off everything from the end less than $1
function trimReadsRaw() {
thresh=$1
awk -f ord.awk \
--source '{name=$0; getline; read=$0;
getline; strand=$0; getline; qual=$0; len=length(qual); start=len;
start=1; minEnd=start+20; end=0;
for (i=len; i>=minEnd; i--) {
if (ord(substr(qual,i,1)) >= '$thresh') { end=i; break; }
}
if ( (end-start) < 20 ) { next; }
print name; print substr(read,start,end-start+1); print strand;
print substr(qual,start,end-start+1);
}' --
}

]//trimReadsRaw

**colindaven** · 07-03-2011, 02:24 AM

If you have genomic data I would use another aligner because bowtie can't deal with indels. Bwa is good for example, as it novoalign,

For transcriptome data you could try adjusting the following settings.

-n/--seedmms <int> max mismatches in seed (can be 0-3, default: -n 2)
-e/--maqerr <int> max sum of mismatch quals across alignment for -n (def: 70)
-l/--seedlen <int> seed length for -n (default: 28)

eg -n 3 -e 100 -l 40

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Today, 08:47 AM	0 responses 12 views 0 likes	Last Post by seqadmin Today, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 59 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad

Announcement

bowtie 80 mer-long mapping

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News