Originally posted by david2
View Post
Unconfigured Ad
Collapse
X
-
n's in data
Hi Ben,
I have a couple of questions regarding how bowtie handles the data. I have a reference genome and a set of reads. My ref sequence has n's and a lot of reads also have n's in between.
How does bowtie handle these n's
1) while building ref genome index?
2) for reads?
Elsewhere in the same thread, I found you sayin that "When Bowtie indexes the reference, it elides non-A/C/G/T characters. So if you index a reference with stretches of Ns, Bowtie will never report an alignment spanning any of the stretches."
Does this mean you chuck n's while building the index? if yes, don't these n's have positional information? are these n's of any importance?
Thanks
Nash
Comment
-
-
Reverse complement
I am working on RNA-Seq data and I aligned the reads to a reference genome with bowtie.
I got the output files in SAM format and now I wonder how can I know for each read if it aligns directly or whether it is the reverse complement that aligns to the reference genome in the SAM format file.
I would really appreciate any help.
Best Regards
alvin
Comment
-
-
Hi All,
I am a new user of NGS softwares. Thanks a lot for providing such a list. Are there any updates????????????
with so many tools (e.g. in the ALign/assemble field), do anyone has any idea on which tool are the best for use (in terms of ease of usage and computation time required)?????
Thanks a lot.
jo
Comment
-
-
Generally, there is not a tool simply better than the others. It depends on what your scientific questions are, what kind of data you have, what the purpose is to analyze the data. For example, Bowtie is not suitable to deal with RNA-seq data.Originally posted by jyoshna.jo View PostHi All,
I am a new user of NGS softwares. Thanks a lot for providing such a list. Are there any updates????????????
with so many tools (e.g. in the ALign/assemble field), do anyone has any idea on which tool are the best for use (in terms of ease of usage and computation time required)?????
Thanks a lot.
jo
Comment
-
-
hg19 and allocation issues
I am new to bowtie and I am having a couple problems. First, I downloaded the hg19 ebwt files and attempted to transfer them to the server where I will be running bowtie but received errors for 5 of the 6 files. Despite the errors the file names still appeared on the server and to check if they were functional I tried a trial run:
./bowtie -c -t hg19 CTGAGCTTGACGCTTTGCTAATATNGTAAGAAGAGAAACTATTAATTATGGCTTTCTAAAATTGAATATCCTTGTACACA
this was the response:
Out of memory allocating plen[] in Ebwt::read() at ebwt.h:3153
Overall time: 00:00:00
What can I do?
Thanks, Greg
Comment
-
-
Hi XiWang,Originally posted by Xi Wang View PostGenerally, there is not a tool simply better than the others. It depends on what your scientific questions are, what kind of data you have, what the purpose is to analyze the data. For example, Bowtie is not suitable to deal with RNA-seq data.
Could you please tell me why is bowtie not suitable for RNA seq? Especially since Bowtie is utilised by tophat software.
Thanks, Pete
Comment
-
-
I think it's because Bowtie doesn't recognize splice junctions. Ie. when you sequence your RNA is often aligns across introns so there is a large gap in the alignment. Tophat uses the Bowtie alignment algorithm but can align across splice junctions. ...or something like that.Originally posted by tonge View PostHi XiWang,
Could you please tell me why is bowtie not suitable for RNA seq? Especially since Bowtie is utilised by tophat software.
Thanks, Pete
Comment
-
-
It is not able to handle spliced reads, as biznatch mentioned.Originally posted by tonge View PostHi XiWang,
Could you please tell me why is bowtie not suitable for RNA seq? Especially since Bowtie is utilised by tophat software.
Thanks, Pete
Just to be clear, Tophat doesn't align anything by itself. Tophat creates a junction reference, where possible splice junctions are represented as contiguous sequences in the FASTA reference, allowing bowtie to map properly to these putative junctions. Take a look at the Tophat paper for more information.Originally posted by biznatch View PostI think it's because Bowtie doesn't recognize splice junctions. Ie. when you sequence your RNA is often aligns across introns so there is a large gap in the alignment. Tophat uses the Bowtie alignment algorithm but can align across splice junctions. ...or something like that.
Comment
-
-
Nowadays, spliced read mappers first split reads into segments, then apply mappers such as Bowtie to map those segments onto the reference genome. The segments belong to a read can be mapped with a long distance between, so that the splice junctions can be detected.
What Nishomer mentioned is an old version of Tophat, when the read length is short.
Comment
-
-
question about bowtie's handling of long reads
Hey guys,
I have a question about bowtie's performance when we increase the length of the reads. Initially i used to run bowtie for reads length = 35. Now I am running the exps with reads length 51.
When I read bowtie's manual, I noticed they say bowtie's performance decreases as the read length increases. On the contrary, i am seeing its performance become better when I shifted from 35 to 51. Could you guys please tell me why? is it normal for bowtie to behave this way??
how short is short reads and how long is long reads (in terms of base pairs) ?
Comment
-
-
Repeats
Does the hg19 index mask repeats in the genome?
I have illumina data that has a large number of repeats. The sequences have been mapped using ELAND and found that ~30% had >10 matches. When using bowtie about 10% have >10 matches. What accounts for this difference? Does the hg19 index mask repeats?
Comment
-
-
Originally posted by gntc View PostDoes the hg19 index mask repeats in the genome?
I have illumina data that has a large number of repeats. The sequences have been mapped using ELAND and found that ~30% had >10 matches. When using bowtie about 10% have >10 matches. What accounts for this difference? Does the hg19 index mask repeats?
Depends whether the index was made from the masked version of hg19 or not. I'm pretty sure the pre-made index from the Bowtie website is made from the non-masked genome. Both masked and non-masked are available here:
"chromFa.tar.gz - The assembly sequence in one file per chromosome.
Repeats from RepeatMasker and Tandem Repeats Finder (with period
of 12 or less) are shown in lower case; non-repeating sequence is
shown in upper case.
chromFaMasked.tar.gz - The assembly sequence in one file per chromosome.
Repeats are masked by capital Ns; non-repeating sequence is shown in
upper case."
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
-
by SEQadmin2
With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.
Introduction
Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...-
Channel: Articles
05-22-2026, 06:42 AM -
-
by SEQadmin2
Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.
Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...-
Channel: Articles
05-06-2026, 09:04 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, Yesterday, 08:59 AM
|
0 responses
14 views
0 reactions
|
Last Post
by SEQadmin2
Yesterday, 08:59 AM
|
||
|
Started by SEQadmin2, 06-02-2026, 12:03 PM
|
0 responses
22 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 12:03 PM
|
||
|
Started by SEQadmin2, 06-02-2026, 11:40 AM
|
0 responses
19 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 11:40 AM
|
||
|
Started by SEQadmin2, 05-28-2026, 11:40 AM
|
0 responses
32 views
0 reactions
|
Last Post
by SEQadmin2
05-28-2026, 11:40 AM
|
Comment