Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
But the read IS mappable (1 mismatch only for 36 bp)! And I can see that when I increase M. I mean, with M 1, it did not map anywhere on the human genome, but it did so when I increased M (keeping all other parameters constant - N, L, i etc. And when I use -a (all) alignments option I get tens of alignments. I find that very strange.
I expect that regardless of the M value, at least one of the alignments would be reported, not necessarily the best one.Last edited by amaer; 05-24-2012, 09:28 PM.
Leave a comment:
-
It seems that it has nothing to do with M if a read is not mappable. Because in this mode, "The search terminates when it can't find more distinct valid alignments, or when it finds M+1 distinct alignments, whichever happens first." As the read is unmappable, the searching will always ends up with the former condition, no matter whatever M is.
Leave a comment:
-
the M option in Bowtie2 beta1.3
Hi,
I have a question about the use of the M parameter in Bowtie2 beta1.3. I understand it's the number of valid, distinct alignments found for each read (M+1 actually). And reports only 1, the best alignment of THOSE examined.
If a read maps to multiple places, in human genome for example, I expect that Bowtie2 to possibly have different best alignment reported for different M values.
Is it possible to have a 36 bp read NOT mapped at all (to human genome) with M =1, but mapped in other places with a higher values for M? If yes, why?
Thanks!
Leave a comment:
-
Duplicate outputs
I have some reads that were reported to map to hg19 uniquely (only one place in the genome) but they have two lines of output. The two lines are completely identical. This happened for many reads. Has anyone else had this problem?
Leave a comment:
-
Is this the right way to use bowtie to align reads from an 1000 genome sample (e.g. NA18526) to the human genome?
First, I downloaded three files from 1000genomes
ftp://ftp.1000genomes.ebi.ac.uk/vol1....filt.fastq.gz
ftp://ftp.1000genomes.ebi.ac.uk/vol1....filt.fastq.gz
ftp://ftp.1000genomes.ebi.ac.uk/vol1....filt.fastq.gz
I downloaded the hg19 index files from
ftp://ftp.cbcb.umd.edu/pub/data/bowt...g19.ebwt.1.zip
ftp://ftp.cbcb.umd.edu/pub/data/bowt...g19.ebwt.2.zip
Unzipped them and put them in the indexes subdirectory.
Then I run the following command on my dual hex-core machine with 48GB RAM:
time ./bowtie -p 12 --chunkmbs 200 -S hg19 -1 NA18526/ERR031854_1.filt.fastq -2 NA18526/ERR031854_2.filt.fastq NA18526.sam
It took about 65min to finish. Am I doing it right?
What is the use of the small ERR031854.filt.fastq at 1000genomes.org??
Thanks a lot in advance!
Leave a comment:
-
Bowtie-Output
Hey! I am running the bowtie aligner on about 1000 files and storing the results in .map files. I want to save the timing results in the output files as well. Any idea how to do this? Thanks a lot!
Leave a comment:
-
No worries Xi Wang. Thanks for your help.
I just submitted this as a bug on their sourceforge site (ID: 3496148). There's also a similar report there too so I know it's not just me having this problem! Hopefully they can fix this easily.
Regards,
Rich
Leave a comment:
-
Originally posted by rfrancis View PostThanks Xi Wang. I just gave that a go on my test data but I still get the read that matches the reference having a truncated ID. Looks like this option (--fullref) only applies to the reference sequence name and not the readID. Any chance someone could replicate this?
Regards,
Rich
Leave a comment:
-
Thanks Xi Wang. I just gave that a go on my test data but I still get the read that matches the reference having a truncated ID. Looks like this option (--fullref) only applies to the reference sequence name and not the readID. Any chance someone could replicate this?
Regards,
RichLast edited by rfrancis; 02-29-2012, 10:19 PM.
Leave a comment:
-
There is an option in Bowtie v0.12.7: --fullref, which can make the output file contain the entire names.
Originally posted by rfrancis View PostDear all,
Has anyone seen this before? I am using bowtie v0.12.7 to align reads from the short read archive which have IDs as follows:
SRR064286.51418 HWI-EAS418:1:5:1357:1070 length=50
In the resultant SAM file where bowtie finds a match, for some reason the ID is truncated to the first space:
SRR064286.51418
However when no match is found the ID is reported in full.
This seems odd, so I would appreciate someone trying to replicate this for me. Below are a couple of reads and a very short sequence to use as a reference. The first read should match but the other should not. Can someone try and align these using bowtie and let me know what you get.
Many thanks in advance.
Reads: Save as test.fq
@SRR064286.10 HWI-EAS418:1:4:1:147 length=50
TGGCTTCTTCTGTCTTCATAAGTTTTTCCAGGCGGTCTTCCAAGTCCAAA
+SRR064286.10 HWI-EAS418:1:4:1:147 length=50
BCBCCCCCCCCA8::>:?:>8!/@:1&7>6@BCBA@CACCA6>!<BB<BA
@SRR064286.11 HWI-EAS418:1:4:1:119 length=50
GGTTGTAGGACAGCATTTCAAGAACTAAACAGAGATGGTTTCGGAACATA
+SRR064286.11 HWI-EAS418:1:4:1:119 length=50
BBABA@BAABB:3707::9</!.B>:76:8;B9BAAAB>BBC<!<BCBB?
Ref: Save as ref.fa and run "bowtie-build ref.fa ref" to make a reference
>testref
ATTTCGATGCGAGCTTATTCGAGGCGTATCGTAGCGAGTGCTAGGGCTAT
TGGCTTCTTCTGTCTTCATAAGTTTTTCCAGGCGGTCTTCCAAGTCCAAA
GCGGATTGCTGATGCGAGCGTAGTCGTAGTGTGCGTATTGCGATTCGATG
Run bowtie with "bowtie --sam ref test.fq test.sam" and check out the SAM file test.sam.
Thanks for your help
Rich
Leave a comment:
-
Bowtie truncates ID line if it has spaces
Dear all,
Has anyone seen this before? I am using bowtie v0.12.7 to align reads from the short read archive which have IDs as follows:
SRR064286.51418 HWI-EAS418:1:5:1357:1070 length=50
In the resultant SAM file where bowtie finds a match, for some reason the ID is truncated to the first space:
SRR064286.51418
However when no match is found the ID is reported in full.
This seems odd, so I would appreciate someone trying to replicate this for me. Below are a couple of reads and a very short sequence to use as a reference. The first read should match but the other should not. Can someone try and align these using bowtie and let me know what you get.
Many thanks in advance.
Reads: Save as test.fq
@SRR064286.10 HWI-EAS418:1:4:1:147 length=50
TGGCTTCTTCTGTCTTCATAAGTTTTTCCAGGCGGTCTTCCAAGTCCAAA
+SRR064286.10 HWI-EAS418:1:4:1:147 length=50
BCBCCCCCCCCA8::>:?:>8!/@:1&7>6@BCBA@CACCA6>!<BB<BA
@SRR064286.11 HWI-EAS418:1:4:1:119 length=50
GGTTGTAGGACAGCATTTCAAGAACTAAACAGAGATGGTTTCGGAACATA
+SRR064286.11 HWI-EAS418:1:4:1:119 length=50
BBABA@BAABB:3707::9</!.B>:76:8;B9BAAAB>BBC<!<BCBB?
Ref: Save as ref.fa and run "bowtie-build ref.fa ref" to make a reference
>testref
ATTTCGATGCGAGCTTATTCGAGGCGTATCGTAGCGAGTGCTAGGGCTAT
TGGCTTCTTCTGTCTTCATAAGTTTTTCCAGGCGGTCTTCCAAGTCCAAA
GCGGATTGCTGATGCGAGCGTAGTCGTAGTGTGCGTATTGCGATTCGATG
Run bowtie with "bowtie --sam ref test.fq test.sam" and check out the SAM file test.sam.
Thanks for your help
RichLast edited by rfrancis; 02-29-2012, 08:57 PM.
Leave a comment:
-
Originally posted by Lien View PostApparently, it is normal that some reads are skipped because they can't align. Just hope the percentage that is skipped isn't too high!
The search tree is exceeding the default available memory.
Leave a comment:
-
Originally posted by biznatch View Postmediator, are you using Bowtie to align RNA-Seq data? You should use Tophat for RNA-Seq data, as Bowtie can't deal with splice sites, which would be why you're getting a low alignment percentage.
Leave a comment:
-
mediator, are you using Bowtie to align RNA-Seq data? You should use Tophat for RNA-Seq data, as Bowtie can't deal with splice sites, which would be why you're getting a low alignment percentage.
Leave a comment:
Latest Articles
Collapse
-
by seqadmin
The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...-
Channel: Articles
11-06-2024, 07:24 PM -
-
by seqadmin
Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...-
Channel: Articles
10-18-2024, 07:11 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 11:09 AM
|
0 responses
23 views
0 likes
|
Last Post
by seqadmin
Today, 11:09 AM
|
||
Started by seqadmin, Today, 06:13 AM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
Today, 06:13 AM
|
||
Started by seqadmin, 11-01-2024, 06:09 AM
|
0 responses
30 views
0 likes
|
Last Post
by seqadmin
11-01-2024, 06:09 AM
|
||
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks
by seqadmin
Started by seqadmin, 10-30-2024, 05:31 AM
|
0 responses
21 views
0 likes
|
Last Post
by seqadmin
10-30-2024, 05:31 AM
|
Leave a comment: