**swaraj** · 04-03-2012, 04:05 AM

I would suggest including the genome in your TopHat run,

so your new command would be

"tophat -p 24 -G genes.gtf /path/to/genome -o K562_1 hg19 reads.fastq"

The genome name is the common prefix of the index files that you generate from the genome FASTA file with bowtie-build:

"bowtie-build genome.fa genome"
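To make the idea of an index prefix concrete, here is a minimal Python sketch that checks whether a prefix points at a complete Bowtie 1 index. It assumes a regular Bowtie 1 index made of six files ending in .1.ebwt, .2.ebwt, .3.ebwt, .4.ebwt, .rev.1.ebwt and .rev.2.ebwt. The function name missing_index_files is made up for illustration and is not part of TopHat or Bowtie.

```python
import os

# Suffixes of the six files that bowtie-build writes for a Bowtie 1 index.
EBWT_SUFFIXES = (".1.ebwt", ".2.ebwt", ".3.ebwt", ".4.ebwt",
                 ".rev.1.ebwt", ".rev.2.ebwt")


def missing_index_files(prefix):
    """Return the index files expected for `prefix` that do not exist.

    `prefix` is the path passed to TopHat, e.g. /path/to/hg19 (no extension).
    """
    return [prefix + suffix for suffix in EBWT_SUFFIXES
            if not os.path.isfile(prefix + suffix)]
```

An empty list from missing_index_files("/path/to/hg19") would suggest that the prefix is usable as the genome argument; any listed file points at an incomplete or misnamed index.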

**Annibal** · 04-03-2012, 04:50 AM

I've included it.
As mentioned above, it is "hg19". These are the .ebwt index files, and bowtie built them correctly from the hg19.fa reference file.

I've tried the same procedure with paired-end fastq reads (2x76) from a different RNA-seq dataset (wgEncodeCshlLongRnaSeqK562CellLongnonpolyaFastqRd1Rep1.fastq.gz and wgEncodeCshlLongRnaSeqK562CellLongnonpolyaFastqRd2Rep1.fastq.gz) and it worked.
Maybe the problem is the format of the 152-nt single-end read data?

Thanks

**Julien Roux** · 04-06-2012, 06:35 AM

Originally posted by Annibal

Maybe the problem is the format of the single end reads data of 152 nt?

This is probably part of the problem, since by default TopHat only allows 2 mismatches over the whole read. I had a similar problem when analyzing 107-bp reads. Switching to --bowtie-n mode might help, since mismatches are then counted only in the seed region (the first 28 bp). Still, I found no way to increase Bowtie's "-e" parameter from the TopHat command line, and I suspect it might be too restrictive for long reads.
If you find a way to improve the alignment, please please keep us informed!
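The difference between the two Bowtie modes can be sketched in Python as follows. This is a simplified model, not Bowtie's actual code: it uses the Bowtie 1 defaults as documented (seed length -l 28, -n 2, -e 70) and ignores Bowtie's default rounding of quality values. The function names and the example read are invented for illustration.

```python
def passes_v_mode(mismatch_positions, max_mismatches=2):
    """-v mode: cap on mismatches over the entire read; qualities are ignored."""
    return len(mismatch_positions) <= max_mismatches


def passes_n_mode(mismatch_positions, quals, max_seed_mismatches=2,
                  seed_len=28, max_qual_sum=70):
    """-n mode: cap on mismatches inside the seed (-l), plus a cap (-e) on the
    summed Phred quality at all mismatched positions of the read."""
    in_seed = [p for p in mismatch_positions if p < seed_len]
    qual_sum = sum(quals[p] for p in mismatch_positions)
    return len(in_seed) <= max_seed_mismatches and qual_sum <= max_qual_sum


# Hypothetical 152-nt read, uniform quality 30, four mismatches past the seed.
quals = [30] * 152
mismatches = [40, 75, 110, 140]

print(passes_v_mode(mismatches))                           # False: 4 > 2
print(passes_n_mode(mismatches, quals))                    # False: 4 * 30 = 120 > 70
print(passes_n_mode(mismatches, quals, max_qual_sum=150))  # True
```

Under these assumptions, a long read with a handful of high-quality mismatches outside the seed is rejected in both modes unless the -e threshold is raised, which matches the suspicion above.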

**anurag.gautam** · 04-09-2012, 04:58 AM

Can anybody help with this error?
I tried to map Illumina paired-end RNA-seq reads of rice to the reference genome.
I ran TopHat with the following command:

/opt/tophat-1.4.1.Linux_x86_64/tophat -p 4 -o output -G osa.gtf /home/anurag.gautam/03_Genomes/Oryza_sativa_Indica/bowtie/osa SRR037735_1.fastq SRR037735_2.fastq
[Mon Apr 9 18:11:39 2012] Beginning TopHat run (v1.4.1)
-----------------------------------------------
[Mon Apr 9 18:11:39 2012] Preparing output location output/
[Mon Apr 9 18:11:39 2012] Checking for Bowtie index files
[Mon Apr 9 18:11:39 2012] Checking for reference FASTA file
[Mon Apr 9 18:11:39 2012] Checking for Bowtie
Bowtie version: 0.12.7.0
[Mon Apr 9 18:11:39 2012] Checking for Samtools
Samtools Version: 0.1.16
[Mon Apr 9 18:11:39 2012] Generating SAM header for /home/anurag.gautam/03_Genomes/Oryza_sativa_Indica/bowtie/osa
format: fastq
quality scale: phred33 (default)
[Mon Apr 9 18:11:39 2012] Reading known junctions from GTF file
Warning: TopHat did not find any junctions in GTF file
[Mon Apr 9 18:11:39 2012] Preparing reads
left reads: min. length=75, count=9884891
right reads: min. length=75, count=9873028
[Mon Apr 9 18:18:36 2012] Creating transcriptome data files..
[FAILED]
Error: gtf_to_fasta returned an error.

Please help with this error.

**Annibal** · 04-19-2012, 01:40 AM

Originally posted by Julien Roux

This is probably part of the problem, since by default TopHat only allows 2 mismatches over the whole read. I had a similar problem when analyzing 107-bp reads. Switching to --bowtie-n mode might help, since mismatches are then counted only in the seed region (the first 28 bp). Still, I found no way to increase Bowtie's "-e" parameter from the TopHat command line, and I suspect it might be too restrictive for long reads.
If you find a way to improve the alignment, please please keep us informed!

I don't know if it works, and I haven't reviewed the code, but I suppose you can trick the program by editing the tophat script, since it only calls the bowtie executable...
At line 717:
if option == "--bowtie-n":
    self.bowtie_alignment_option = "-n"

Just replace the "-n" with "-e your_value -n", and when you run tophat with --bowtie-n it will invoke bowtie with -e your_value -n
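One caveat with this edit, sketched below in Python: whether it works depends on how the tophat script hands its options to Bowtie, which is not confirmed here for v1.4.1. If the command is assembled as a list of arguments, a single string such as "-e 100 -n" would reach Bowtie as one argument rather than three. The value 100, the trailing mismatch count "2" and the variable names are placeholders, not taken from the TopHat source.

```python
import shlex

# Placeholder value; choose the -e threshold that suits your read length.
edited_option = "-e 100 -n"

# If the script builds an argument list, the edited string stays one element...
as_single_arg = ["bowtie", edited_option, "2"]

# ...whereas Bowtie needs the flags and their values as separate arguments.
as_split_args = ["bowtie"] + shlex.split(edited_option) + ["2"]

print(as_single_arg)  # ['bowtie', '-e 100 -n', '2']
print(as_split_args)  # ['bowtie', '-e', '100', '-n', '2']
```

If the first form is what the script produces, the edit would instead need to add "-e" and its value as separate items at the point where the Bowtie command is built.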

**yingzhang** · 05-18-2012, 12:34 PM

I would first check whether the GTF file is in the right format. Then I would check whether enough memory was allocated for TopHat. My job once got killed at this exact step because it used much more memory than I had requested.

Originally posted by anurag.gautam

[Mon Apr 9 18:18:36 2012] Creating transcriptome data files..
[FAILED]
Error: gtf_to_fasta returned an error.

Please help with this error.
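The GTF format check suggested in this reply could start with something like the Python sketch below. It only tests a few basic properties: nine tab-separated fields, sequence names that also occur in the reference FASTA, and the presence of exon lines carrying a transcript_id. The warning "TopHat did not find any junctions in GTF file" in the log might point to a problem of this kind, but that is a guess. The function names are hypothetical, and TopHat's real requirements may go beyond these checks.

```python
def read_fasta_names(fasta_lines):
    """Collect sequence names (first word after '>') from FASTA lines."""
    return {line[1:].split()[0] for line in fasta_lines
            if line.startswith(">") and line[1:].strip()}


def gtf_problems(gtf_lines, reference_names):
    """Return a list of human-readable problems found in the GTF lines."""
    problems = []
    exon_count = 0
    for number, line in enumerate(gtf_lines, start=1):
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        fields = line.split("\t")
        if len(fields) != 9:
            problems.append("line %d: expected 9 tab-separated fields, got %d"
                            % (number, len(fields)))
            continue
        seqname, feature, attributes = fields[0], fields[2], fields[8]
        if seqname not in reference_names:
            problems.append("line %d: sequence '%s' not in reference"
                            % (number, seqname))
        if feature == "exon":
            exon_count += 1
            if "transcript_id" not in attributes:
                problems.append("line %d: exon without transcript_id" % number)
    if exon_count == 0:
        problems.append("no exon features found")
    return problems


# Made-up example: the GTF says 'Chr1' while the FASTA header says 'chr01'.
gtf = ['Chr1\tsrc\texon\t100\t200\t.\t+\t.\tgene_id "g1"; transcript_id "t1";\n']
fasta = [">chr01 some description\n", "ACGT\n"]
print(gtf_problems(gtf, read_fasta_names(fasta)))
```

For real files, pass open file handles for osa.gtf and the genome FASTA instead of the example lists.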

Tophat problem: failing reads alignment
