Seqanswers Leaderboard Ad

**gringer** · 02-09-2012, 09:28 AM

Can we assume that the beginning of the transcript would be the beginning of the transcription start site for the gene?

This is unlikely. Trinity assembles the whole RNA, including untranslated regions. If you want to look at protein predictions de-novo, then you need something like frameDP to correct for frame shift errors and identify the most likely start/stop sites for translation.

**Simon Anders** · 02-09-2012, 12:35 PM

Are you sure you were in the right reading frame when checking for stop codons? Look for start codons to fix the reading frame.

Can we assume that the beginning of the transcript would be the beginning of the transcription start site for the gene?

I would say, possibly yes, if you meant to say transcription start site. But gringer might have been right in assuming that you meant the translation start.

**gringer** · 02-09-2012, 03:59 PM

Originally posted by Simon Anders View Post

I would say, possibly yes, if you meant to say transcription start site. But gringer might have been right in assuming that you meant the translation start.

Whoops, yes, I read that wrong. Just to add to this, depending on the mapping quality the start point may not necessarily be exactly at the transcript start location.

**StopCodon** · 02-10-2012, 03:48 AM

Thanks. That clarifies some concepts. Yes, I had meant the Translation start site instead of transcription. Now my data makes more sense as well since very few of the assembled transcripts were starting with a start codon. Is FrameDP a good choice for finding peptide sequences? I saw it had only 10 citations to date. Are there any other tools for doing this?

Thanks.

**gringer** · 02-10-2012, 03:52 AM

Is FrameDP a good choice for finding peptide sequences? I saw it had only 10 citations to date. Are there any other tools for doing this?

Have a look here for other options:

Any pipeline to find automatically ORF in consensus sequences? - SEQanswers

http://seqanswers.com/forums/showthread.php?t=4036

Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc

FrameDP generates huge amounts of files, something like 5-10 times the number of transcripts, so you need to be careful with that. My computer claimed to have run out of space (with ~300GB free) due to me not being careful enough running that program.

**GillermoPonz** · 07-08-2015, 02:58 AM

I'm having the same problem just now. When I translate the assembled transcripts (in the 6 pssible reading frames), they are full of stop codons and I don't know what to do. I've been thinking about not normalizing the libraries when assembling in Trinity but I don't know if it make sense...

Did you finally solve the problem?

Thank you

Topics	Statistics	Last Post
ASHG 2024 Highlights – Part Two by seqadmin Started by seqadmin, Today, 11:09 AM	0 responses 24 views 0 likes	Last Post by seqadmin Today, 11:09 AM
ASHG 2024 Highlights – Part One by seqadmin Started by seqadmin, Today, 06:13 AM	0 responses 20 views 0 likes	Last Post by seqadmin Today, 06:13 AM
Seq-Scope Expands Possibilities for High-Resolution Gene Expression Analysis by seqadmin Started by seqadmin, 11-01-2024, 06:09 AM	0 responses 30 views 0 likes	Last Post by seqadmin 11-01-2024, 06:09 AM
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks by seqadmin Started by seqadmin, 10-30-2024, 05:31 AM	0 responses 21 views 0 likes	Last Post by seqadmin 10-30-2024, 05:31 AM

Seqanswers Leaderboard Ad

Announcement

Transcripts from RNA-seq assembly

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News