Unconfigured Ad

**sarvidsson** · 03-02-2015, 01:54 AM

Sorry, overlooked that you were talking about transcriptomes. For transcontigs, standalone InParanoid is worth a try - limited to protein coding transcontigs, however.

**mruhsam** · 03-02-2015, 02:00 AM

Mauve is a good program to do the alignment but at the moment I don't have the input file to use Mauve, i. e. I don't have a file with all the contigs which occur in each of the 15 species. I don't know how I can pull out those contigs to create an input file in the first place.

**sarvidsson** · 03-02-2015, 02:02 AM

See the edited post

**mruhsam** · 03-02-2015, 02:13 AM

I actually had a look at InParanoid but got the impression that my data are compared to the data in their database. Or did I miss something there, i. e. can I use InParanoid to create pairwise comparisons between my data sets?

**sarvidsson** · 03-02-2015, 03:40 AM

Originally posted by mruhsam View Post

I actually had a look at InParanoid but got the impression that my data are compared to the data in their database. Or did I miss something there, i. e. can I use InParanoid to create pairwise comparisons between my data sets?

You can make your own pairwise comparisons - but you need the standalone version (bottom of the page).

**GenoMax** · 03-02-2015, 04:41 AM

@mruhsam: Can you give us an idea of some numbers? How many contigs per species (or are you referring to transcript models assembled as "contigs", so thousands)? What is the size range on those contigs?

**mruhsam** · 03-02-2015, 04:43 AM

There are well over 100.000 contigs for each species varying in length from 200 bp to about 20.000 bp.

**GenoMax** · 03-02-2015, 04:53 AM

You may want to start with the largest of the lot (assuming they were assembled reasonably correctly) say 10K bp or more. If the species are very similar then you could use stringent search criteria. It sounds like depending on what you find you are going to need to some custom parsing of the results.

If you are able to convert these into protein sequences then OrthoMCL (like InParanoid mentioned above) would come in useful: http://www.orthomcl.org/orthomcl/

Topics	Statistics	Last Post
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity by SEQadmin2 Started by SEQadmin2, 07-02-2026, 11:08 AM	0 responses 12 views 0 reactions	Last Post by SEQadmin2 07-02-2026, 11:08 AM
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, 06-30-2026, 05:37 AM	0 responses 14 views 0 reactions	Last Post by SEQadmin2 06-30-2026, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 20 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 54 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM

Unconfigured Ad

Pulling out homologous sequences from different transcriptomes

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News