**maubp** · 05-29-2012, 11:47 AM

If you can't align the sequences because they are too different, you shouldn't make a tree out of them.

**Artem** · 05-29-2012, 12:24 PM

To construct a tree you want the sequences to have homology, i.e. a common evolutionary origin. A good introduction to bioinformatics and trees can be found at the link below. It's targeted at biology students, so it's more straightforward to understand than most bioinformatics texts.

http://helix.biology.mcmaster.ca/courses.html

As to your experiment, a primer pair doesn't only amplify the region you are interested in. It will also amplify any other sequence that happens to match that primer pair, which can occur by chance (remember the genome is not uniform; some sequences are more common than others).

If you amplify a region in many species, in some you may be amplifying one locus, and in others you can amplify a completely different one.

Picture AB cdefghi JK, where AB and JK are your primer pair and ABcdefghiJK is the locus you are interested in. Some species may instead have AB q835%9 JK: a sequence between the same primer sites that is completely unrelated in evolutionary terms, so you shouldn't be building a tree to compare them.
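As a toy illustration of the point above, the sketch below treats the template as a plain string and returns whatever lies between the two primer sites. The function name `amplified_insert` and the flanking characters are made up for the example, and real PCR is more involved (the reverse primer anneals to the opposite strand, mismatches are tolerated, and so on), so read this only as a picture of why matching primers do not imply a related insert.

```python
from typing import Optional


def amplified_insert(template: str, fwd: str, rev: str) -> Optional[str]:
    """Return the text between the first fwd site and the next rev site.

    Toy model only: both primers are searched on the same strand,
    exactly as in the AB ... JK notation used in the post.
    """
    start = template.find(fwd)
    if start == -1:
        return None  # forward primer site not present
    insert_start = start + len(fwd)
    end = template.find(rev, insert_start)
    if end == -1:
        return None  # no reverse primer site downstream
    return template[insert_start:end]


# Two hypothetical templates that share the primer sites AB and JK.
species_1 = "xxABcdefghiJKyy"
species_2 = "zzABq835%9JKww"

print(amplified_insert(species_1, "AB", "JK"))  # cdefghi
print(amplified_insert(species_2, "AB", "JK"))  # q835%9
```

Both templates give a product, but the two inserts have nothing in common, which is exactly the case where an alignment (and therefore a tree) is meaningless.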

Hope that helps.

**Mark** · 05-29-2012, 01:14 PM

What is the purpose of this work (other than the desire to draw a tree)?

**mike.t** · 05-29-2012, 09:27 PM

Try reverse complementing the sequences that don't align with the others and see if they align then.
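If you would rather not depend on a tool for this, reverse complementing is simple enough to sketch in a few lines. The version below is a minimal example that assumes plain DNA made of A, C, G, T and N; other IUPAC ambiguity codes are left unchanged, and the function name is only illustrative.

```python
# Translation table for the complement of each base (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the whole string."""
    return seq.translate(_COMPLEMENT)[::-1]


print(reverse_complement("AAGCT"))  # AGCTT
```

Applying the function twice gives back the original sequence, which is a quick sanity check before re-running the alignment.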

**akbowser** · 06-21-2012, 03:58 PM

Thanks for the replies so far.

The purpose of my work is to identify species within a mixed sample of unknown composition. The problem is that there is no complete reference database for me to use to identify all of my sequences. I figured a tree was my best bet at assigning some type of taxonomic identity to my unknown sequences, but now I'm seeing that some people use operational taxonomic units (OTUs) for this type of work. I started looking into programs that deal with OTUs, but I am already extremely intimidated by the basic programming skills required to run such programs. I don't know where to begin! Please help!

**Wurstmensch** · 06-22-2012, 03:57 AM

You could try a metagenomic program like MEGAN (http://ab.inf.uni-tuebingen.de/software/megan/). In my opinion it is easy to get started with: you only have to BLAST your reads against a sufficiently comprehensive database and import the results into the program. But beware that BLASTing a large number of sequences can take a long time. In addition, some output formats need a lot of disk space, so choosing the right one at the start could save you a lot of time.
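To get a feel for what the BLAST results contain before importing them anywhere, a small script can pull out the top hit per read. This is only a sketch: it assumes tab-separated output with the 12 default columns of BLAST+ `-outfmt 6` (query ID first, subject ID second, bit score last), and the read and subject names in the sample are invented. It is not a replacement for MEGAN's own import.

```python
import csv
import io
from typing import Dict, Iterable, Tuple


def best_hits(lines: Iterable[str]) -> Dict[str, Tuple[str, float]]:
    """Map each query ID to its (subject ID, bit score) with the highest bit score.

    Assumes the 12 default tabular columns; column 12 is the bit score.
    """
    best: Dict[str, Tuple[str, float]] = {}
    for row in csv.reader(lines, delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue  # skip blank lines and comment lines
        query, subject, bitscore = row[0], row[1], float(row[11])
        if query not in best or bitscore > best[query][1]:
            best[query] = (subject, bitscore)
    return best


# Invented example rows, for illustration only.
sample = io.StringIO(
    "read1\tsubjA\t98.0\t100\t2\t0\t1\t100\t5\t104\t1e-45\t180\n"
    "read1\tsubjB\t91.0\t100\t9\t0\t1\t100\t7\t106\t1e-30\t130\n"
    "read2\tsubjC\t88.0\t90\t10\t1\t1\t90\t1\t90\t1e-20\t95.5\n"
)
for read, (subject, score) in best_hits(sample).items():
    print(read, subject, score)
```

With a real file you would pass an open file handle instead of the `io.StringIO` sample.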

**Mark** · 06-22-2012, 04:45 AM

Yes, MEGAN is a useful tool for this. When you say you have a bunch of sequences, do you mean hundreds, thousands, or millions? Note that when using MEGAN one should generally interpret the output as "these sequences are most similar to sequences in taxon X", not "these sequences are from taxon X". This is particularly true the nearer to species level you go (MEGAN can make taxonomic assignments at multiple levels).
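One way to see why assignments end up at different levels is the lowest-common-ancestor idea: if a read hits sequences from several taxa, it is placed at the deepest rank those taxa share. The sketch below is a simplified illustration of that idea and not MEGAN's actual implementation; lineages are given as root-to-tip lists, and the example lineages are abbreviated for readability.

```python
from typing import List, Sequence


def lowest_common_ancestor(lineages: Sequence[Sequence[str]]) -> List[str]:
    """Return the shared root-to-tip prefix of all lineages."""
    if not lineages:
        return []
    common: List[str] = []
    for ranks in zip(*lineages):  # walk the lineages rank by rank
        if all(rank == ranks[0] for rank in ranks):
            common.append(ranks[0])
        else:
            break  # lineages diverge here; stop at the previous rank
    return common


# Abbreviated lineages of two hits for one hypothetical read.
hits = [
    ["Bacteria", "Proteobacteria", "Gammaproteobacteria", "Escherichia"],
    ["Bacteria", "Proteobacteria", "Gammaproteobacteria", "Salmonella"],
]
print(lowest_common_ancestor(hits)[-1])  # Gammaproteobacteria
```

A read with a single good hit would be placed all the way down at that hit's taxon, which is where the "most similar to" caveat matters most.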

Newbie... need help with the basics