**simonandrews** · 09-08-2009, 11:28 PM

I would use this term to describe a read that maps to only one location in a genome at a given number of allowed mismatches. Ideally the match would be a unique exact match, but if the read contained a single SNP it would still count as a uniquely mapped read, so long as there was no other place in the genome it could match with only one mismatch.

You normally find that once you get above 2 mismatches in a 36bp read you're very unlikely to be able to map it uniquely, so the majority of uniquely mapping reads will be exact matches or have just 1 or 2 mismatches.
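As a rough illustration of that definition, here is a minimal brute-force sketch in Python. It is not how a real aligner works: it assumes ungapped, forward-strand-only matching, and the function names (`find_hits`, `is_uniquely_mapped`) and the toy genome are made up for the example.

```python
def hamming(a: str, b: str) -> int:
    """Count mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_hits(read: str, genome: str, max_mismatches: int):
    """Return (position, mismatches) for every ungapped placement of the
    read on the forward strand with at most max_mismatches mismatches."""
    hits = []
    for pos in range(len(genome) - len(read) + 1):
        d = hamming(read, genome[pos:pos + len(read)])
        if d <= max_mismatches:
            hits.append((pos, d))
    return hits


def is_uniquely_mapped(read: str, genome: str, max_mismatches: int) -> bool:
    """A read counts as uniquely mapped if exactly one placement
    satisfies the mismatch threshold."""
    return len(find_hits(read, genome, max_mismatches)) == 1


if __name__ == "__main__":
    genome = "ACGTACGTTTGACCA"  # toy sequence, purely illustrative
    for read in ("TTGACC", "TTGACG", "ACGT"):
        print(read, find_hits(read, genome, 1), is_uniquely_mapped(read, genome, 1))
```

Note that the answer depends on the threshold: a read that is unique at 0 mismatches can become non-unique once more mismatches are allowed, because more placements pass the cutoff.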

**Patrick** · 09-08-2009, 11:51 PM

Hi Simon,

Thanks a lot for your explanation. It is very clear and easy to understand, and it has resolved my doubts.

In your opinion, does the definition of uniquely mapped reads that you just explained also apply to longer reads, such as 454 or Sanger reads?
I have been reading some bioinformatics journal papers recently.
Some scientists use uniquely mapped reads to assemble a high-quality consensus sequence of a specific organism's genome.
Do you know why they use uniquely mapped reads for that purpose?
Thanks again for your help.

**westerman** · 09-09-2009, 05:52 AM

In your opinion, the definition for the uniquely mapped reads that you explained to me just now. Is it also applied for the long base pair read, like 454,Sanger read,etc?

Certainly it can. If I had a bunch of long 1000-base Sanger reads, they could be mapped either uniquely or non-uniquely to a reference genome. Depending on the number of SNPs expected, the number of allowed mismatches may need to be raised.

Do you know what is the purpose that scientist use the uniquely mapped read to assemble a high-quality consensus sequence of some specific organism's genome?

Probably for the same reason that anyone wants a sequence -- in order to find out what makes that specific organism's genome different than other genomes ... SNPs, InDels, unique genes, unique control mechanisms, etc.

It may be obvious but you can assemble a sequence either via:

1) De-novo assembly
or
2) Mapping unique reads onto a reference
or
3) Mapping unique and non-unique reads onto a reference
or
4) A combination of the above

**Patrick** · 09-09-2009, 05:00 PM

Thanks a lot, westerman.
Your reply helps me understand better how scientists analyze the data.

Sadly, I am still not very clear about why scientists would use the uniquely mapped reads from organism A's genome to assemble a high-quality consensus sequence of organism B's genome.
What is the purpose of analyzing the data this way?
Do you know what the general pipeline is for analyzing 454 or Illumina data?
I really appreciate your suggestions and opinions.

**lh3** · 09-10-2009, 12:41 AM

It is easy to define uniqueness when you require the entire read to be aligned without gaps. But things get complicated when you allow clipping and gaps: both depend on the underlying scoring system, so uniqueness depends on the scoring system too. In addition, although we could define a read as unique whenever its best match scores strictly higher than its second-best match (that is, the best two matches do not have identical scores), such a definition is not useful in practice. What if the second-best match has a score that is lower by just 1 or 2?
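One way to make that concern concrete is to classify a read by the gap between its best and second-best alignment scores instead of by a strict tie. The sketch below is only an illustration under assumptions: the function name `classify_by_score_gap`, the `min_gap` parameter, and the example scores are hypothetical and do not come from any particular aligner. Many aligners address the same problem by reporting a mapping quality rather than a yes/no uniqueness flag.

```python
def classify_by_score_gap(scores, min_gap=1):
    """Classify a read from the alignment scores of its candidate hits.

    scores  : list of alignment scores, higher is better (hypothetical input)
    min_gap : how far the best score must exceed the second best for the
              read to be called unique; min_gap=1 reproduces the strict
              "best two scores are not identical" definition
    """
    if not scores:
        return "unmapped"
    ranked = sorted(scores, reverse=True)
    if len(ranked) == 1:
        return "unique"
    gap = ranked[0] - ranked[1]
    return "unique" if gap >= min_gap else "ambiguous"


if __name__ == "__main__":
    print(classify_by_score_gap([50, 50]))             # tie between best two hits
    print(classify_by_score_gap([50, 49]))             # unique under the strict definition
    print(classify_by_score_gap([50, 49], min_gap=3))  # ambiguous once a margin is required
```

With `min_gap=1` a read whose runner-up is only one point behind is still called unique, which is exactly the case questioned above; raising `min_gap` trades some mapped reads for more confident placements.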

**Patrick** · 09-10-2009, 12:44 AM

Thanks for your reply.
What you mention makes sense too.
I will try to find out more about "uniquely mapped reads" and share it with everybody.



Unique mapped reads definition confusing...
