**Brian Bushnell** · 07-10-2014, 09:18 AM

As pseudogenes are not really conserved and thus have a high mutation rate, you might have better luck mapping pseudogene reads to their real counterparts with BBMap, as it has higher sensitivity. Of course, it depends on your goal - when you have a read from a pseudogene that is not part of the reference genome, do you want it to map to a real gene, or nowhere at all? If it maps to a real gene, that could cause false variation calls, but on the other hand, by examining them closely, you can determine that the reads originated from a pseudogene and thus improve the reference.

The way you can tell is that DNA reads originating from a pseudogene will map like RNA-seq reads (spanning introns), and the reads that span introns will typically have a high SNP rate, with the same SNPs present in all reads spanning the intron but in none of the reads that map to both the intron and the exon.
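As a rough illustration of that check, here is a minimal Python sketch. It assumes each read has already been reduced to a CIGAR string plus a list of reference positions where it mismatches; the function names, the `min_gap` threshold of 50, and the dictionary layout are all hypothetical, and it ignores coverage (it assumes the toy reads overlap the same positions). It treats a long deletion (`D`) or skipped region (`N`) in the CIGAR as "spanning an intron".

```python
import re

# One CIGAR element: a length followed by a SAM operation code.
CIGAR_RE = re.compile(r"(\d+)([MIDNSHP=X])")


def spans_gap(cigar, min_gap=50):
    """Return True if the alignment has a deletion (D) or skipped
    region (N) of at least min_gap bases (hypothetical threshold)."""
    return any(
        op in "DN" and int(length) >= min_gap
        for length, op in CIGAR_RE.findall(cigar)
    )


def pseudogene_signal(reads, min_gap=50):
    """Return mismatch positions shared by every gap-spanning read
    and absent from every read without such a gap.

    reads: list of dicts with keys "cigar" (str) and
    "mismatches" (iterable of reference positions).
    """
    gapped = [r for r in reads if spans_gap(r["cigar"], min_gap)]
    ungapped = [r for r in reads if not spans_gap(r["cigar"], min_gap)]
    if not gapped:
        return set()
    # SNPs present in all reads that span the intron-sized gap.
    shared = set.intersection(*(set(r["mismatches"]) for r in gapped))
    # SNPs seen in any read that does not span the gap.
    seen_elsewhere = set().union(*(set(r["mismatches"]) for r in ungapped))
    return shared - seen_elsewhere


# Toy data, invented purely for illustration.
toy_reads = [
    {"cigar": "40M900N60M", "mismatches": [105, 1010]},
    {"cigar": "30M900D70M", "mismatches": [105, 1010, 1040]},
    {"cigar": "100M", "mismatches": []},
]
candidate_snps = pseudogene_signal(toy_reads)
```

A non-empty result for a gene is only a hint that the gap-spanning reads share a distinct origin; real data would need coverage-aware comparison and base-quality filtering before drawing conclusions.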

**coryfunk** · 07-10-2014, 10:29 AM

Our goal is to better understand how several aligners handle the problem. Just as you say, BBMap, with its higher sensitivity, will behave differently than other aligners. We want to understand that difference and find the algorithm-specific parameters that adjust that sensitivity.

Our reason for this goal is that we've seen more than one paper point out that pseudogene reads, and the existence of pseudogenes in the genome (and transcriptome), are likely causes of errors in RNA-seq read counts. I haven't seen anyone actually show to what extent this is the case, and we'd like to understand the size of the effect on a genome scale.

Does that help to clarify our objectives?

**Brian Bushnell** · 07-10-2014, 11:10 AM

Ah, I understand better now. Possibly, as a negative control, references should be used in which all known pseudogenes are masked, under the assumption that they should not be expressed in RNA-seq data.
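For what it's worth, masking could be sketched roughly like this in Python. This is only an illustration: `mask_intervals` is a hypothetical helper, the coordinates are assumed to be 0-based and half-open (BED-style), and the sequence and intervals are made up. A real run would read the reference FASTA and a pseudogene annotation instead.

```python
def mask_intervals(sequence, intervals, mask_char="N"):
    """Return a copy of sequence with every 0-based, half-open
    [start, end) interval replaced by mask_char."""
    masked = list(sequence)
    for start, end in intervals:
        # Clip to the sequence bounds so out-of-range annotations are harmless.
        start = max(start, 0)
        end = min(end, len(masked))
        for i in range(start, end):
            masked[i] = mask_char
    return "".join(masked)


# Toy example: mask two made-up "pseudogene" intervals.
reference = "ACGTACGTAC"
masked_reference = mask_intervals(reference, [(2, 5), (8, 20)])
```

Mapping the same reads against the masked and unmasked references and comparing per-gene counts would then give an estimate of how many reads the pseudogenes were absorbing or contributing.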

Characterizing the problem of pseudogene reads in mapping and parameter tuning
