Seqanswers Leaderboard Ad

**koadman** · 01-31-2012, 11:54 PM

I love it. The good folks at Life Tech may have in fact helped to make analysis pipelines for MiSeq better by publicizing a bias that is probably much more fixable than the homopolymer issues on their own platform. Keep up the good work Ion.

**TonyBrooks** · 02-01-2012, 03:36 AM

Surely as this is strand specific it's not too big a problem. You just need to be sure that any SNP is visible in both forward and reverse reads. If it's only seen in reads from one direction, then you should ignore it, treat it with caution or at least give it a really low mappability score) - something I think most aligners do (correct me if I'm wrong).

The only problem is if you had a single base flanked by homopolymers in both directions. Then the base would be miscalled on both strands.

**ulz_peter** · 02-01-2012, 03:59 AM

Is someone else also getting tired of companies trying to prove the weaknesses of the opponent rather than focussing on their own system?

**sinaian** · 02-01-2012, 10:02 AM

So ONLY NOW someone finally realizes weakness of opponent is not a proper subject? How convenient is the timing ...

BTW, trading sensitivity for specificity is always a great solution.

**SeqAA** · 02-01-2012, 12:46 PM

I wonder if this is related to the fast chemistry times of Illumina's newest platforms? Seems odd such a prevalent error profile would go missed.

**lh3** · 02-01-2012, 07:14 PM

Let me discard the previous post.

IonTorrent is finding something real. However, I think this is not caused by homopolymer run, at least not mainly caused by that, but by the "GGC" and/or the invert repeat artifact [PMID:21576222]. This region is particularly enriched with GGC on both forward and backward strands. In addition, the screenshot is exaggerating the Illumina problem a little bit: they disabled shading in IGV; the majority of mismatches have quality below 10 and are barely visible under the IGV default setting. Some mismatches do get Q20 recurrently, which is worrying.

**snetmcom** · 04-29-2012, 09:17 PM

Originally posted by sinaian View Post

So ONLY NOW someone finally realizes weakness of opponent is not a proper subject? How convenient is the timing ...

BTW, trading sensitivity for specificity is always a great solution.

just poking through their documentation, there are several publications that have found this before.

**pmiguel** · 04-30-2012, 03:58 AM

Originally posted by snetmcom View Post

just poking through their documentation, there are several publications that have found this before.

Yes, I think MIRA creator, Bastien Chevreux, noticed it first -- and changed MIRA to compensate for the Illumina GGCxG issue. Bizarre Illumina has not fixed it themselves, but there are a handful of issues Illumina seems blind to.

--
Phillip

**alanwan** · 05-20-2012, 09:16 PM

The system bias indeed exists. But it is usually very small - no more than 1/1000 detected SNVs are caused by system errors. Therefore few people realize it.

However it is fatal to rare disease causal novel SNP detection, because system errors occur randomly to the whole genome, and since the known SNPs occupy only 1/100 (db135 ~30M/3G) of the genome base positions, most of the errors SNVs exist in novel sites. That leads to a high false positive rate in your novel SNPs.

This problem could be far more worse if you want to find common novel SNPs in size>=3 population samples. Actually we found a terrible FPR (>98%) in detected common novel SNVs of a whole exome sequencing project (family samples, size=3, sequence generated by one GAII) in 2010. However, it is important to note that not all our Illumina sequence data have such a high error rate.

In my observation, the proceeding homopolymer leads to most of the false positives,while GGC problem is light. I think it may depend on sample properties and other factors.

As you guys may already find, there have been many articles introducing methods to solve the system bias problems of the NGS instruments, such as GATK variants calibration, VarScan, CRISP, SERVIC4E, and etc. Unfortunately there is no common conclusion that which method provides the best solution. No offense, I personally had bad experience with GATK's old versions, which crashed again and again and was too picky to my BAM files exported by other aligner. I did not try other tools yet, and I am still using my own scripts to filter the false positives.

**james hadfield** · 05-21-2012, 01:56 AM

Is

Originally posted by sinaian View Post

weakness of opponent

a popular strategy in the US at the moment due to your 57th presidential election?

Bashing opponents always makes jucier headlines than demonstrating minor improvments to your own system. I would very much prefer to hear Ion discussing the very real improvments they have made. The technology has raced forward as fast as we hoped it would.

**sinaian** · 05-21-2012, 11:51 AM

Originally posted by james hadfield View Post

Bashing opponents always makes jucier headlines than demonstrating minor improvments to your own system. I would very much prefer to hear Ion discussing the very real improvments they have made. The technology has raced forward as fast as we hoped it would.

Fully agreed. But it is just intereting to compare the atmosphere when one party came out bashing the other, versus when the opponenent actually answers back.

**pmiguel** · 05-22-2012, 04:03 AM

Can anyone verify that this is the old "GGCxG" issue?

If so, I have my doubts that Illumina will address the issue on the basis of LifeTech pointing it out. Seems either to be firmly in their corporate blind spot or an intractable issue.

--
Phillip

**alanwan** · 05-22-2012, 04:31 AM

Originally posted by pmiguel View Post

Can anyone verify that this is the old "GGCxG" issue?

If so, I have my doubts that Illumina will address the issue on the basis of LifeTech pointing it out. Seems either to be firmly in their corporate blind spot or an intractable issue.

--
Phillip

This system bias problem probably can never be completely solved. But I believe new algorithms will help distinguish the error calls.

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, 04-25-2024, 11:49 AM	0 responses 19 views 0 likes	Last Post by seqadmin 04-25-2024, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 19 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 62 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Ion Torrent claims of MiSeq showing post-homopolymer substitution errors

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News