Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Great discussion and thanks for sharing the plots. In short, Nextseq generates higher error rate in variant calling. This is often overlooked when buying the machine. It poses great challenges for bioinformaticians.
-
Originally posted by dsobral View PostJust to add our 2 cents.
We have a MiSeq running since 2013, and after some hickups we're now stable with it and reasonably happy.
We just recently installed a NextSeq500 and our first tests are not great. Q30 is >80%, but there are many low quality bases (constantly Q=14 "/"), and the worst part is that instead of being towards the end, they seem a bit randomly distributed. When comparing PhiX in a 2x150bp NextSeq with a 2x250bp MiSeq, after alignment I see a 0.2-0.3% error rate with MiSeq and 0.9-1% error rate with NextSeq (1M sampled reads). In the "randomly" distributed Q=14 bases I seem to notice more A to T transitions, but I didn't have time to gather more systematic statistics... If I do quality trim on the MiSeq I can easily get higher quality data, with the NextSeq since its randomly distributed is harder...
We've complained to the Illumina people, let's see what they say...
Were you/anyone else ever able to resolve this issue with random Q14 scores throughout the length of the read? We currently see the same in our data.
Our core recently mothballed our HiSeq2500 due to outrageously high service contract costs. We have converted to a NextSeq500 running NextSeq control software v2.2.0.4, RTA v2.4.11. We are running Blue Pippin selected multiplexed Nextera XT libraries (we are an environmental microbiology lab so we can have very low nucleic acid yields so we stick with XT) for 300 cycle PE High output runs . I'm told we can get around this random low quality issue for metagenome assembly as long as the quality scores of the other bases are true and we sequence deep enough but we are also interested in SNP analysis on some of our other projects and this would be problematic (I don't do these analyses myself-I'm on the front end of library making and sequencing). Our MiSeq is also a bit aged and misbehaving so currently on-site we only have access to instruments running two-dye chemistry. One problem that we seem to see consistently on our NextSeq besides the data quality (now for 3 runs) is that the green channel intensity is consistently low. Illumina tells me that this is not good and could be the camera. My core tells me that this issue only occurs on my runs (they seem to blame the Nextera XT). Any help/insight would be appreciated.
Leave a comment:
-
Originally posted by bini View PostHi,
I recently had an error message during a run " Camera 5 lane 3 bottom surface disabled: Failed to detect Clusters" " Camera 6 lane 3 bottom surface disabled: Failed to detect Clusters"and so on for multiple cameras. Does this mean there is something wrong with my cameras?
Please advise if you have come across similar error. Thanks
Leave a comment:
-
Next Seq Error
Hi,
I recently had an error message during a run " Camera 5 lane 3 bottom surface disabled: Failed to detect Clusters" " Camera 6 lane 3 bottom surface disabled: Failed to detect Clusters"and so on for multiple cameras. Does this mean there is something wrong with my cameras?
Please advise if you have come across similar error. Thanks
Leave a comment:
-
Originally posted by Brian Bushnell View PostUnfortunately, Illumina's taken a turn for the worse again. I just analyzed some recent data from the NextSeq, HiSeq2500, and HiSeq 1T platforms of the same library. The NextSeq data is dramatically worse than last time I looked at it. Error rates are several times higher, there's a major A/T base frequency divergence in read 2, and the quality scores are inflated again at ~6 points higher than the actual quality. More disturbingly, the HiSeq quality scores are completely inaccurate now, as well, though the actual measured quality is still very high - average Q33 for read 1 and Q29 for read 2 for HiSeq2500, versus Q24 for read 1 and Q18 for read 2 on the NextSeq (those numbers are as measured by counting the match/mismatch rates from mapping, so essentially, NextSeq has roughly 10X the error rate of HiSeq). But the measured discrepancy between claimed and measured quality scores for the HiSeq2500 and HiSeq 1T are BOTH worse than the NextSeq, despite the NextSeq having binned quality scores, and as you can see there are large regions of quality scores simply missing from the HiSeq2500, such as Q3-Q11, Q17-Q21, and Q29. There are clearly major problems with Illumina's current base-calling software, as quality score assignment has drastically regressed since last time I measured it.
You can see the graphs in this Excel sheet that I've linked. "Raw" is the raw data, "Recal" is after recalibration (which changes the quality scores but nothing else). "NS" is NextSeq, "2500" is HiSeq2500, and "1T" is HiSeq 1T which unfortunately was only run at 2x101bp instead of 2x151bp on the other 2 platforms.
https://drive.google.com/file/d/0B3l...ew?usp=sharing
Leave a comment:
-
Originally posted by alexhaj View PostGood work Brian! I am skeptical about your claim that NextSeq has less crossover than HiSeq or Miseq however.
Would you please provide some data to back this up.
And if that is the case maybe it's simply because demultiplexing is being done by CASAVA v2 on NextSeq and CASAVA v1 on HiSeq and Miseq. What if you did demultiplexing yourself, taking into account Quality scores (which I assume is not typically done).
Leave a comment:
-
Good work Brian! I am skeptical about your claim that NextSeq has less crossover than HiSeq or Miseq however. Would you please provide some data to back this up. And if that is the case maybe it's simply because demultiplexing is being done by CASAVA v2 on NextSeq and CASAVA v1 on HiSeq and Miseq. What if you did demultiplexing yourself, taking into account Quality scores (which I assume is not typically done).
Leave a comment:
-
Originally posted by dsobral View PostWe've complained to the Illumina people, let's see what they say...
Leave a comment:
-
Just to add our 2 cents.
We have a MiSeq running since 2013, and after some hickups we're now stable with it and reasonably happy.
We just recently installed a NextSeq500 and our first tests are not great. Q30 is >80%, but there are many low quality bases (constantly Q=14 "/"), and the worst part is that instead of being towards the end, they seem a bit randomly distributed. When comparing PhiX in a 2x150bp NextSeq with a 2x250bp MiSeq, after alignment I see a 0.2-0.3% error rate with MiSeq and 0.9-1% error rate with NextSeq (1M sampled reads). In the "randomly" distributed Q=14 bases I seem to notice more A to T transitions, but I didn't have time to gather more systematic statistics... If I do quality trim on the MiSeq I can easily get higher quality data, with the NextSeq since its randomly distributed is harder...
We've complained to the Illumina people, let's see what they say...
Leave a comment:
-
NextSeq suitable for allele-specific analysis?
Originally posted by Brian Bushnell View PostIf the cost is reasonably similar, I'd go with HiSeq 2000. But perhaps you can get a sample of NextSeq data for your project, on a library you already ran on the HiSeq, to compare without committing yourself?
Leave a comment:
-
If the cost is reasonably similar, I'd go with HiSeq 2000. But perhaps you can get a sample of NextSeq data for your project, on a library you already ran on the HiSeq, to compare without committing yourself?
Leave a comment:
-
NextSeq suitable for allele-specific analysis?
Do you think that NextSeq would be suitable for allele-specific analysis? I am using mouse cells with hybrid genome and sort the reads belonging to different alleles based on SNP content. So far I was using HiSeq2000 which worked well. With NextSeq I would get the data several times faster but having read this whole thread I am not sure whether the NextSeq data quality will be good enough.
Leave a comment:
-
Originally posted by AllSeq View PostThe NextSeq uses 2 color chemistry, but the 3000/4000 uses the 'standard' 4 color chemistry (with patterned flow cells, just like the X).
Leave a comment:
-
The NextSeq uses 2 color chemistry, but the 3000/4000 uses the 'standard' 4 color chemistry (with patterned flow cells, just like the X).
Leave a comment:
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Yesterday, 11:49 AM
|
0 responses
15 views
0 likes
|
Last Post
by seqadmin
Yesterday, 11:49 AM
|
||
Started by seqadmin, 04-24-2024, 08:47 AM
|
0 responses
16 views
0 likes
|
Last Post
by seqadmin
04-24-2024, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
61 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
60 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
Leave a comment: