Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
BBMap requires a reference; all of those histograms (other than bhist) are based on mapping. But it doesn't have to be a good assembly. If you have sufficient coverage, a quick draft assembly is adequate for mapping to generate that QC data.
-
^Is it possible to do quality control without a reference with bbmap?
I have some Ion Xpress read data. I have been told the quality control has been done, but when just checked through FastQC, it seems really bad at the end. But then again its my first time working with Ion Torrent/PGM data.
Thank you in advance for the help.Attached Files
Leave a comment:
-
I uploaded a new version of BBMap that can tell you the observed quality for claimed qualities, assuming you have a reference. You run it like this:
bbmap.sh ref=x.fasta in=reads.fastq nodisk qahist=qahist.txt
This will give 7 columns:
quality, match, sub, insertion, deletion, observed quality, observed quality (subs only)
Since deletions occur between bases, I add deletion events to bases neighboring the deletion. That would over-represent deletions, so all the other columns are multiplied by 2 to compensate. Anyway, if you plot the first column against the 6th column, it will tell you how the observed error rates correlate with quality scores.
You can get QC-relevant additional histograms with these flags:
bhist=bhist.txt (base composition by read position)
qhist=qhist.txt (average quality by read position)
mhist=mhist.txt (match, substitution, insertion, deletion events by read position)
ihist=ihist.txt (insert size distribution)
...and if you want the actual mapped reads, just add "out=mapped.sam".
Leave a comment:
-
Originally posted by super0925 View PostMy work is RNA-seq.
So the trimming step sould be 'gentle' or even not which is suggested by some experts in SeqAnswers and Biostars. Am I right?
Leave a comment:
-
Originally posted by GenoMax View PostI do not think it would be possible to equate Q-scores from Illumina/Ion by a formula because they are not using the same predictors.
Some recent studies seem to suggest that trimming may not be needed unless adapters are present or the raw qualities are poor. I would suggest doing some analysis without trimming to see what happens. BTW, What kind of analysis are you doing (re-seq, RNA-seq etc)?
So the trimming step sould be 'gentle' or even not which is suggested by some experts in SeqAnswers and Biostars. Am I right?
Leave a comment:
-
I do not think it would be possible to equate Q-scores from Illumina/Ion by a formula because they are not using the same predictors.
Some recent studies seem to suggest that trimming may not be needed unless adapters are present or the raw qualities are poor. I would suggest doing some analysis without trimming to see what happens. BTW, What kind of analysis are you doing (re-seq, RNA-seq etc)?
Leave a comment:
-
Originally posted by GenoMax View Post@super0925: There is a technical note available on the ioncommunity site that describes six Quality Score predictors (and some additional ones that are apparently not listed) used for ion PGM. I did not see one specifically for proton but the one for PGM may work.
You should be able to create a free account on ioncommunity site and search for that document.
The Quality Score in Proton is based on the 6 predictors which are different from Illumina. Am I right?
But I am still confused that the relationship between Proton Q score and Illumina Q score, like Q=22 Proton related to Q=30 MiSeq.
So far I could only directly judge that OK Q>22 is good reads and Q<22 is required for trimming...... Am I right?
Leave a comment:
-
@super0925: There is a technical note available on the ioncommunity site that describes six Quality Score predictors (and some additional ones that are apparently not listed) used for ion PGM. I did not see one specifically for proton but the one for PGM may work.
You should be able to create a free account on ioncommunity site and search for that document.
Leave a comment:
-
I think the error model for the IonProton would be more like that for 454, and involve homopolymer indels rather than substitutions.
Leave a comment:
-
Originally posted by snetmcom View PostThese are raw quality scores, so only the manufacturers know how they are calculated. That is why platform comparisons at this level are not always the best method.
I agree with you. It is hard to say which machine (HiSeq/MiSeq/Proton) is better from QC score but I want to know the equivalent quality score in Proton .
Like nbahlis said
Illumina score 30 corresponding to Proton QC score 22
So how about other score e.g. 20, 40 ?
Leave a comment:
-
Originally posted by super0925 View PostThank you!!! But I want to know the reason and why .
e.g. How do they calculate the QC score? why lower than Illumina (is that different equation?)
and, how to compare two measurement ?e,g, is that Illumina score -8 =Proton score?
P.S. This only works if the error model is almost entirely substitutions. If IonTorrent reads have indels then it requires a bit more effort. I'm never used IonTorrent data.Attached FilesLast edited by Brian Bushnell; 04-15-2014, 04:07 PM.
Leave a comment:
-
Originally posted by super0925 View PostThank you!!! But I want to know the reason and why .
e.g. How do they calculate the QC score? why lower than Illumina (is that different equation?)
and, how to compare two measurement ?e,g, is that Illumina score -8 =Proton score?
Leave a comment:
-
Originally posted by nbahlis View PostThat is typical with the proton. We are told by lifetech that a22-24 score on the proton equals the 30 on the illumina platform. It has to do apparently withe the proton more conservative score calling.
e.g. How do they calculate the QC score? why lower than Illumina (is that different equation?)
and, how to compare two measurement ?e,g, is that Illumina score -8 =Proton score?Last edited by super0925; 04-15-2014, 05:06 AM.
Leave a comment:
-
That is typical with the proton. We are told by lifetech that a22-24 score on the proton equals the 30 on the illumina platform. It has to do apparently withe the proton more conservative score calling.
Leave a comment:
-
QC control
Hi all
Does proton has its own supported QC control measurement?
My reads quality looks very strange (I get this result by FASTQC software which is Illumina-based).
In Illumina we know that QC= -10*log10(p)>30 is good reads but Proton seems strange.
Does anybody know that how to measure it in Proton? Thank you!Attached FilesLast edited by super0925; 04-15-2014, 03:34 AM.Tags: None
Latest Articles
Collapse
-
by seqadmin
The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...-
Channel: Articles
07-08-2024, 03:19 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Yesterday, 06:46 AM
|
0 responses
9 views
0 likes
|
Last Post
by seqadmin
Yesterday, 06:46 AM
|
||
Started by seqadmin, 07-24-2024, 11:09 AM
|
0 responses
25 views
0 likes
|
Last Post
by seqadmin
07-24-2024, 11:09 AM
|
||
Started by seqadmin, 07-19-2024, 07:20 AM
|
0 responses
159 views
0 likes
|
Last Post
by seqadmin
07-19-2024, 07:20 AM
|
||
Started by seqadmin, 07-16-2024, 05:49 AM
|
0 responses
127 views
0 likes
|
Last Post
by seqadmin
07-16-2024, 05:49 AM
|
Leave a comment: