Seqanswers Leaderboard Ad

**lh3** · 03-31-2009, 12:41 AM

Thanks for this. I am always fancinated by slider. I guess this is the first SNP caller that explicitly use four quality values. James Bonfield and Mark Daly both believe and show some preliminary result that using four values leads to better SNP calls. Some comments on the figures at your website:

1. It is interesting to see you also come to the point of using known allele frequency as a prior, the same as BGI's SNP caller. When I did SNP calling for that NA18507, I also suggested this, but all the rest of people said it is cheating somehow and rejected my suggestion. They more like to think there are two problems: SNP discovery and genotyping. For SNP discovery, we only use a flat prior and for genotyping, we use the allele frequency.

2. How Slider detect paralogous regions? To detect CNV first and then filter out the SNPs in CNVs? I agree that setting maximum depth as is used by maq is not a good way.

3. I am not sure if I read your paper properly. As I understand, only one mutation (not sequencing errors) is allowed on one read. Is that right?

**bosTau2** · 03-31-2009, 05:53 AM

step by step

I checked http://www.bcgsc.ca/platform/bioinfo/software/SliderII
and think it does alignment by steps.

# Alignment.Java: Find read locations on the reference sequence with an exact match and one-off match (one base mismatch) to prb derived sequences.
# Extend.java: Expand reads to include up to 3 mismatches

**bioinfosm** · 03-31-2009, 06:58 AM

Any insight on how slider results compare to MAQ SNP calling on single/paired data?

Originally posted by lh3 View Post

Thanks for this. I am always fancinated by slider. I guess this is the first SNP caller that explicitly use four quality values. James Bonfield and Mark Daly both believe and show some preliminary result that using four values leads to better SNP calls. Some comments on the figures at your website:

1. It is interesting to see you also come to the point of using known allele frequency as a prior, the same as BGI's SNP caller. When I did SNP calling for that NA18507, I also suggested this, but all the rest of people said it is cheating somehow and rejected my suggestion. They more like to think there are two problems: SNP discovery and genotyping. For SNP discovery, we only use a flat prior and for genotyping, we use the allele frequency.

2. How Slider detect paralogous regions? To detect CNV first and then filter out the SNPs in CNVs? I agree that setting maximum depth as is used by maq is not a good way.

3. I am not sure if I read your paper properly. As I understand, only one mutation (not sequencing errors) is allowed on one read. Is that right?

**nmalhis** · 03-31-2009, 01:32 PM

Yes, just check the link:

SliderII | Genome Sciences Centre

http://www.bcgsc.ca/platform/bioinfo/software/SliderII

High quality SNP calling using Illumina data at minimal coverage

N.

**nmalhis** · 03-31-2009, 01:40 PM

- Regarding paralogous, Slider identify paralogous SNPs (and contig edge SNPs) as they are likely to be at the edges of the reads.
- Yes, Slider (and SliderII) allows up to one mutation, plus, it consider all possible bases in prb data, and when using PET reads, SliderII force align reads if other side is aligned.

Nawar

**lh3** · 04-01-2009, 12:52 AM

2. Do you mean you exclude SNPs towards the ends of a read? These are the false SNPs caused by indels. A better strategy would be to filter out SNPs close to predicted indels.

3. Sorry that I did not read through the whole page. I now realize that this is a seeding-extension algorithm. You allow maximum one mutation in the seed but may extend the seed to allow more. By the way, the page said "the smaller the seed size is, the faster the alignment will be". Is this a typo?

**nmalhis** · 06-26-2009, 08:06 PM

> Hi,
>
> I am very interested in your SNP Caller SilderII. I am trying to use it. I have one question for you. What's the meaning of SNP_in in the config file? I didn't find the explanation for the item from sliderII website.
>
> Thank you very much.
>
> Rebecca

SNP_in is the expected number of bases in the reference genome for each one SNP, for the human genome, this number should be 1000.

Nawar

**Nix** · 08-24-2009, 12:54 PM

What coordinate system are you using for generating a table of known snps to feed into SliderII. Is this 1 based? or 0 based? Does anyone have a table for mouse 2007, mm9?

**nmalhis** · 08-26-2009, 04:54 PM

Hi Nix,

I used the Ensembl Variation database (version 50) SNPs.
You need to adjust the format.

Nawar

**pengchy** · 10-09-2011, 11:49 PM

Is SliderII's paper published?
And the picture in this link can not be displayed

SliderII | Genome Sciences Centre

http://www.bcgsc.ca/platform/bioinfo/software/SliderII

High quality SNP calling using Illumina data at minimal coverage

Topics	Statistics	Last Post
Study Reveals How Bacteria Defend Against Viral Attacks by seqadmin Started by seqadmin, 08-27-2024, 04:40 AM	0 responses 16 views 0 likes	Last Post by seqadmin 08-27-2024, 04:40 AM
New Single-Molecule Sequencing Platform Introduces Advanced Features for High-Throughput Genomics by seqadmin Started by seqadmin, 08-22-2024, 05:00 AM	0 responses 293 views 0 likes	Last Post by seqadmin 08-22-2024, 05:00 AM
New DNA Code Discovered Revealing Complex Gene Regulation Mechanisms by seqadmin Started by seqadmin, 08-21-2024, 10:49 AM	0 responses 135 views 0 likes	Last Post by seqadmin 08-21-2024, 10:49 AM
Epigenetic Clocks Derived from Retroelements Offer New Insights into Aging by seqadmin Started by seqadmin, 08-19-2024, 05:12 AM	0 responses 124 views 0 likes	Last Post by seqadmin 08-19-2024, 05:12 AM

Seqanswers Leaderboard Ad

Announcement

SliderII: High Quality SNP Calling Using Illumina Data at Shallow Coverage

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News