Galaxy problem

fran.vitiello

Junior Member

Join Date: Aug 2012

Posts: 1
- Share
- Tweet
#1

Galaxy problem

10-05-2012, 06:03 AM

Dear all,
I' really new to the use of Galaxy, NGS analysis and SNP calling.
I have a problem.
Now I describe you my work step-by-step.
I have two fasta sequence one forward and one reverse.
I check with FastQC, result all ok.
I groomed them with FastQ grommer, all ok.
I map them whit BWA, all ok (bowtie gives me error).
After that I apply Filter SAM:
Input Parameter Value
Select dataset to filter -->10: Map with BWA for Illumina on data 8 and data 7: mapped reads
Type --> Read is paired; Set the states for this flag --> Yes
Type --> Read is mapped in a proper pair; Set the states for this flag --> Yes
Type --> The read is unmapped; Set the states for this flag --> No
All ok.
I convert SAM to BAM, no problem.

After that I generate pileup:
I leave all the setting as usual, I change only: Call consensus according to MAQ model? yes
Result: all ok.

Next: Filter pileup,
Input Parameter Value
Select dataset--> 13: Generate pileup on data 12: converted pileup
which contains--> ten
Do not consider read bases with quality lower than--> 20
Do not report positions with coverage lower than--> 3
Only report variants?--> Yes
Convert coordinates to intervals?--> Yes
Print total number of differences?--> No
Print quality and base string? --> No
Result: all ok.
The resulting data set contain about 2200 SNPs
At least I compare my result data set whit dbSNP132.txt and 1kg.lc.2010_7.CEU.liftedhg19.pgSnp.
The resulting data set contain 700 SNPs.

Now the question:
The problem is that I must have 200 SNPs and no 700....
I have check the resutls and I saw that some result are duplicated for all parameters but had a different rs ID.

how can I remove the wrong one???

please help me.

some one I heard about SOAPsnp but I don't know how to use. Could anyone please help me.

Thx Francesco
Tags: None

Previous template Next

Essential Discoveries and Tools in Epitranscriptomics

by seqadmin

The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
- Channel: Articles
04-22-2024, 07:01 AM
Current Approaches to Protein Sequencing

by seqadmin

Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
- Channel: Articles
04-04-2024, 04:25 PM

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, 04-25-2024, 11:49 AM	0 responses 19 views 0 likes	Last Post by seqadmin 04-25-2024, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 18 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 62 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Latest Articles

ad_right_rmr

News