Seqanswers Leaderboard Ad

**jimmybee** · 06-24-2013, 03:20 PM

What are you trying to do? Do you just want the cp and mtDNA?

Why do you say you have contaminants?

**hanshart** · 06-24-2013, 08:17 PM

Originally posted by Guigra View Post

... realized that there are contaminants in sequencing. How to remove them?

Hi Guigra,
without deeply understanding of your problem: If you know the type of contaminant you can always build an index of its corresponding genome/identifier sequences and map all reads to this index at first. The unmapped reads can than be used for the mapping against the genome. But I'm not sure if this is the answer you were looking for.

**Guigra** · 07-02-2013, 04:57 AM

Hi hanshart,

Is exactly what I want. How do I do that?

**GenoMax** · 07-02-2013, 06:16 AM

Making an index of the contaminats should be straightforward. Once you have the BAM files from those alignments you can recover the unmapped reads following the suggestions in these threads: http://seqanswers.com/forums/showthread.php?t=12283 and http://seqanswers.com/forums/showthread.php?t=30528

**hanshart** · 07-02-2013, 10:04 AM

An example for Bowtie:

1. Determine the sequences of your contaminants and write them to a FASTA file like

>seq1
ACGT...
>seq2
GCAG...

(or directly use an available FASTA describing your contaminants)

2. Build a bowtie-index of this FASTA file in a folder called IDX or so:

bowtie-build FASTA IDX/contaminants_idx

3. Map your reads (READFILE) against this contaminants reference and extract unmapped reads (=non-contaminants) to a fastq file (NO_CONTAMINANTS.fastq) directly with the --un flag:

bowtie --un NO_CONTAMINANTS.fastq IDX/contaminants_idx READFILE OUTPUTFILE

The reads in NO_CONTAMINANTS.fastq can finally be mapped against the reference of interest

A non-Bowtie way would be the same: 1. build index for contaminants, 2. map against this index, 3. extract unmapped reads from alignment file to a new fastq file and 4. use only those reads in the new file for the mapping against your reference.

**westerman** · 07-02-2013, 10:26 AM

As hanshart says, using bowtie (I actually use bowtie2) is a good -- and easy -- method.

**Guigra** · 07-03-2013, 08:04 AM

Thank you all. Were of great help!

Topics	Statistics	Last Post
Gene Misexpression in the Healthy Human Population by seqadmin Started by seqadmin, 07-25-2024, 06:46 AM	0 responses 9 views 0 likes	Last Post by seqadmin 07-25-2024, 06:46 AM
New Method for Rapid Genetic Diagnosis of Mendelian Disorders by seqadmin Started by seqadmin, 07-24-2024, 11:09 AM	0 responses 26 views 0 likes	Last Post by seqadmin 07-24-2024, 11:09 AM
Advancing Nanopore Technology for Portable Sensing Devices by seqadmin Started by seqadmin, 07-19-2024, 07:20 AM	0 responses 160 views 0 likes	Last Post by seqadmin 07-19-2024, 07:20 AM
New RNA-Based Gene Writing Technology Achieves Precise Gene Integration by seqadmin Started by seqadmin, 07-16-2024, 05:49 AM	0 responses 127 views 0 likes	Last Post by seqadmin 07-16-2024, 05:49 AM

Seqanswers Leaderboard Ad

Announcement

How to remove reads contaminants?

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News