Seqanswers Leaderboard Ad

**rhinoceros** · 08-29-2013, 12:38 PM

What about bowtie2 against the human genome? They even have prebuilt indexes available. Blastn of over 100M reads against nt sounds rather wasteful use of computing resources..

**ssully** · 08-29-2013, 12:46 PM

Yes, it's far from good. But that's how many were left in our metagenomic set after filtering out short reads, duplicate reads, and (via BMTagger) human reads. So we''d like to try a better human read remover, to help insure that the final read set for downstream analysis (e.g. blastn) is all nonhuman. And smaller.

**rhinoceros** · 08-29-2013, 12:49 PM

Originally posted by ssully View Post

Yes, it's far from good. But that's how many were left in our metagenomic set after filtering out short reads, duplicate reads, and (via BMTagger) human reads. So we''d like to try a better human read remover, to help insure that the final read set for downstream analysis (e.g. blastn) is all nonhuman. And smaller.

If I were you, I'd do trimming, bowtie2 against the human genome, assembly, and then blasts. Although for certain things like species distribution, assembly tends to introduce rather big bias (in my experience it increases the apparent presence of the most common taxa).

p.s. If you have human reads, you probably have other contaminants too, like bacteria from human skin among other stuff. Keep that in mind especially if your contamination rate is high..

**ssully** · 08-29-2013, 01:39 PM

We don't want to do assembly, because our main goal is to interrogate the diversity of taxa in our samples. We've done quality score filtering, length filtering, adapter trimming, duplicate removal - more vigorous quality trimming may be detrimental to uncovering diversity according to this study

We are studying a surface microbiome that humans interact with, so we don't mind skin bacteria; we want to catalog those, as well as any eukaryotic seqs. We don't even 'mind' the human sequences, it's just that their numbers make the seq files very large, so we want to split them out and treat human/nonhuman sets separately.

**GenoMax** · 08-29-2013, 04:58 PM

Perhaps one of these would be useful:

http://edwards.sdsu.edu/labsite/index.php/robert/301-how-to-remove-human-dna-sequence-contamination-from-metagenomes

Site not found · DreamHost

http://clovr.org/hmp-dacc/hmp-dacc-clovr-human-contaminant-screening-walkthrough/

The owner of this domain has not yet uploaded their website.

**lac302** · 09-27-2013, 09:06 AM

deconseq?...i haven't used it for anything larger than microbial genomes, but it works fairly well.

**leopal** · 10-24-2013, 02:37 AM

Great help!

Originally posted by GenoMax View Post

Perhaps one of these would be useful:

http://edwards.sdsu.edu/labsite/index.php/robert/301-how-to-remove-human-dna-sequence-contamination-from-metagenomes

http://clovr.org/hmp-dacc/hmp-dacc-c...g-walkthrough/

These links are quite good options!

Thanks!

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 49 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 50 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 43 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 55 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

filtering out human seqs from metagenomic reads

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News