How to combine reads from different samples after pipeline?

julia_

Junior Member

Join Date: Apr 2015

Posts: 1
#1


10-12-2016, 04:33 AM

Hi everyone,

This is more of a statistical than a bioinformatics problem, but I thought this was the best forum to post it in, since most people here know their way around sequencing data.

My project:
I want to look at the nasal bacterial composition of cow farmers and cows and see how strongly the nasal microbiome of a cow farmer is influenced by contact with cows (compared to a non-exposed control group).

My samples:
Nasal swabs from cow farmers and cows:
Number of farms: 30
Two cows sampled per farm.
Varying number of farmers sampled on each farm: range, 1-4

What I did so far:
I performed amplicon sequencing of the 16S V4 region.
Platform: Illumina MiSeq 2*250 bp
Pipeline: mothur

What I have now:
A shared file with the number of reads per OTU, covering 550 OTUs and 300 samples in total.

My problem:
My PI insists on combining the farmers from each farm into one "meta-farmer" whenever I have more than one farmer per farm, and on creating one "meta-cow" from the two cows I have per farm.
He says it only makes sense to look at beta diversity with these pooled samples. So far I have not been able to talk him out of wanting to see the pooled data.
I have a bad feeling about pooling different samples because, in my opinion, it distorts the results.
My PI's suggestion is to take all the farmers from one farm, add up the reads for each OTU, and then divide that read number by the number of farmers (i.e. take the mean number of reads per OTU).
However, I think this will leave me with a much higher richness than those farmers actually have.
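To make my worry concrete, here is a toy sketch in Python of the mean-pooling my PI proposes. The two farmers, the OTU names and the read counts are all made up for illustration (they are not from my shared file), and "richness" here just means the number of OTUs with at least one read.

```python
# Toy example with hypothetical read counts, not data from this study.

def pool_by_mean(samples):
    """Average the read counts per OTU across samples (the proposed pooling)."""
    n = len(samples)
    otus = sorted(set().union(*samples))  # every OTU seen in any sample
    return {otu: sum(s.get(otu, 0) for s in samples) / n for otu in otus}

def richness(sample):
    """Observed richness: number of OTUs with a non-zero count."""
    return sum(1 for count in sample.values() if count > 0)

# Two hypothetical farmers from the same farm.
farmer_a = {"Otu001": 120, "Otu002": 30, "Otu003": 0, "Otu004": 0}
farmer_b = {"Otu001": 80, "Otu002": 0, "Otu003": 45, "Otu004": 5}

meta_farmer = pool_by_mean([farmer_a, farmer_b])
individual_richness = [richness(farmer_a), richness(farmer_b)]
pooled_richness = richness(meta_farmer)

print(individual_richness, pooled_richness)  # [2, 3] 4
```

In this made-up case the meta-farmer carries every OTU that either farmer has, so its richness (4) is higher than that of either individual (2 and 3). The averaged counts are also no longer integers, which some downstream tools may not accept.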

My questions:
Does anyone have a good (statistical) reason why it is wrong to pool your samples like that? (something that convinces PIs)

Does anyone know a publication where samples have been pooled after sequencing? (Usually they get pooled right in the beginning)

How should I combine those samples instead if taking the mean number of reads is not the way to do it?

Thanks, everyone, for reading this far. I greatly appreciate any input.
Tags: statistical analysis
