Couple of questions regarding read data

kwatts59

Member

Join Date: Apr 2011

Posts: 46
- Share
- Tweet
#1

Couple of questions regarding read data

06-28-2011, 04:45 PM

Hello all.
I have a couple of basic questions regarding threshold values and standard deviation.

I have a fastq file of about 40million 50bp single end reads that I aligned to a genome via bowtie. Using the sam file generated from bowtie and the gff file for the genome, I wrote a PERL script to count how many reads aligned to each gene on the genome.
Question 1, what is the minimum number of reads that must map to a gene before I can say the gene is "expressed"? What is the threshold?

I have a gene of interest that lets say has 100 reads aligned to it. Due to cost constraints, I cannot run the same sample multiple times to calculate the standard deviation.
Question 2, what is the approximate standard deviation of those reads? Is there some quick calculation I could perform to estimate the standard deviation?

Thanks in advance.
Tags: None

Previous template Next

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 62 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad