I have final gene expression data (RNA Sequencing), four treatments in duplicate. I want to analyze this data for cumulative frequency distribution and heatmaps. How should I proceed? This is first time I am dealing with such a data.
Unconfigured Ad
Collapse
X
-
Hi scifiction,
If you are comfortable in R, you can use DESeq2 package and perform differential expression analysis. In that case, this link might help you: https://bioconductor.org/packages/re...doc/DESeq2.pdf
There is another web-server designed for small RNA data analysis. But I guess you can run differential gene expression analysis for your count data under "DE Analysis" tab. Here is the link: https://oasis.dzne.de/small_rna_de.php
It will give you heatmap and other diagnostic plots.Persistent LABS
-
-
assembly vs mapping
Hi guys,
I'm new here! I'm a PhD student engaged in a transcriptomic project. I have one control sample and four different stressed, all sequenced using HiSeq2000, without replicates. At a first moment, my tutor submitted the analysis to a ver big and famous company, requesting de novo assembly and differential expression analysis. The assembly was not good for me, because there was a low number of assembled transcripts (686, while actually there are 1936 annotated genes) and a very low number of DE genes was found. So, I tried to perform again the analysis using Trinity, as the company done, and I couldn't assemble the transcriptome: I found more than 15.000 trasncripts and many were too short. Furthermore, when I performed a blast against all NCBI database, I found many and many contaminants (i.e. human sequences). So, I tried with other programs (i.e. TransAByss, Velvet-Oases and Rockhopper) but they didn't work correctly. So I leave my tutor's idea of de novo assembly, because the genome of this bacterial strain became available in NCBI and its annotation too. I mapped the reads with STAR using its own genome; I was surprised because the percentages of mapped reads were low. I performed counting with FeatureCounts and did differential analysis with NOISeq (due to the absence of replicates in the experimental design). At a first moment, I used all mapped reads, but I was in a saturation condition, so I found the right compromise using only 10 million of mapped reads from each bam file. About one million resulted unassigned (NoFeatures).
Finally, I found many DE genes, but I'm frustrated because my results are the opposite of company's results. I did inspect them by an experienced researcher and he told me that I performed the analysis properly.
What should I think?What should I do? I quarrel with my tutor every day and I'm about to leave the PhD course because the situation is unbearable. Please help me
Comment
-
-
If your replicates are not so good, you can use edgeR that is more robust.
I ask another type of help: I performed RNA-seq analysis of bacterial transcriptome in four different stressed conditions, mapping the reads on its own genome available in NCBI. Then I used FeatureCounts for reads counting and finally I performed differential analysis with NOISeq R package, because of the absence of replicates.
Before that, my tutor submitted the analysis to a famous company requiring a de novo assembly (they used the trinity pipeline for assembly and differential analysis).
I used the same fastq files, and finally I found a larger number of DE genes, but my results are the opposite of company's results. How is it possible? I know that mapping is better than assembly when a reference genome is available and above all I know that the trinity pipeline have some problems for differential analysis, because it uses DESeq or edgeR after quantification by RSEM.
What do you think about?
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.
Here are nine questions we think about, in roughly the order they matter, before...-
Channel: Articles
06-18-2026, 07:11 AM -
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population
by SEQadmin2
Started by SEQadmin2, 06-17-2026, 06:09 AM
|
0 responses
33 views
0 reactions
|
Last Post
by SEQadmin2
06-17-2026, 06:09 AM
|
||
|
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism
by SEQadmin2
Started by SEQadmin2, 06-09-2026, 11:58 AM
|
0 responses
97 views
0 reactions
|
Last Post
by SEQadmin2
06-09-2026, 11:58 AM
|
||
|
Started by SEQadmin2, 06-05-2026, 10:09 AM
|
0 responses
117 views
0 reactions
|
Last Post
by SEQadmin2
06-05-2026, 10:09 AM
|
||
|
Started by SEQadmin2, 06-04-2026, 08:59 AM
|
0 responses
111 views
0 reactions
|
Last Post
by SEQadmin2
06-04-2026, 08:59 AM
|
Comment