Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
With any RNA-seq data, you're almost always going to get rRNA contamination. It makes subsequent mapping quicker if you filter out the rRNA reads first (i.e. prior to any other mapping that is done).
-
To all,
I discovered my problem with my RNA-seq data. It looks like there was some rRNA contamination in my sample which accounted for 5% of the total reads in the sample. The rRNA genes are located on chromosome 9. There are also some regions on chromosome 2 that show high homology to the rRNA genes on chromosome 9. This is the cause of the reads overloading chromosomes 9 and 2.
The latest version of Tophat works fine.
If other people are having problems with reads overloading a chromosome, check to see if there is rRNA contamination.
Thanks all.
Leave a comment:
-
More problems with Tophat (v1.3.2)
I ran tophat version 1.3.2 to align my deep sequencing data to my reference genome via the following command
tophat --solexa1.3-quals –p 6 –o /control/8905X1/tophat /rice_index/rice_index /8905X1/8905X1.txt
It generated a file called accepted_hits.bam.
I converted the file to a bed file using the bamToBed tool.
Then split the file up based on chromosomes. Here are the results:
331723028 May 11 09:26 accepted_hits_bed_Chr1
119325730 May 11 09:26 accepted_hits_bed_Chr10
123994839 May 11 09:26 accepted_hits_bed_Chr11
121898474 May 11 09:27 accepted_hits_bed_Chr12
416137943 May 11 09:27 accepted_hits_bed_Chr2
334052893 May 11 09:27 accepted_hits_bed_Chr3
161529836 May 11 09:27 accepted_hits_bed_Chr4
347228298 May 11 09:28 accepted_hits_bed_Chr5
189465060 May 11 09:28 accepted_hits_bed_Chr6
200695493 May 11 09:28 accepted_hits_bed_Chr7
171194241 May 11 09:28 accepted_hits_bed_Chr8
759976461 May 11 09:29 accepted_hits_bed_Chr9
2184791 May 11 09:29 accepted_hits_bed_ChrSy
539606 May 11 09:29 accepted_hits_bed_ChrUn
Tophat overloaded chromosome 9 which is one of the smaller chromosomes.
Unless this can be resolved, I recommend not using Tophat.Tags: None
Latest Articles
Collapse
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Yesterday, 11:49 AM
|
0 responses
13 views
0 likes
|
Last Post
by seqadmin
Yesterday, 11:49 AM
|
||
Started by seqadmin, 04-24-2024, 08:47 AM
|
0 responses
16 views
0 likes
|
Last Post
by seqadmin
04-24-2024, 08:47 AM
|
||
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
61 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
60 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
Leave a comment: