
**NicoBxl** · 09-03-2012, 02:13 AM

You could use the genome of a related species and align your reads to it (with TopHat, for example). After that, you can extract the number of reads per feature (gene, isoform, ...) with HTSeq, and then use DESeq for differential expression analysis.

Or you can use a de novo approach: assemble the transcriptome of the plant de novo (with Trinity, Oases, ...), align your reads against the transcriptome, extract the read count for each transcript, and perform differential expression analysis with DESeq (or edgeR).
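
For the last step, DESeq works on a table of raw integer counts with one row per feature and one column per sample. A minimal sketch of assembling such a table in Python, assuming the per-sample counts are already held in dictionaries; the function name `build_count_table` and the sample and feature names are hypothetical:

```python
def build_count_table(per_sample_counts):
    """Return a tab-delimited count table (one row per feature, one column per sample).

    per_sample_counts: dict mapping sample name -> {feature ID: integer read count}
    (hypothetical input layout, chosen only for illustration)
    """
    samples = sorted(per_sample_counts)
    # Union of all features seen in any sample
    features = sorted({f for counts in per_sample_counts.values() for f in counts})

    lines = ["\t".join(["feature"] + samples)]
    for feature in features:
        # A feature missing from a sample gets a count of 0
        row = [str(per_sample_counts[s].get(feature, 0)) for s in samples]
        lines.append("\t".join([feature] + row))
    return "\n".join(lines) + "\n"


# Toy example with made-up counts
table = build_count_table({
    "ctrl_1": {"g1": 5},
    "treat_1": {"g1": 2, "g2": 7},
})
print(table, end="")
```

The resulting text can be written to a file and read into R as the count table.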

**mht** · 09-03-2012, 05:04 PM

Many thanks for your advice.

The second approach is what I'm doing now. However, I'm not sure how to extract the read count for each transcript, since some of the reads are multi-mapped. I know there's RSEM, which works for transcriptomes without a reference, but from what I've been reading, RSEM output is not well suited as DESeq input since the read counts are only estimates.

Does anyone know of any other programs which can give me read counts without references other than RSEM?

**lzsph** · 09-12-2012, 12:59 AM

Hi guys,

I'm in a similar situation to mht.
Does anyone know how to solve this problem?

Thanks.

Regards,

Senhao

**areyes** · 09-12-2012, 01:32 AM

This paper does de novo transcript assembly and then counts gene features based on the output of the assemblies; it might be of interest to you:

Genome Res. 2012 Apr;22(4):602-10. Epub 2011 Dec 29.
Comparative RNA sequencing reveals substantial genetic variation in endangered primates.

**Simon Anders** · 09-12-2012, 01:40 AM

You should obtain read counts per gene, not per transcript. If you align reads to a transcriptome, each read will typically align to several transcripts. Verify that they are all transcripts of the same gene and then count this as one for this gene. Of course, you will need to write a custom script to process the aligner output and do the counting, but this should be easy.
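
A minimal sketch of such a counting script in Python. It assumes the aligner output has already been reduced to (read name, transcript ID) pairs and that a transcript-to-gene mapping is available; the function name `count_reads_per_gene` and all IDs below are hypothetical. A read is counted once for a gene only if every transcript it aligns to belongs to that gene; reads that hit transcripts of more than one gene are tallied separately and not counted.

```python
from collections import Counter, defaultdict


def count_reads_per_gene(alignments, transcript_to_gene):
    """Count each read once for its gene; set aside reads that hit several genes.

    alignments: iterable of (read_name, transcript_id) pairs (assumed input format)
    transcript_to_gene: dict mapping transcript ID -> gene ID (assumed to be available)
    """
    # Collect the set of genes each read aligns to. Mates of a pair share
    # the same read name, so a pair ends up in a single entry.
    genes_hit = defaultdict(set)
    for read_name, transcript_id in alignments:
        genes_hit[read_name].add(transcript_to_gene[transcript_id])

    counts = Counter()
    ambiguous = 0
    for genes in genes_hit.values():
        if len(genes) == 1:
            # All alignments of this read are to transcripts of one gene
            counts[next(iter(genes))] += 1
        else:
            ambiguous += 1
    return counts, ambiguous


# Toy example with made-up IDs
transcript_to_gene = {"g1_t1": "g1", "g1_t2": "g1", "g2_t1": "g2"}
alignments = [
    ("r1", "g1_t1"), ("r1", "g1_t2"),  # two isoforms of the same gene
    ("r2", "g2_t1"),
    ("r3", "g1_t1"), ("r3", "g2_t1"),  # hits two different genes
]
counts, ambiguous = count_reads_per_gene(alignments, transcript_to_gene)
print(dict(counts), ambiguous)
```

This version holds all read names in memory; with name-sorted input, where alignments of the same read are adjacent, the same check can be done one read at a time.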

**lzsph** · 09-12-2012, 05:23 AM

Hi Simon,

Thanks for your advice. Unfortunately, I don't have the background to write such a script. I may need your help if you have time, and I hope it won't bother you too much.

I aligned our reads back to the transcriptome using a script from the Trinity package (alignReads.pl); the transcriptome was assembled de novo using Trinity. The alignment results consist of several files, such as

Code:

bowtie_out.coordSorted.bam
bowtie_out.coordSorted.bam.bai
bowtie_out.nameSorted.bam
bowtie_out.nameSorted.PropmapPairsForRSEM.bam

et al.

I don't know which of the files listed above should be used to count reads per gene.
(P.S. Non-model plant; two replicates per sample; 100 bp paired-end reads obtained on a HiSeq 2000.)

I would really appreciate your help; otherwise I don't know how to proceed with the downstream analysis.

Thank you very much.

Yours sincerely,
Senhao

**lzsph** · 09-12-2012, 05:28 AM

Hi areyes,

Thank you very much.

I will read the paper.

Yours sincerely,
Senhao


HTseq without reference genome
