**phoss** · 05-02-2012, 07:51 PM

Hi cerebralrust,

I was curious if you've tried M. truncatula? This is a close relative of G. max.
Have you had any luck with genomes off of Phytozome?

**sdriscoll** · 05-02-2012, 11:06 PM

You might try one of the "no genome" assemblers like Trinity or ABySS to build a "gene" library from your data. I think those put together a consensus set of sequences assembled from your reads. Then you could build a bowtie reference from those FASTA sequences and align your reads to it with bowtie. Finally, you can count the reads aligned to each sequence and compare samples using something like DESeq.

You'll need some major computer power to run Trinity, from what I hear. Assembling sequences from reads is much more resource-consuming than the bowtie alignment stage.
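For the counting step, here is a minimal Python sketch, assuming the aligner wrote standard SAM output against the assembled contigs. The function name `count_reads_per_contig` is made up for illustration, and skipping secondary and supplementary alignments is just one reasonable choice, not something the tools above require.

```python
from collections import Counter


def count_reads_per_contig(sam_lines):
    """Count primary aligned reads per reference sequence from SAM lines.

    Hypothetical helper: relies only on the standard SAM columns
    (FLAG is column 2, RNAME is column 3).
    """
    counts = Counter()
    for line in sam_lines:
        if line.startswith("@"):
            continue  # header line
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:
            continue  # not a complete alignment record
        flag = int(fields[1])
        rname = fields[2]
        if flag & 0x4 or rname == "*":
            continue  # read is unmapped
        if flag & 0x100 or flag & 0x800:
            continue  # secondary or supplementary alignment
        counts[rname] += 1
    return counts
```

In practice you would pass an open file handle, e.g. `count_reads_per_contig(open("sample1.sam"))`, once per sample, and hand the raw counts to the differential expression tool.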

**jujubix** · 05-03-2012, 01:21 PM

De novo assembly, as sdriscoll mentioned, is the typical solution when no decent reference genome exists.

Given that you're dealing with gene expression, I assume you have transcriptome reads, in which case you could look into Trans-ABySS, which is the transcriptome-specific version of ABySS. It is a single software pipeline that aims to assemble reads into transcripts and quantify transcript abundance, all without a reference genome. In theory you would end up with two sets of transcripts and expression levels, after which standard DE analysis could be conducted. Although finding corresponding transcripts between the two sets could be tricky...

Software link is here and paper is here

**sdriscoll** · 05-03-2012, 01:37 PM

indeed. do you think that one would have to engage in a massive pairwise BLAST session between assemblies in order to match them up?
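One common way to do that matching is reciprocal best hits. A rough Python sketch, assuming you ran BLAST in both directions with tabular output (the 12-column `-outfmt 6` layout, bit score in the last column); the function names are made up for illustration:

```python
def best_hits(blast_tab_lines):
    """Map each query ID to its highest-bit-score subject ID.

    Assumes the default 12-column BLAST tabular layout:
    query ID in column 1, subject ID in column 2, bit score in column 12.
    """
    best = {}
    for line in blast_tab_lines:
        if not line.strip() or line.startswith("#"):
            continue  # blank line or comment
        fields = line.rstrip("\n").split("\t")
        query, subject, bits = fields[0], fields[1], float(fields[11])
        if query not in best or bits > best[query][1]:
            best[query] = (subject, bits)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b_lines, b_vs_a_lines):
    """Return sorted (a, b) pairs that are each other's best hit."""
    a_to_b = best_hits(a_vs_b_lines)
    b_to_a = best_hits(b_vs_a_lines)
    return sorted((a, b) for a, b in a_to_b.items() if b_to_a.get(b) == a)
```

Transcripts without a reciprocal partner simply drop out, which is part of why matching two separate assemblies is awkward.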

Maybe, for that reason, it would be easiest to pool all reads into a massive FASTQ and run them through ABySS at once to get a master list of transcripts and then perform quantification through other means.
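If you do pool, it helps to keep track of which sample each read came from, so you can still split the counts per sample afterwards. A small Python sketch, assuming plain 4-line FASTQ records; `pool_fastq` and the `sample:` name prefix are made-up conventions, not anything ABySS expects:

```python
def pool_fastq(sample_to_lines):
    """Yield FASTQ lines from every sample, tagging read names with the sample.

    Assumes strict 4-line FASTQ records (header, sequence, '+', qualities).
    The 'sample:' prefix on the read name is a hypothetical convention.
    """
    for sample, lines in sample_to_lines.items():
        for index, line in enumerate(lines):
            line = line.rstrip("\n")
            if index % 4 == 0:
                if not line.startswith("@"):
                    raise ValueError("expected a FASTQ header, got: " + line)
                yield "@" + sample + ":" + line[1:]
            else:
                yield line
```

Each value in `sample_to_lines` can be an open file handle, and the yielded lines can be written straight out to the pooled FASTQ.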

**jujubix** · 05-03-2012, 01:40 PM

Yeah, at this point building a common reference via assembly is looking mighty tempting. This is, of course, assuming cerebralrust has the major computer power to run everything.

**sdriscoll** · 05-03-2012, 01:42 PM

yeah. i'd be a little nervous to try it myself. but that's why i have more than one computer.

**cerebralrust** · 05-04-2012, 02:57 AM

Hello members. Thank you for your valuable advice.

I've run assemblies on my data using Trinity, Newbler, MIRA, and Velvet on my HP laptop, which has 4 GB RAM and an i3 processor. About 800k reads, with both genotypes pooled together.
No, I have not tried M. truncatula, phoss. I will, thanks!
I've pooled all the reads and assembled using MIRA + CAP3. Trinity, although a really good assembler, is quite bad for plant genomes (poor annotation, poor N50, etc.).
Yes, it is transcriptome. Now I suppose I will map the reads back to this 'reference', quantify, and continue with the analyses.
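For the quantify step, a minimal Python sketch of turning per-sample read counts into one table, assuming you already have a count per contig for each genotype. The names are made up for illustration. Counts per million is shown only as a quick sanity check for eyeballing; DESeq works from the raw counts.

```python
def build_count_matrix(per_sample_counts):
    """Arrange {sample: {contig: count}} into a contig-by-sample table.

    Contigs missing from a sample get a count of 0.
    """
    samples = sorted(per_sample_counts)
    contigs = sorted({c for counts in per_sample_counts.values() for c in counts})
    rows = [[per_sample_counts[s].get(c, 0) for s in samples] for c in contigs]
    return samples, contigs, rows


def counts_per_million(rows):
    """Scale each sample column so its counts sum to one million."""
    totals = [sum(column) for column in zip(*rows)]
    return [
        [(value * 1e6 / total) if total else 0.0 for value, total in zip(row, totals)]
        for row in rows
    ]
```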

Thanks for the ABySS suggestion and paper, jujubix & sdriscoll. I will try it out.

Differential gene expression analysis without reference
