Hello,
I have been given two sets 454 data which have already been assembled into contigs using the newbeler assembler. The data is transcript sequence from two different flower types, from a plant species which has not yet had its genome sequenced.
They want me to compare the expression levels between two sets of data by comparing the number of reads which were used to make up matching contigs. The coverage isn't that great so some genes have multiple contigs. I can tell this as their are many cases several smaller contigs from one sample (but still in the 100's) align with a larger ones in the other.
I thought the best way to deal with this would be to pool the reads from the two samples and assemble them. Then compare the number of reads from each sample that make up each of the contigs.
I don't have access to newbeler so I tired re-assembling one of the samples with velvet. But this resulted in 25,091 contigs whereas newebler had produced 13,990.
Does anyone know which of the publicly available assemeblers works bets on 454 data? Or has anyone got an suggestion on how to deal with this data?
Thanks,
John
I have been given two sets 454 data which have already been assembled into contigs using the newbeler assembler. The data is transcript sequence from two different flower types, from a plant species which has not yet had its genome sequenced.
They want me to compare the expression levels between two sets of data by comparing the number of reads which were used to make up matching contigs. The coverage isn't that great so some genes have multiple contigs. I can tell this as their are many cases several smaller contigs from one sample (but still in the 100's) align with a larger ones in the other.
I thought the best way to deal with this would be to pool the reads from the two samples and assemble them. Then compare the number of reads from each sample that make up each of the contigs.
I don't have access to newbeler so I tired re-assembling one of the samples with velvet. But this resulted in 25,091 contigs whereas newebler had produced 13,990.
Does anyone know which of the publicly available assemeblers works bets on 454 data? Or has anyone got an suggestion on how to deal with this data?
Thanks,
John
Comment