I took advantage of CLC's offer last spring to get a free 6 mo trial evaluation of full version of their Genomic Workbench. Now it is coming to end and I need a make decision whether it worth to pay for. So far it was mostly frustrating experience, but I wonder if someone is more satisfied.
The main problem that keeps me avoiding commercial supposedly more user-oriented and conveniently integrated software (that is what you are paying for, right?) was in fact it is totally opposite. Once the juggernaut is assembled and sold to you - you, stupid end user do not even try to question programmers' logic behind its beautiful design as you just not capable to get its beauty from your end user pimple point of view. That was always my experience when I tried to ask - why this way and why not this way? Can I do this? Why not, as this makes a little sense?
For example, why I have to waste the storage space to import gigabytes of my existing databases to create yet another database in a program-specific format that cannot be read by other software? Convenience of associated metadata? OK, but why this convenient format does not allow a reference assembly against selected genome regions, while open-source software allows that using less convenient data formats? No, you have to use the entire genome - it will be slow though. Nice.
Illumina software very rarely, but does throw into reads bizarre letters, like @, F, or Q, which may be found in a few reads out of, say 20 million. There is absolutely no justification why a user-friendly software cannot handle that and upon importing in its wonderful convenient specific format labels the entire 20+ million fasta dataset as protein leaving the end user with no other choice but to search the dataset manually to remove those 1-2 bizarre reads just to make it usable. No, you cannot question the logic behind that - a standard, however stupid, cannot be changed because of end user, but we do strive to meet expectations.
Well, all right, at least you pay for well documented and well maintained software. Yep, the manual is intimidating - hundreds of pages. Not much useful though, as indices at the end are a way off, illustrations and examples provided do not correspond what you see on the screen, selections to be made are missing - you got to be kidding me?
The last thing I was trying to do is just to import an annotated hg19. Not a big deal - you download hg19, and then RefGene.txt, and then connect them to each other so you can see in your reference assembly matches to specific genes, like it is done in a simple and convenient way in 454 GS RefMapper. No way. After downloading overnight hg19 (again!) from somewhere, I ended up only with sequences separated by chromosomes. That is a big help. Why I cannot just use my existing hg19.fna and RefGene.txt the manual does not say, at least I could not find it there - if anything can be ever found in this messed up volume.
Sorry for being long - it accumulated for six months to the point I cannot hold it anymore. But, if someone has a better experience, please give me a few good reasons why this juggernaut is worth paying for.
The main problem that keeps me avoiding commercial supposedly more user-oriented and conveniently integrated software (that is what you are paying for, right?) was in fact it is totally opposite. Once the juggernaut is assembled and sold to you - you, stupid end user do not even try to question programmers' logic behind its beautiful design as you just not capable to get its beauty from your end user pimple point of view. That was always my experience when I tried to ask - why this way and why not this way? Can I do this? Why not, as this makes a little sense?
For example, why I have to waste the storage space to import gigabytes of my existing databases to create yet another database in a program-specific format that cannot be read by other software? Convenience of associated metadata? OK, but why this convenient format does not allow a reference assembly against selected genome regions, while open-source software allows that using less convenient data formats? No, you have to use the entire genome - it will be slow though. Nice.
Illumina software very rarely, but does throw into reads bizarre letters, like @, F, or Q, which may be found in a few reads out of, say 20 million. There is absolutely no justification why a user-friendly software cannot handle that and upon importing in its wonderful convenient specific format labels the entire 20+ million fasta dataset as protein leaving the end user with no other choice but to search the dataset manually to remove those 1-2 bizarre reads just to make it usable. No, you cannot question the logic behind that - a standard, however stupid, cannot be changed because of end user, but we do strive to meet expectations.
Well, all right, at least you pay for well documented and well maintained software. Yep, the manual is intimidating - hundreds of pages. Not much useful though, as indices at the end are a way off, illustrations and examples provided do not correspond what you see on the screen, selections to be made are missing - you got to be kidding me?
The last thing I was trying to do is just to import an annotated hg19. Not a big deal - you download hg19, and then RefGene.txt, and then connect them to each other so you can see in your reference assembly matches to specific genes, like it is done in a simple and convenient way in 454 GS RefMapper. No way. After downloading overnight hg19 (again!) from somewhere, I ended up only with sequences separated by chromosomes. That is a big help. Why I cannot just use my existing hg19.fna and RefGene.txt the manual does not say, at least I could not find it there - if anything can be ever found in this messed up volume.
Sorry for being long - it accumulated for six months to the point I cannot hold it anymore. But, if someone has a better experience, please give me a few good reasons why this juggernaut is worth paying for.
Comment