Is anyone using/testing Bioscope as a replacement for corona lite and the whole transcriptome pipeline? I've recently installed it on our cluster and was curious to find other opinions/experiences with it.
-
I have been using it for re-sequencing (still testing). BS bundles a bunch of different experiments, WT among them. I would say, download the software and start by running the examples that come with it. Once you have it up and running, modify it to work with your data and test it out.
ABI supports SGE and PBS. The installation in non-root mode is not extremely invasive, so I would suggest you start with that.
Let us know how it goes.
-drd
-
Bioscope has, in theory, a very nice workflow where you specify a 'plan' of modules to use for a given project. As an example, the 'plan' may call the 'quality value filter' module, then the 'mapping' module, then in parallel the 'mapping statistics' module plus the 'GFF' module. And so on. Each module has its own 'ini' (initialization) file which, in addition to per-module commands, can read in a global ini file and a per-project ini file. Once you have a 'plan' set up, you just hand it off to the workflow scheduler and it runs the project.
All very nice. Except ... the ini files do not pass information between each other. Nor do they use consistent parameter names. Some of the parameters are used but undocumented. The modules make unwarranted assumptions about what the previous module has done. A common example involves file names, where the output file name of one module may not be in a format that the next module can understand as its input file name. Ditto with other parameters. I set a parameter in the per-project ini file and expect it to filter down through the modules, which it does for the most part, but I will often find at least one module which does not accept the parameter.
That is what I mean by 'fragile' and 'fall apart'. Touch or change one small part of the plan and the pipeline fails.
Of course this is version 1.0 of Bioscope. The software is really only used by a handful of people (as compared to, say, Microsoft Word). Thus we should expect that we will be de facto beta testers and will encounter rough spots. Lifetech/ABI support is very responsive in trying to fix the problems that I find.
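For readers who have not seen a layered ini setup, here is a minimal Python sketch of the idea described above: a global file, a per-project file, and a per-module file are merged, with later layers overriding earlier ones, plus a check for project-level parameters that a given module would not accept. This is not Bioscope code or its actual file format; the simple `key = value` syntax, the parameter names, and the helper functions are all assumptions made up for illustration.

```python
def parse_ini(text):
    """Parse simple 'key = value' lines, skipping blanks and '#' comments."""
    params = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        params[key.strip()] = value.strip()
    return params


def resolve(global_ini, project_ini, module_ini):
    """Merge the three layers; later layers override earlier ones."""
    merged = {}
    for layer in (global_ini, project_ini, module_ini):
        merged.update(parse_ini(layer))
    return merged


def unaccepted(project_ini, accepted_keys):
    """Return project-level keys that a module does not accept."""
    return sorted(set(parse_ini(project_ini)) - set(accepted_keys))


# Hypothetical file contents and parameter names, for illustration only.
GLOBAL_INI = "reference = hg18.fa\nread.length = 50\n"
PROJECT_INI = "read.length = 35\noutput.dir = /scratch/run1\n"
MAPPING_INI = "mapping.mismatches = 2\n"

# The per-project value of read.length overrides the global one.
mapping_params = resolve(GLOBAL_INI, PROJECT_INI, MAPPING_INI)

# Suppose this (hypothetical) module only understands read.length.
ignored = unaccepted(PROJECT_INI, accepted_keys={"read.length"})

print(mapping_params)
print(ignored)
```

Running a check like `unaccepted` against each module in a plan before submitting the job is one way to catch a parameter that will not filter down, rather than finding out when the pipeline fails.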
-
My five cents:
PROs:
+ Dramatic improvement in running time compared with CL
+ Increased sensitivity with the same specificity
+ Much more resource efficient in both I/O and CPU (it is multithreaded now)
+ Easier to start an analysis (at least compared to corona lite)
CONs:
+ Still too many unnecessary files being generated
+ BAM is not the standard format in which the alignments are written, so valuable CPU cycles and I/O bandwidth are wasted in postprocessing
+ The reporting stats have changed and don't match the old corona lite; they mainly report uniquely mapped reads
-drd
-
Originally posted by skblazer:
Excuse me, where can I download Bioscope?
-drd
-
I think that ABI/LifeTech are still releasing the Bioscope software only on an 'as-needed' basis until such time as they have it in releasable form. The last public link that they have on the web site is for SAET. I do not see a public mention of Bioscope.
-
Yes. I can run fragment to diBayes calls, pairing to diBayes calls and most recently the whole transcriptome calling. All command line. It has been a bear to get running smoothly since the assumptions that the various pipelines run under seem to be different.
-
Just saw this post. We were able to use the whole-transcriptome pipeline of BioScope (1.0.1-42) on an RNA-seq dataset. A note about its mapping statistics: I confirmed with their specialists that the current version of BS has a bug in those numbers, so it should be fixed in the next release, hopefully very soon.
We have a feeling that a large proportion of reads is wasted for SOLiD data compared to Solexa. For example, for a current ChIP-seq dataset, we have seen an average of 80M reads generated per sample (quad). However, after filtering out low-quality alignments and non-unique hits, only ~4% of the reads could be used for further peak detection. Has anyone had a similar experience? Does this sound normal?