Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • AB SOLiD Bioscope

    Is anyone using/testing Bioscope as a replacement for corona lite and the whole transcriptome pipeline? I've recently installed it on our cluster and was curious to find other opinions/experiences with it.

  • #2
    I have been using it for re-sequencing (still testing). BS bundles a bunch of different experiment, WT among them. I would say, download the software and start by
    running the examples that come with it. Once you have it up and running modify it
    to work with your data and test it out.
    ABi supports SGE and PBS. THe installation in non-root mode is not extremely invasive so I would suggest you start by that.

    Let us know how it goes.
    -drd

    Comment


    • #3
      Bioscope seems ... fragile ... at the moment. Fragile in the sense that I can run all of the test examples just fine and they seemly give good results. But when I modify the scripts to work with my data then things fall apart.

      Comment


      • #4
        Can you be more specific? What do you mean fall apart?
        -drd

        Comment


        • #5
          Bioscope has, in theory, a very nice workflow where you specify a 'plan' of modules to use for a given project. As an example the 'plan' may call the 'quality value filter' module, then the 'mapping' module, then in parallel the 'mapping statistics' module plus the 'GFF' module. And so on. Each module has their own 'ini' (initialization) file which in addition to per-module commands can read in a global ini file and a per-project ini file. Once you have a 'plan' set up then you just hand it off to the workflow schedule and it runs the project.

          All very nice. Except ... the ini files do not pass information between each other. Nor do they use consistent parameter names. Some of the parameters are used but undocumented. The modules make unwarranted assumptions on what the previous module has done. A common example involves file names where the output file name of a module may not be in the format that the next module can understand as its input file name. Ditto with other parameters. I set a parameter in the per-project ini file and expect it to filter down through the modules -- which it does to the most part but I will often find at least one module which does not accept the parameter.

          That is what I mean by 'fragile' and 'fall apart'. Touch or change one small part of the plan and the pipeline fails.

          Of course this is version 1.0 of Bioscope. The software is really only used by a handful of people (as compared to, say, Microsoft Word). Thus we should expect that we will be de-facto beta testers and will encounter rough spots. Lifetech/ABI support is very responsive in trying to fix problems that I find.

          Comment


          • #6
            Originally posted by westerman View Post
            Bioscope has, in theory, a very nice workflow where you specify a 'plan' of modules to use for a given project. As an example the 'plan' may call the 'quality value filter' module, then the 'mapping' module, then in parallel the 'mapping statistics' module plus the 'GFF' module. And so on. Each module has their own 'ini' (initialization) file which in addition to per-module commands can read in a global ini file and a per-project ini file. Once you have a 'plan' set up then you just hand it off to the workflow schedule and it runs the project.

            All very nice. Except ... the ini files do not pass information between each other. Nor do they use consistent parameter names. Some of the parameters are used but undocumented. The modules make unwarranted assumptions on what the previous module has done. A common example involves file names where the output file name of a module may not be in the format that the next module can understand as its input file name. Ditto with other parameters. I set a parameter in the per-project ini file and expect it to filter down through the modules -- which it does to the most part but I will often find at least one module which does not accept the parameter.

            That is what I mean by 'fragile' and 'fall apart'. Touch or change one small part of the plan and the pipeline fails.

            Of course this is version 1.0 of Bioscope. The software is really only used by a handful of people (as compared to, say, Microsoft Word). Thus we should expect that we will be de-facto beta testers and will encounter rough spots. Lifetech/ABI support is very responsive in trying to fix problems that I find.
            Thanks for sharing,

            My five cents:

            PROs:

            + dramatic improvement in terms of running time compared with CL
            + increase of sensitivity with same specificity.
            + Much more resource efficient both IO and CPU (it is multithreaded now)
            + Easier to start analysis (at least compared to corona lite)

            CONs:

            + Still to many unnecessary files being generated
            + BAM is not the standard format to drop the alignments. Valuable
            CPU cycles and I/O bandwidth wasted in postprocessing.
            + Changes in the reporting stats don't match the old corona lite. They mainly
            report uniquely mapped reads.
            -drd

            Comment


            • #7
              Originally posted by drio View Post
              I have been using it for re-sequencing (still testing). BS bundles a bunch of different experiment, WT among them. I would say, download the software and start by
              running the examples that come with it. Once you have it up and running modify it
              to work with your data and test it out.
              ABi supports SGE and PBS. THe installation in non-root mode is not extremely invasive so I would suggest you start by that.

              Let us know how it goes.
              Excuse me, where can I download Bioscope

              Comment


              • #8
                Originally posted by skblazer View Post
                Excuse me, where can I download Bioscope
                Solid Software adalah situs berita teknologi, program dan software dengan visi dan misi memajukan industri teknologi untuk warga Indonesia.
                -drd

                Comment


                • #9
                  Originally posted by drio View Post
                  I am sorry, but I cannot find Bioscope here.

                  Comment


                  • #10
                    I think that ABI/LifeTech are still just releasing the bioscope software on an 'as-need' basis until such time as they have it in releasable form. The last public link that have at the web site is for SAET. I do not see a public mention of bioscope.

                    Comment


                    • #11
                      Sorry a little late to this discussion, but has anyone been able to get Bioscope to run on their own data? (that is the CL version BioScope-1.0.1-42)

                      Comment


                      • #12
                        Yes. I can run fragment to diBayes calls, pairing to diBayes calls and most recently the whole transcriptome calling. All command line. It has been a bear to get running smoothly since the assumptions that the various pipelines run under seem to be different.

                        Comment


                        • #13
                          That is great news that you were able to get it running on your own data. Thanks for the quick post. I am in the process of getting up running on our data too. Just wanted to know if someone has been successful.

                          Comment


                          • #14
                            Just saw this post. We were able to use a Whole-transcriptome pipeline of BioScope (1.0.1-42) on a RNA-seq dataset. And a note about its mapping statistics. I confirmed with their specialists that the current version of BS has bug on those numbers. so it will be fixed in next release, hopefully very soon.

                            We have a feeling that a large proportion of reads are wasted for SOLiD data compared to Solexa. For example, for a current chip-seq dataset, we have seen a average of 80M reads generated for a sample (quad). However, after filtering of low quality alignment and non-unique hits, only ~4% of reads could be used for further peak detection. Has anyone have similar experience? Does this sound normal?

                            Comment


                            • #15
                              Originally posted by westerman View Post
                              Yes. I can run fragment to diBayes calls, pairing to diBayes calls and most recently the whole transcriptome calling. All command line. It has been a bear to get running smoothly since the assumptions that the various pipelines run under seem to be different.
                              How many nodes, processors and memory you have for the offline cluster? We installed bioscope on the cluster and got it run using the example data but failed when we moved to test our own data with whole transcriptome pipeline. Dose it need a lot of memory to run?

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Exploring the Dynamics of the Tumor Microenvironment
                                by seqadmin




                                The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
                                07-08-2024, 03:19 PM
                              • seqadmin
                                Exploring Human Diversity Through Large-Scale Omics
                                by seqadmin


                                In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
                                06-25-2024, 06:43 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, Yesterday, 06:53 AM
                              0 responses
                              12 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 07-10-2024, 07:30 AM
                              0 responses
                              34 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 07-03-2024, 09:45 AM
                              0 responses
                              204 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 07-03-2024, 08:54 AM
                              0 responses
                              213 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X