Hello all,
I would like to let everyone know about Bpipe, a new tool we have created to help run bioinformatics pipelines.
Many people will be familiar with tools like Galaxy and Taverna, etc. that help you run pipelines and give a graphical view of the pipeline, inputs, outputs and many other features to make analysis pipelines more robust and manageable. Bpipe is similar in many ways but aimed at users who are command line oriented. It lets you write your pipelines almost like how you write a shell script, but it automatically adds features such as:
Bpipe is BSD licensed and available, along with documentation and examples, at http://bpipe.org. We also have a publication accepted in Bioinformatics which may be of interest as well.
Bpipe is very young and I hope to make many improvements, so I would love to have feedback from anybody here about it.
Thanks!
Simon
I would like to let everyone know about Bpipe, a new tool we have created to help run bioinformatics pipelines.
Many people will be familiar with tools like Galaxy and Taverna, etc. that help you run pipelines and give a graphical view of the pipeline, inputs, outputs and many other features to make analysis pipelines more robust and manageable. Bpipe is similar in many ways but aimed at users who are command line oriented. It lets you write your pipelines almost like how you write a shell script, but it automatically adds features such as:
- Transactional management of tasks - commands that fail get outputs cleaned up, log files saved and the pipeline cleanly aborted.
- Automatic connection of pipeline stages - Bpipe manages the file names for input and output of each stage in a systematic way so that you don't need to think about it
- Easy stopping or restarting - when a job fails it is easy to cleanly restart from the point of failure
- Audit trail - Bpipe keeps a journal of exactly which commands executed and what their inputs and outputs were
- Modularity - It's easy to make a library of pipeline stages (or commands) that you frequently use and mix and match them in different pipelines
- Parallelism - easily run many samples/files at the same time or split one sample and run analysis on many parts of it in parallel
- Integration with cluster resource managers - Bpipe supports PBS/Torque and more systems can be added easily
- Notifications - Bpipe can send you alerts by email or instant message to tell you when your pipeline finishes or even as each stage completes.
Bpipe is BSD licensed and available, along with documentation and examples, at http://bpipe.org. We also have a publication accepted in Bioinformatics which may be of interest as well.
Bpipe is very young and I hope to make many improvements, so I would love to have feedback from anybody here about it.
Thanks!
Simon
Comment