Header Leaderboard Ad

Collapse

Getting Trans-ABySS running

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Getting Trans-ABySS running

    We're interested in testing out Trans-ABySS and comparing it with our other methods for RNA-Seq analysis. It seems to have a lot of potential for metatranscriptomics, since we may not always have a metagenome.

    I've finally gotten ABySS compiled (our environment was more locked down in compiler and MPI versions than I would have liked, and made it more difficult) and I am running a test right now. I think I have all the scripts and support tools for trans-ABySS installed, but there's a chance I missed one.

    Trans-ABySS expects a directory structure with results from ABySS runs with different sizes of k. Is there a script packaged with ABySS or Trans-ABySS that generates all those runs? That will make my life a lot easier than running ABySS manually, and I didn't see that in the documentation. I've got ABySS 1.2.4 and trans-ABySS 1.0, FWIW.

    I'll post my continued experience with these programs here, in case anyone else looks into going down this road.

  • #2
    What about a simple bash script?

    #!/bin/bash
    for i in {25..30}
    do
    mkdir k$i;
    cd k$i;
    ABYSS -k$i -o output input.fa
    done

    Maybe this is too basic and I didn't get the appropriate question but if it helps...

    Comment


    • #3
      That bash script should work... I was way over-thinking this problem.

      Thanks!

      Comment


      • #4
        mrawlins,

        Did you able to run abyss 1.2.4?

        I could not make it work. Whenever I run, I get the following error. I tried to run on different servers. I get the same error.

        COMMAND USED:
        abyss-pe k=29 n=2 c=10 e=0 np=14 in='../s_3_1_sequence_fat.txt ../
        s_3_2_sequence_fat.txt' name=run82_test

        ERROR MESSAGE:
        ABYSS-P: NetworkSequenceCollection.cpp:472: void
        NetworkSequenceCollection::runControl(): Assertion `opt::erode > 0'
        failed.
        [test2:19382] *** Process received signal ***
        [test2:19382] Signal: Aborted (6)
        [test2:19382] Signal code: (-6)
        [test2:19382] [ 0] /lib64/libc.so.6 [0x3aa58302d0]
        [test2:19382] [ 1] /lib64/libc.so.6(gsignal+0x35) [0x3aa5830265]
        [test2:19382] [ 2] /lib64/libc.so.6(abort+0x110) [0x3aa5831d10]
        [test2:19382] [ 3] /lib64/libc.so.6(__assert_fail+0xf6) [0x3aa58296e6]
        [test2:19382] [ 4] ABYSS-P [0x40caab]
        [test2:19382] [ 5] ABYSS-P [0x4059e7]
        [test2:19382] [ 6] /lib64/libc.so.6(__libc_start_main+0xf4)
        [0x3aa581d994]
        [test2:19382] [ 7] ABYSS-P(__gxx_personality_v0+0x169) [0x404c99]
        [test2:19382] *** End of error message ***

        Thanks,

        Comment


        • #5
          I did get 1.2.4 running. So far we haven't run any pair-end or mate-pair experiments, so I wouldn't be able to replicate your problem.
          It looks like you've got an assert that failed, which is odd.

          It looks like your problem comes from the e=0 parameter. Somewhere the function NetworkSequenceCollection::SetState is being called with NAS_ERODE when the opt::erode variable is set to 0 (opt::erode gets set to 0 by e=0).

          I can see a few different places that could happen, so you may want to put in some debugging couts and figure out which one is being called right before the assert fails.

          Comment


          • #6
            mrawlins,

            Thank you for your input. I am using transcriptome data. It looks like abyss does not like setting e to 0 for RNA-seq data. After I ran abyss removing c and e from my command, it completed the job without any error. Abyss could guess the correct values for c and e.

            Thank you,
            RK

            Comment


            • #7
              Well, I figured out a little bug I had with ABySS and now trans-ABySS is running ( in -1 mode) so we'll see how it turns out. My first impressions are:

              This is not production-ready code.

              ABySS seems to be pretty picky about compiler and MPI versions. Trans-ABySS doesn't just have one configuration file, it has 5 general config files and 2 run-specific config files. Also, some of the general config files have some parameters that seem to be run-specific, which is a bit annoying.

              I'm not dissuaded by difficult software, so I'll keep plugging along. So far trans-ABySS has been neither easy to use nor fast to run. I'm hoping I can get it working.

              Comment


              • #8
                Well, I finally got all the ABySS output ready for trans-ABySS only to find that it dies because I don't have paired-end data. Does anyone know if there is a way around this, or does trans-ABySS not handle single end data?

                Comment


                • #9
                  Originally posted by mrawlins View Post
                  Well, I finally got all the ABySS output ready for trans-ABySS only to find that it dies because I don't have paired-end data. Does anyone know if there is a way around this, or does trans-ABySS not handle single end data?
                  You will get proper answer if you post your question on their discussion group: http://groups.google.com/group/trans-abyss

                  Comment


                  • #10
                    agree its intimidating and not production ready. I am just looking at the trans-abyss manual and quite lost.

                    From this it seems I first need abyss separately and that would not be part of any trans-abyss tool set / wrappers.. pheww .. but definitely worth it seeing what the tool offers!
                    --
                    bioinfosm

                    Comment


                    • #11
                      So, I haven't posted much lately on this as it's taken a while to get things up and running.

                      Trans-ABySS doesn't yet officially support colorspace. I was able to (using an inelegant hack) get it to run fairly well with my colorspace single-end reads, and we're encouraged. The details of that are on the Trans-ABySS Google group.

                      To those getting started, I would recommend getting ABySS working first. In general I've needed as much RAM as about three times the size of my input fasta in order for ABySS to run. If you don't have that kind of RAM you may need to run on a cluster, which requires the MPI version.

                      There are examples in the Trans-ABySS folders of the commands/parameters used to run ABySS. Basically, use abyss-pe (regardless of whether you're using single or pair end) and if you want it to use MPI give it the np=<# procs> argument.

                      The latest version of Trans-ABySS should be helpful in getting at least the 1st stage up and running. The Google group is very responsive despite its being small, so it's probably not a bad idea to post there if you're having trouble.

                      Now that it's working, it's looking much more promising. We have a few kinks to work out, but that's at least partly because the dataset we tested on was poorly suited to this kind of analysis.

                      Comment


                      • #12
                        I posted this on the google discussion, but might have a different audience here:

                        In preparation for trans-abyss, going through the steps in the manual,
                        I get an error when creating indexes:
                        $ python ensembl.py ../../annotations/hg18/ensGene.txt -i ../../
                        annotations/hg18/ensGene.idx
                        Traceback (most recent call last):
                        File "ensembl.py", line 1, in <module>
                        from analysis import transcript
                        ImportError: No module named analysis

                        Also, I am confused about the projects.cfg file. Is there a greater
                        detail or more notes on setting projects.cfg up?

                        I am looking to use a paired-end lane of Illumina to run through trans-
                        abyss. I have figured out abyss and completed analyzing that using
                        different k values.

                        TIA
                        --
                        bioinfosm

                        Comment

                        Latest Articles

                        Collapse

                        • seqadmin
                          How RNA-Seq is Transforming Cancer Studies
                          by seqadmin



                          Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
                          09-07-2023, 11:15 PM
                        • seqadmin
                          Methods for Investigating the Transcriptome
                          by seqadmin




                          Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

                          Whole Transcriptome RNA-seq
                          Whole transcriptome sequencing...
                          08-31-2023, 11:07 AM

                        ad_right_rmr

                        Collapse

                        News

                        Collapse

                        Topics Statistics Last Post
                        Started by seqadmin, 09-22-2023, 09:05 AM
                        0 responses
                        14 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 09-21-2023, 06:18 AM
                        0 responses
                        12 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 09-20-2023, 09:17 AM
                        0 responses
                        13 views
                        0 likes
                        Last Post seqadmin  
                        Started by seqadmin, 09-19-2023, 09:23 AM
                        0 responses
                        28 views
                        0 likes
                        Last Post seqadmin  
                        Working...
                        X