Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Avadis NGS vs CLC Genomics?

    Hi,

    I'm a bench biologist with a load of RNA-seq data, who is trying to find a good integrated tool for the analysis. I'm neither a bioinformatician nor a mathematician or a statistics fan, so I’m looking for a program where I can put in my data and without getting into much details and complex algorithms, get some answers. With ‘I’, I am sure I represent many other colleagues. Have until today trialed Avadis NGS and CLC Genomics workbench. I have one question before offering my view on these programs.

    When it comes to the aligning algorithms within these programs, which one is the better one? I know that CLC uses the Bowtie, Tophat, etc, but I do not know anything about Avadis.

    Before offering my evaluation of the programs, should say that I have only used the programs for two weeks (trial periods) and so have not gone into the deep with each. Therefore, I have most probably missed other pros and cons of the programs. Then again, if I have not been able to find a specific feature in the software isn’t that something for the developers to think of?!

    Both softwares
    For groups that have no bioinformatician and no collaborations with such people (sometimes collaborations can take a long long time), it is good that there are companies or goups with an effort to make bioinformatics tools available.

    In my opinion, the makers have more or less thrown the user-friendliness out of the window. These softwares are supposed to be made for people with no prior knowledge of bioinformatics, or at least so does the companies behind it claim, but nope. They talk about the algorithms and explain them in a way so that a biologist like me gets lost half the way (this is especially true about the Avadis). And when you finally have passed the aligning step, it needs a whole education to understand and interpret your results.

    None of the softwares allows you to explore the program without the manual. Honestly, how many does read the manual for that new TV they have bought or the license agreements? And when you start reading the manuals, sometimes you wish you hadn’t, because they leave you with more confusion.

    And what is this obsession with SNPs? Not everyone is interested in SNPs.

    All I want to have is a simple list of which genes are significantly up- or downregulated in my cells with a specific treatment. None of the programs helps you analyze time series experiments to the fullest either.

    The idea with these programs should be to allow the user analyze the whole data within the same program, ie upload, handle replicates, align, give data together with statistics for expression of each gene, allow for comparison between samples and over series of samples and do statistical analysis, and help you produce tabular and graphical data.

    Avadis
    Pros
    + The integrated genome browser
    + Compatibility with UCSC genes
    + Data statistics (e.g. PCA plot)
    + The software helps combining replicates when data in different files
    + Ability to analyze the expression of a single gene and its different isoforms, in particular the fact that they can be traced in different samples graphically and with tabular data
    + The graphics
    + Stability
    + Requires less memory compared to CLC.
    + Reference genomes available one click away

    Cons
    - The average expression for a specific gene in some cases may mask the difference in expression of its isoforms
    - Cannot handle trends over samples
    - Does not offer statistical data for finding difference in expression of a specific gene in samples
    - The manual (although short, but very incomprehensible) – leaves you with the same feeling as Kafka’s ‘The process’
    - No visualization of quality scores
    - When trial period over, you are not even allowed to open the program
    - A lot of money for a program that only handles NGS data

    CLC Genomics
    Pros
    + Uses the alignment algorithms widely accepted in the RNA-seq community (one of the two reasons why I still consider this software)
    + Visualization of quality scores for the reads
    + Can do other things than just analyzing NGS data (although not very comprehensively), which makes it more worth the money (the second reason)

    Cons
    - The program crashed 4-5 times (3 times while running reads alignment over night, so don’t think you can be efficient!)
    - Requires a lot of memory
    - Does not offer a simple way to combine replicates
    - Reference sequence not available, have to download manually – while attempting to download, it disconnected several times (forget about doing it overnight or while doing something else in the lab)
    - No graphic visualization of data
    - Cannot handle time series analysis or treatment
    - No statistical data (PCA plot) offered
    - Incompatibility with UCSC genomes
    - The manual – thicker than Tolstoy’s ‘War and Peace’ and in some parts you’ll have to guess whether it is related to your type of analysis or not. You have to go through most of the manual for finding the related pieces.
    - Search not capable of finding same gene in different samples


    I am on my way to trial the Partek program, so perhaps I could continue on this thread…

  • #2
    As far as I know, Avadis doesn't do alignment. It's more for downstream analysis of aligned reads. Also, CLC Bio does *not* use Bowtie etc by default, but rather its own proprietary algorithm. It *can* use Bowtie or other executables through its plug-in mechanism, however.

    Comment


    • #3
      P.S. You could try Galaxy and see if you like it. If you do, you can probably get someone to help you do a local installation. Galaxy is free and lets you use tools like Bowtie, Samtools and so on in a pretty intuitive way.

      Comment


      • #4
        To get a feel for Galaxy, you can use the public Galaxy server at usegalaxy.org. However, as assembly is too resource intensive, you won't be able to do that on the public server. You would need to install your own galaxy locally (see http://getgalaxy.org and http://community.g2.bx.psu.edu/), or using the cloud (https://bitbucket.org/galaxy/galaxy-central/wiki/cloud).

        Dave C

        Comment


        • #5
          I can second (or third) the advice to check out Galaxy. If you have someone fairly good with unix it's pretty easy to set up a local install, even if you work on an organism not included as a default.

          I think you also need to think about the need to understand what you're doing. Imagine I turned this round and said I'm a Bioinformatician and I want to do some cloning. I've heard Gateway is a good way to go but I've ordered the kits and now have no idea how to start: mad!

          Bioinformatics is a field in itself and you need to understand the data and handling steps to sensibly use and interpret the data. Certainly software can make the job more user friendly (like Gateway cloning is easier than doing everything yourself) but that does not remove the need to know the details of what you're doing.

          Comment


          • #6
            Thank you for trying Avadis NGS (version 1.1.1) and letting us and others know about your experience. It is nice to get such comprehensive feedback on our product – because the more feedback we get, the better and more useful our product can be.

            Allow us to comment on some of the issues you described:

            First, Avadis NGS doesn’t include an alignment step; it works with pre-aligned data only. With algorithm development for alignment being such a rapidly progressing field, any particular alignment technology incorporated into our software may become outdated quickly. There are many great, free alignment tools available (as some others already pointed out), so we decided to start our software offering from that step onwards.

            Second, your discontent with our manual is noted. We are working on making the manual significantly more reader-friendly, in addition to creating step-by-step tutorials for each workflow, so you can more effectively learn about the analyses that are relevant to you without going deep into the algorithms. (You’re quite right about people not reading the instructions for setting up a television, but I’m sure you agree that NGS data analysis is more diverse and complex than that. )

            Third, to address the issue you raise about the different expression of the isoforms, see the workflow step “Differential splicing” in the section "Expression Analysis”. This feature will analyze all the different isoforms for a gene and report if one or more of the isoforms shows a different expression behavior. The resulting Entity List with the splicing results can then be visualized with the Gene View (right-click on the entity list and choose “Show in Gene View”) to investigate the results and see the corroborating evidence for the alternative splicing.

            Fourth, to investigate trends in the expression values for genes, you can try the “Filter on Parameters” workflows step. This will allow you to find those genes that follow a particular value of a parameter (for instance “Dose” or “Time”). You can set up a numeric experimental parameter and the “Filter on Parameters” workflow step will then show you those genes that show a similar behavior to the trend of you data. (We think this is what you meant with handling trends over the samples).

            Fifth, to find the differential expression behavior for a set of genes, you can either load a set of genes from a file or use the “Filter on Entity List” on any results Entity List to create a subset of genes you are interested in. Once a set of target genes is created, any visualization, such as the profile plot, the gene view, or spreadsheet view will then be limited to only the set of target genes you are interested in.

            Sixth, we’re unsure what you meant when you wrote that there is no visualization of quality scores, as there are a number of representations for tile/flow/read quality in Avadis NGS’ Quality Inspection features.

            Finally, our customer support team is readily accessible by phone and email. They are always happy to help you troubleshoot any issues, even if you are using a trial license at the time. Do not hesitate to contact us directly if you need a better explanation of a particular feature.

            Best wishes,

            The Avadis NGS Team

            Comment


            • #7
              Originally posted by Novice View Post
              I am on my way to trial the Partek program, so perhaps I could continue on this thread…
              Please go ahead, unless you work for them
              Thanks for your post, and let us know if you manage to install Galaxy.

              Comment


              • #8
                Care to join the free data analysis workshop in London next week?

                Originally posted by Novice View Post
                I am on my way to trial the Partek program, so perhaps I could continue on this thread…
                Hello,

                Thanks for trying out Partek. We've had a load of updates, particularly related to your data analysis queries.
                I see you're in London, you are most welcome to sign up for our free workshop and even bring your data along for a deep dat analysis experience.

                Hope to see you there!

                Comment


                • #9
                  Avadis

                  I recently attended a webinar from Avadis and found that it is an excellent tool for downstream analysis for RNA-seq data analysis it has incorporated DEseq for differential expression. I dont work for Avadis but want to share the information

                  Comment


                  • #10
                    Originally posted by Strand SI View Post
                    Thank you for trying Avadis NGS (version 1.1.1) and letting us and others know about your experience. It is nice to get such comprehensive feedback on our product – because the more feedback we get, the better and more useful our product can be.

                    Allow us to comment on some of the issues you described:

                    First, Avadis NGS doesn’t include an alignment step; it works with pre-aligned data only. With algorithm development for alignment being such a rapidly progressing field, any particular alignment technology incorporated into our software may become outdated quickly. There are many great, free alignment tools available (as some others already pointed out), so we decided to start our software offering from that step onwards.

                    Second, your discontent with our manual is noted. We are working on making the manual significantly more reader-friendly, in addition to creating step-by-step tutorials for each workflow, so you can more effectively learn about the analyses that are relevant to you without going deep into the algorithms. (You’re quite right about people not reading the instructions for setting up a television, but I’m sure you agree that NGS data analysis is more diverse and complex than that. )

                    Third, to address the issue you raise about the different expression of the isoforms, see the workflow step “Differential splicing” in the section "Expression Analysis”. This feature will analyze all the different isoforms for a gene and report if one or more of the isoforms shows a different expression behavior. The resulting Entity List with the splicing results can then be visualized with the Gene View (right-click on the entity list and choose “Show in Gene View”) to investigate the results and see the corroborating evidence for the alternative splicing.

                    Fourth, to investigate trends in the expression values for genes, you can try the “Filter on Parameters” workflows step. This will allow you to find those genes that follow a particular value of a parameter (for instance “Dose” or “Time”). You can set up a numeric experimental parameter and the “Filter on Parameters” workflow step will then show you those genes that show a similar behavior to the trend of you data. (We think this is what you meant with handling trends over the samples).

                    Fifth, to find the differential expression behavior for a set of genes, you can either load a set of genes from a file or use the “Filter on Entity List” on any results Entity List to create a subset of genes you are interested in. Once a set of target genes is created, any visualization, such as the profile plot, the gene view, or spreadsheet view will then be limited to only the set of target genes you are interested in.

                    Sixth, we’re unsure what you meant when you wrote that there is no visualization of quality scores, as there are a number of representations for tile/flow/read quality in Avadis NGS’ Quality Inspection features.

                    Finally, our customer support team is readily accessible by phone and email. They are always happy to help you troubleshoot any issues, even if you are using a trial license at the time. Do not hesitate to contact us directly if you need a better explanation of a particular feature.

                    Best wishes,

                    The Avadis NGS Team
                    Hi,
                    Do you know if Avadis NGS support Pair end reads generated from Illumina HiSeq? Also do I need to worry about the adapter sequence in my file? Thank you!

                    Comment


                    • #11
                      Originally posted by mediator View Post
                      Hi,
                      Do you know if Avadis NGS support Pair end reads generated from Illumina HiSeq? Also do I need to worry about the adapter sequence in my file? Thank you!
                      Yes, you can import paired end reads generated from any next-gen sequencing platform, including Illumina HiSeq.

                      Avadis NGS 1.3 will support alignment for raw DNA and small RNA reads. When setting the parameters for alignment, you can choose to trim adaptor sequences.

                      The Avadis NGS Team
                      http://avadis-ngs.com

                      Comment


                      • #12
                        Originally posted by Novice View Post
                        Hi,

                        I'm a bench biologist with a load of RNA-seq data, who is trying to find a good integrated tool for the analysis. I'm neither a bioinformatician nor a mathematician or a statistics fan, so I’m looking for a program where I can put in my data and without getting into much details and complex algorithms, get some answers. With ‘I’, I am sure I represent many other colleagues.
                        I came from biology and transitioned into bioinformatics. I understand your frustration. But I think it is very naive to think you can get away with doing cutting-edge work with cutting-edge technology and expect everything to be handed to you on a platter.

                        Perhaps in 5 years more there will be software out there for NGS data to give you answers without you having to *think* about the details and understand how the results are generated.

                        Becareful of the "cargo cult" mentality, whereby just having a NGS machine will generate you papers. Unfortunately, the HiSeq 2000 does not have a big red button that says "push here for published paper" after dumping your sample into the flow cell.

                        Originally posted by Novice View Post

                        For groups that have no bioinformatician and no collaborations with such people (sometimes collaborations can take a long long time), it is good that there are companies or goups with an effort to make bioinformatics tools available.
                        Yes, it is good. But you must be careful not to buy in too readily to the promises of the companies to give you a software package that can do it all. Yes it would it be great not to deal with bioinformaticians and their intellectual rigidness about how to do things properly (and share authorship with them, after all YOU generated all the data).

                        The sad thing is that if you do not have the ability to analyze your own data then perhaps you shouldn't have been given the funding to carry it out to begin with!

                        Originally posted by Novice View Post
                        In my opinion, the makers have more or less thrown the user-friendliness out of the window. These softwares are supposed to be made for people with no prior knowledge of bioinformatics, or at least so does the companies behind it claim, but nope. They talk about the algorithms and explain them in a way so that a biologist like me gets lost half the way (this is especially true about the Avadis). And when you finally have passed the aligning step, it needs a whole education to understand and interpret your results.
                        Well unfortunately Steve Jobs passed away, and I don't see Apple taking over genomic analysis any time soon. Not only that but the methods of analyzing NGS data are constantly changing, improving, and therefore have not had time to settle yet. To expect the companies and the bioinformaticians writing the software for you to not worry about the details (because in reality, the details do matter when methods are not stable) just comes across as very entitled.

                        Which is another point - in this day and age we should not allow any biologists to finish a degree without some computer science and statistics training. Biology will eventually turn into an information science and if you want to do research you will have to use computers.

                        Harsh words but this is coming from someone like you who started as a biologist and fought his way into bioinformatics. It's not easy, but for your own career I would strongly urge you to learn - otherwise you will be forever dependent on others to do your analysis - and that means you will be waiting. What will happen when the next technology arrives (and it will), will you be expecting all the software again to be one-click ready for you?
                        Last edited by NGSfan; 12-15-2011, 08:47 AM.

                        Comment


                        • #13
                          You must not have demoed (sp?) CLC Bio very long. All the cons you listed it can do.

                          Also I disagree with the above post. Cloud computing with easy to use GUI graphics will likely take over all NGS analysis very soon. Illumina is already working on this and I'm sure dozens of other companies are too.

                          Comment


                          • #14
                            Originally posted by NextGenSeq View Post
                            Also I disagree with the above post. Cloud computing with easy to use GUI graphics will likely take over all NGS analysis very soon. Illumina is already working on this and I'm sure dozens of other companies are too.
                            True that may be, but as NGSfan pointed out, the analysis scene is constantly changing. Say that it takes 6-12 months to make a tool user friendly. A new and better tool will come out about as often, so the user-friendly GUIs will always be one generation behind.

                            Comment


                            • #15
                              Originally posted by NextGenSeq View Post
                              Also I disagree with the above post. Cloud computing with easy to use GUI graphics will likely take over all NGS analysis very soon. Illumina is already working on this and I'm sure dozens of other companies are too.
                              I'm sure they are, but I can't see that ending up being the solution. People seem to assume that any NGS analysis requires huge amounts of compute power, which just isn't true for many (even most) applications. A decent high-end PC with a good chunk of RAM is enough for mapping and analysing most types of experiment. I'm sitting in front of my fairly standard Dell Optiplex windows pc which currently has a GUI open showing me half a billion reads, along with derived quantitations and statistical analysis results and not a bit of cloud in sight. Just because it's possible to write an inefficient Perl script which will

                              Sure, there are types of analysis for which serious hardware is required (assembly of large genomes would be the obvious example), but you can't just go on to amazon and fire up a cloud instance of a machine with 500GB+ of RAM. There are also instances where you would benefit from more processing power (mapping reads to existing genomes), where the cloud can provide this power, but then you end up being limited by the rate at which you can upload data and download results (we're still on a 20Mb connection, but even 100Mb which is pretty standard is very slow for this).

                              Where the cloud could be very useful is in the reanalysis of data which is already uploaded, to save you the bother and time of pulling this down to your local machine, but there are still great advantages to keeping your local data local.

                              What still amazes me is that people who are prepared to spend tens of thousands of pounds on sequencing won't stump up a thousand pounds for a beefy PC to help with the analysis. It's painful to see a postdoc wasting their time watching analyses run slowly on a 4 year old PC, which could be done in a fraction of the time if they spent a modest amount on a new machine.

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Strategies for Sequencing Challenging Samples
                                by seqadmin


                                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                                03-22-2024, 06:39 AM
                              • seqadmin
                                Techniques and Challenges in Conservation Genomics
                                by seqadmin



                                The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                                Avian Conservation
                                Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                                03-08-2024, 10:41 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, Yesterday, 06:37 PM
                              0 responses
                              10 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, Yesterday, 06:07 PM
                              0 responses
                              9 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 03-22-2024, 10:03 AM
                              0 responses
                              50 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 03-21-2024, 07:32 AM
                              0 responses
                              67 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X