Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to call the core genome of bacteria

    Hi everyone,

    I am trying to build a SNP tree from a set of bacterial genomes (the same species), and the first step is going to get the core genome. However, I did not find any detail workflow for that. Does anyone know some software or pipelines that can complete such task?

    Thank you very much!

    Victor

  • #2
    You could look around NCBI's ftp. Maybe your species is already in the SNP DB? If not, you might have to do all-vs-all blasts and clustering (maybe OrthoMCL) to find out the shared genes. What you want might be possible with eutils too. Oh and there's this pipeline called HAL that might do what you want..
    Last edited by rhinoceros; 08-05-2013, 10:17 AM.
    savetherhino.org

    Comment


    • #3
      How many genomes are you working with? And are you working with assemblies or raw reads?

      For a smaller number of genome assemblies, you could do whole genome alignments with Mugsy (http://sourceforge.net/projects/mugsy/files/) or ProgressiveMauve (http://gel.ahabs.wisc.edu/mauve/), filter for blocks that are common to all genomes, then infer a phylogeny from the concatenated alignment? Is that what you had in mind?

      Jason

      Comment


      • #4
        Hi rhinoceros,

        Thank you for your reply. The aim of my project is to distinguish outbreak strains, so I have to do the analysis by myself. The HAL pipeline is what I want exactly. However I am not so familiar with commend line, is there any other more user-freindly pipeline or programs under Windows system?

        Victor
        Originally posted by rhinoceros View Post
        You could look around NCBI's ftp. Maybe your species is already in the SNP DB? If not, you might have to do all-vs-all blasts and clustering (maybe OrthoMCL) to find out the shared genes. What you want might be possible with eutils too. Oh and there's this pipeline called HAL that might do what you want..

        Comment


        • #5
          Hi Jason,

          Thank you for your reply. I am working with 15-20 genomes, which have been assembled. I am using Mauve now, but I think it is not easy to filter the common sequences by manual in Mauve. Do you mean that I just concatenate all common sequences by copy paste? Which alignment and phylogenetic tools are you going to use afterwards? Also Mauve?

          Victor

          Originally posted by themerlin View Post
          How many genomes are you working with? And are you working with assemblies or raw reads?

          For a smaller number of genome assemblies, you could do whole genome alignments with Mugsy (http://sourceforge.net/projects/mugsy/files/) or ProgressiveMauve (http://gel.ahabs.wisc.edu/mauve/), filter for blocks that are common to all genomes, then infer a phylogeny from the concatenated alignment? Is that what you had in mind?

          Jason

          Comment


          • #6
            Victor,

            If you have an aversion to command line tools, you could try this web server:



            You can upload assemblies, fastqs, vcfs, and/or bams and it will compute a tree. I haven't used it, but it might fit your needs.

            Jason

            Comment


            • #7
              This is a nice web tool. Thank you very much, Jason.

              Originally posted by themerlin View Post
              Victor,

              If you have an aversion to command line tools, you could try this web server:



              You can upload assemblies, fastqs, vcfs, and/or bams and it will compute a tree. I haven't used it, but it might fit your needs.

              Jason

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Non-Coding RNA Research and Technologies
                by seqadmin


                Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

                [Article Coming Soon!]...
                Today, 08:07 AM
              • seqadmin
                Recent Developments in Metagenomics
                by seqadmin





                Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
                09-23-2024, 06:35 AM
              • seqadmin
                Understanding Genetic Influence on Infectious Disease
                by seqadmin




                During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

                Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
                09-09-2024, 10:59 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 10-02-2024, 04:51 AM
              0 responses
              14 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 10-01-2024, 07:10 AM
              0 responses
              24 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 09-30-2024, 08:33 AM
              1 response
              31 views
              0 likes
              Last Post EmiTom
              by EmiTom
               
              Started by seqadmin, 09-26-2024, 12:57 PM
              0 responses
              19 views
              0 likes
              Last Post seqadmin  
              Working...
              X