Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • NGS data analysis

    I have a NGS data of podophyllum, can somebody tell me how can I analyse my data and design possible pathway for phyllotoxin biosynthesis. I am totally new to NGS, so please answer me in details like what softwares I should use.

  • #2
    What kind of NGS data do you have? What do you estimate the genome coverage to be, based on the expected genome size? Is there a closely related genome available in the databases?

    A project like this is not just about using "software". It will require some thought and a lot of work if you want to ultimately get anywhere close to the aim of "design possible pathway for phyllotoxin biosynthesis".

    Comment


    • #3
      What was sequenced (DNA or RNA)? In either case, assembly (since it is 454 data, Newbler is a reasonable assembler choice) would probably be the first step.

      Are there any known genes/protein in that pathway? Then you'd probably want to predict genes (if you have DNA sequences) or predict proteins (if you have RNA sequences) and perform functional annotation or compare to known sequences from known parts of that pathway or similar pathways. Software choices will dependent a lot on what kind of data and previous knowledge you have.

      Comment


      • #4
        Originally posted by Soumi
        Actually I am new to NGS. I only know basic bioinformatics.One of my lab seniors performed 454 pyrosequencing of a P. hexandrum. Now my supervisor is asking me to perform bioinformatics study using that data. As I said I am new to this, I need help. I want to know how should I plan my work.
        Start with doing some basic QC (or post QC results if they are already available) for your data. Is this DNA or RNA sequence? Secondary metabolite genes are often clustered in fungi and if you have an idea of what chemical functions are required by the phyllotoxin pathway then that could be a starting point for your hunt. All this would require enough sequence coverage. If there is not enough coverage you would need to convince your supervisor why that is important.

        In any case before you start doing anything get up to speed on NGS applications that are relevant to your needs: http://www.nature.com/nrg/series/nex...ion/index.html. Specifically assembly (if that is what you will need to do): http://en.wikibooks.org/wiki/Next_Ge..._novo_assembly

        Comment


        • #5
          Its a DNA sequence. My senior has already performed denovo assembly by using Newbler.
          He has also done Kyoto encyclopedia of genes and genomes (KEGG) analysis using KEGG Automatic Annotation Server (KAAS). He has proposed a probable pathway for phyllotoxin biosynthesis. But since the actual pathway is still unknown, I want to know what can I do with the data?

          Comment


          • #6
            If you are working in this area, you must have seen this paper: http://www.ncbi.nlm.nih.gov/pubmed/23161544

            It appears that a model for the pathway has been described in the paper (http://www.ncbi.nlm.nih.gov/pmc/arti...044/figure/F2/). Finding remaining genes in the cluster may be the only bioinformatic component of this study. Additional studies would require functional genomics (doing knockouts to see what intermediate compound accumulates to nail the pathway steps down). I don't know if this is possible to do with P. hexandrum. You probably know that better.

            Comment


            • #7
              We have already told you the broad outline of what can/needs to be done.

              Identify the two genes described in the JBC paper in your data. Look around the region to see if there are additional genes that seem to be present in a cluster (they always need not be, but in general, genes for secondary metabolites are in a cluster). You could do some bioinformatic analysis (blast, multiple sequence alignments (use protein sequence, if possible)) to see if you can identify a putative set of functions. This analysis will only get you so far as to come up with a hypothesis (e.g. gene A --> step 3 in pathway). Proving this hypothesis will require experimental work to ascribe a definitive function for any genes you may find.

              Don't take the following the wrong way. I assume you are a graduate student just starting a project? If so, propose some definite steps/show us results of your analysis using the data you have. If you are not able to figure something out on your own then we can try to help. Don't expect to get a set of "steps" to complete your project.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin




                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                04-22-2024, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-25-2024, 11:49 AM
              0 responses
              19 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-24-2024, 08:47 AM
              0 responses
              18 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              62 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              60 views
              0 likes
              Last Post seqadmin  
              Working...
              X