Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Velvet and Oases - choice of k value?

    Hi all,
    I am using Velvet and Oases to assemble a transcriptome (de novo - no genome available) from unpaired short reads. The k-mer value which yields the highest average length of the transcripts which constitute the final output from Oases is different from the k-mer value which yields the highest N50 value for the contig lengths. In that case, which k-mer value should I choose and why?

    Related, the kmer value yielding the highest average length of transcripts also yields more blast results than the kmer value yielding highest N50. For annotation purposes, it would seem more blast results = better choice. Is there a potential complication in using the kmer value yielding highest average length over the kmer value yielding highest N50?

    Lastly, if I want to combine assemblies for different k-mer values using Vmatch software, do I use the contigs output by Velvet or the transcripts output by Oases? Which would be more appropriate?

    Are there publicly available software to assemble transcripts output by Oases corresponding to different k-mer values?

    As always, thanks so much for all of your help!
    Cheers,
    Mikey

  • #2
    Originally posted by MikeyG View Post
    Hi all,
    I am using Velvet and Oases to assemble a transcriptome (de novo - no genome available) from unpaired short reads. The k-mer value which yields the highest average length of the transcripts which constitute the final output from Oases is different from the k-mer value which yields the highest N50 value for the contig lengths. In that case, which k-mer value should I choose and why?

    Related, the kmer value yielding the highest average length of transcripts also yields more blast results than the kmer value yielding highest N50. For annotation purposes, it would seem more blast results = better choice. Is there a potential complication in using the kmer value yielding highest average length over the kmer value yielding highest N50?

    Lastly, if I want to combine assemblies for different k-mer values using Vmatch software, do I use the contigs output by Velvet or the transcripts output by Oases? Which would be more appropriate?

    Are there publicly available software to assemble transcripts output by Oases corresponding to different k-mer values?

    As always, thanks so much for all of your help!
    Cheers,
    Mikey
    We are working on a tool, GAM http://services.appliedgenomics.org/software/gam/, that does that.
    At the moment it is Sanger based but we are close to a NGS release. In the first version it will merge different assemblies (different tools or same tool with different parameters, e.g. kmers) for the same set of reads.

    Best,
    Simone

    Comment


    • #3
      Is there a potential complication in using the kmer value yielding highest average length over the kmer value yielding highest N50?
      no, looks like you have found a metric that works for you. N50 is not the last word in assemblies.

      Lastly, if I want to combine assemblies for different k-mer values using Vmatch software, do I use the contigs output by Velvet or the transcripts output by Oases? Which would be more appropriate?
      This approach is fairly innocuous (it just clusters sequences with 100% one-sided overlap), you can run it on the transcripts.
      --
      Jeremy Leipzig
      Bioinformatics Programmer
      --
      My blog
      Twitter

      Comment


      • #4
        Originally posted by MikeyG View Post
        Hi all,
        I am using Velvet and Oases to assemble a transcriptome (de novo - no genome available) from unpaired short reads. The k-mer value which yields the highest average length of the transcripts which constitute the final output from Oases is different from the k-mer value which yields the highest N50 value for the contig lengths. In that case, which k-mer value should I choose and why?

        Related, the kmer value yielding the highest average length of transcripts also yields more blast results than the kmer value yielding highest N50. For annotation purposes, it would seem more blast results = better choice. Is there a potential complication in using the kmer value yielding highest average length over the kmer value yielding highest N50?

        Lastly, if I want to combine assemblies for different k-mer values using Vmatch software, do I use the contigs output by Velvet or the transcripts output by Oases? Which would be more appropriate?

        Are there publicly available software to assemble transcripts output by Oases corresponding to different k-mer values?

        As always, thanks so much for all of your help!
        Cheers,
        Mikey
        Currently i may not help you with your first few questions as i have just started velvet/oases. But your query regarding the publicly available software to assemble transcripts output with different k-mers you can use CAP3. It is very user friendly one-liner command tool.

        Comment


        • #5
          Oases comes with its own method for doing this, why not use this?

          Comment


          • #6
            vel vet work properly with my system but after oases instalation i am not be able to run that every time i got error that is:
            molbio@molbio-System-Product-Name[oases_0.2.8] oases --help [10:59AM]
            zsh: permission denied: oases
            molbio@molbio-System-Product-Name[oases_0.2.8] oases [11:18AM]
            zsh: permission denied: oases
            molbio@molbio-System-Product-Name[oases_0.2.8]

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Genetic Variation in Immunogenetics and Antibody Diversity
              by seqadmin



              The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
              Today, 07:24 PM
            • seqadmin
              Choosing Between NGS and qPCR
              by seqadmin



              Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
              10-18-2024, 07:11 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 11-01-2024, 06:09 AM
            0 responses
            24 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 10-30-2024, 05:31 AM
            0 responses
            21 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 10-24-2024, 06:58 AM
            0 responses
            25 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 10-23-2024, 08:43 AM
            0 responses
            56 views
            0 likes
            Last Post seqadmin  
            Working...
            X