Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • rdu
    Member
    • Aug 2010
    • 29

    blastx parameters for CDS finding

    I'm thinking to use BLASTX to find CDS (protein coding sequence) regions on genomes. How can I set the parameters to let BLASTX put big penalty on a stop codon found? Any experience or any useful link? Thanks.
  • robs
    Senior Member
    • May 2010
    • 116

    #2
    You might want to take a look at ORF finders instead of using BLASTx for this task. If you could provide more information of what you are trying to do, then maybe we could provide more useful help.

    Comment

    • tonybolger
      Senior Member
      • Feb 2010
      • 156

      #3
      Originally posted by rdu View Post
      I'm thinking to use BLASTX to find CDS (protein coding sequence) regions on genomes. How can I set the parameters to let BLASTX put big penalty on a stop codon found? Any experience or any useful link? Thanks.
      In short - Don't

      Use Scipio or Exonerate - this is exactly what they're designed for.

      Comment

      • robs
        Senior Member
        • May 2010
        • 116

        #4
        Originally posted by tonybolger View Post
        In short - Don't

        Use Scipio or Exonerate - this is exactly what they're designed for.
        The problem is that there is not enough information in the question. If he/she is interested in prokaryotic genomes, then Scipio would not be very useful.

        Eukaryotic genomes:


        Prokaryotic genomes:

        Comment

        • tonybolger
          Senior Member
          • Feb 2010
          • 156

          #5
          Originally posted by robs View Post
          The problem is that there is not enough information in the question. If he/she is interested in prokaryotic genomes, then Scipio would not be very useful.
          True on both fronts. Still, regardless of the actual aim, there's probably a better tool out there than 'raw' blastx.

          Comment

          • SES
            Senior Member
            • Mar 2010
            • 275

            #6
            Originally posted by robs View Post
            The problem is that there is not enough information in the question. If he/she is interested in prokaryotic genomes, then Scipio would not be very useful.

            Eukaryotic genomes:


            Prokaryotic genomes:
            http://rast.nmpdr.org/
            Since only a genome sequence was mentioned, maybe running sixpack to find all ORFs, followed by iprscan or a Conserved Domain Database search would be more what is requested. Ideally you would combine multiple sources of evidence, and more information would be helpful.

            Comment

            • rdu
              Member
              • Aug 2010
              • 29

              #7
              Sorry, I didnt get a chance to back here.

              Actually, I'm doing this. I downloaded 20 different bacteria genomes from NCBI website. These genome gene annotation files are ready there. Then I cut each genomic DNA sequence to short read length of pieces (~100nt each) to simulate the ngs reads. I was intending to use BLASTX and protein COG database to annotate these short reads to check how the short reads inherit the function annotation. Let me know if it's still not clear.

              Thanks for all your suggestions. But I just want to stick to one of NCBI provided tools.
              Happy new year to everyone.

              Comment

              • jordi
                Member
                • Apr 2009
                • 49

                #8
                I carried out something similar with 454 reads from a non-model specie, so I had to be aware to point mutations in stretches of homopolymers that could change the frame translation pattern giving rise a false stop codon. My blast parameters were not so restrictive as I think you want. I used a e-value threshold of 0.001 over repetmasked reads using BLASTN (BLASTX was really unsuccesful).
                hope this helps and happy new year!

                Comment

                Latest Articles

                Collapse

                • SEQadmin2
                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                  by SEQadmin2


                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                  ...
                  06-02-2026, 10:05 AM
                • SEQadmin2
                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                  by SEQadmin2


                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                  Introduction

                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                  05-22-2026, 06:42 AM
                • SEQadmin2
                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                  by SEQadmin2

                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                  05-06-2026, 09:04 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, Today, 08:59 AM
                0 responses
                10 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 12:03 PM
                0 responses
                21 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 11:40 AM
                0 responses
                17 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 05-28-2026, 11:40 AM
                0 responses
                31 views
                0 reactions
                Last Post SEQadmin2  
                Working...