Header Leaderboard Ad

Collapse

Blasting contigs against reference database

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Blasting contigs against reference database

    Apologies if this has been covered elsewhere, couldn't find a satisfactory answer easily....

    The problem: I have hi-seq 2500 PE reads from a microbial culture that contain ONE cyanobacterial genome of interest and several contaminating genomes. My understanding is that by blasting against a local reference database containing only cyanobacterial genomes, I could bin my contigs by those which contain any cyanobacterial genes and those which do not.

    Further analysis of G-C content and tetranucleotide frequencies could then be used to eliminate chimeric contigs, leaving me with a draft genome.

    Could anybody point me in the direction of resources to help me write a BLAST algorithm do perform this task, maybe using BioPython (I have just started learning python)? I don't need long stretches of sequence to align, just the presence of a single gene with a good match in a whole contig would be enough to put it in the 'keep' pile.

    I'm new to bioinformatics and essentially teaching myself so any pointers much appreciated...

    Cheers
    Nathan

  • #2
    Use a program like bowtie2 or bbduk to bin reads.

    Comment


    • #3
      Bowtie2 looks useful, certainly. However, wouldn't this only keep reads that mapped directly to the reference genome, losing some good reads from my genome of interest? I was going along the lines of assembling contigs first and searching within them for matches to the reference.

      Comment


      • #4
        This is a classic case for using BBSplit (http://seqanswers.com/forums/showthread.php?t=41288). Use the cyanobacterial genome(s) as the reference and the reads will be binned automatically. If you need help with the actual command line let us know.

        Comment


        • #5
          BBsplit looks great, thanks! Can't believe I hadn't seen it before, will try it out.
          Cheers
          N

          Comment

          Latest Articles

          Collapse

          • seqadmin
            How RNA-Seq is Transforming Cancer Studies
            by seqadmin



            Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
            09-07-2023, 11:15 PM
          • seqadmin
            Methods for Investigating the Transcriptome
            by seqadmin




            Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

            Whole Transcriptome RNA-seq
            Whole transcriptome sequencing...
            08-31-2023, 11:07 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Yesterday, 06:18 AM
          0 responses
          5 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 09-20-2023, 09:17 AM
          0 responses
          8 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 09-19-2023, 09:23 AM
          0 responses
          25 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 09-19-2023, 09:14 AM
          0 responses
          7 views
          0 likes
          Last Post seqadmin  
          Working...
          X