Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • GSNAP output to IGV

    I aligned Illumina reads to the zebrafish v9 reference genome using GSNAP. I had to build a new genome database in GSNAP/GMAP from individual fastA files containing each of the 25 chromosomes. After running GSNAP, I used Samtools to convert the .sam output into a .bam, then sorted the .bam and generated an index file .bai. I want to visualize the alignment using IGV, but I am unsure where to find the correctly-formatted reference file. When I try the fasta files for each chromosome that I used to build the genome database in GSNAP, I receive an error in IGV, saying "sequence.bam does not contain any sequence names which match the current genome". Is the correct genome file found somewhere in the genome database that GSNAP built?

  • #2
    I encounter that error a lot. Sometimes it's due to sequences with names containing whitespace, which some programs will truncate and others won't. Look at the header of the sam file and the names in the fasta file and make sure they match.

    I recommend that you first concatenate all of the fasta files into a single fasta file, and use that to build an index for mapping, and as input for IGV.

    Comment


    • #3
      samtools view -h will print the header of your BAM file. Infoseq from emboss or grep "^>" will print out the header of the reference file. You can see how to modify your reference file to fit to your BAM file.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Best Practices for Single-Cell Sequencing Analysis
        by seqadmin



        While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
        06-06-2024, 07:15 AM
      • seqadmin
        Latest Developments in Precision Medicine
        by seqadmin



        Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

        Somatic Genomics
        “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
        05-24-2024, 01:16 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Yesterday, 07:24 AM
      0 responses
      11 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 06-13-2024, 08:58 AM
      0 responses
      11 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 06-12-2024, 02:20 PM
      0 responses
      16 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 06-07-2024, 06:58 AM
      0 responses
      184 views
      0 likes
      Last Post seqadmin  
      Working...
      X