Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Aligning whole genome (FASTA) to reference genome

    Hello all,

    I previously aligned several fastq files (sample.merged.dedup.realn.bam) to a reference genome using bwa-mem.
    Now I want to align a different species sample to the same reference genome, but this file is a whole genome sequence in fasta format.

    I found this thread: http://seqanswers.com/forums/showthread.php?t=35608

    Which made me tempted to use bowtie2 like this:

    Code:
    bowtie2 -p 4 -f -x reference -U genome.fasta -S genome.aligned.sam
    However I'm not sure I can do this or if I should try using a different tool/method.
    After aligning and converting to bam format, I want to do variant calling for the whole dataset.

    Thank you in advance,
    Maria

  • #2
    I think it would be better to use minimap2 with the asm5 asm10 asm20 settings, see


    Preset:
    -x STR preset (always applied before other options; see minimap2.1 for details) []
    - map-pb/map-ont: PacBio/Nanopore vs reference mapping
    - ava-pb/ava-ont: PacBio/Nanopore read overlap
    - asm5/asm10/asm20: asm-to-ref mapping, for ~0.1/1/5% sequence divergence
    - splice: long-read spliced alignment
    - sr: genomic short-read mapping

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      04-22-2024, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Yesterday, 08:47 AM
    0 responses
    12 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    60 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    59 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    54 views
    0 likes
    Last Post seqadmin  
    Working...
    X