Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • how do i find a specific sequence in a bam file?

    I want to extract special sequences out of my bam-file (reference-mapped with BWA).

    normally i do that with blast or blat, but this time i have a bam file, not a ready-to-use genome...
    do i have to assemble the mapped reads into a consensus sequence in bevore, or is it possible to first (1.) identify the respective scaffold via the reference-genome with blat, (2.) assemble the reads that mapped to this scaffold and then (3.) aligne my sequence to that assembled scaffold?Or is this idea totally stupid? :/

    I have never done an assembly so far. Iam really unsure what is the right way here...

    Which tool would you suggest for assembling a bam-file, when dealing with genomes of >2 GB? And how shoud I care fore heterozygous positions?


    so many thanks in advance,
    hope anyone can give me here some help

  • #2
    It is hard for me to understand what you need. It appears that you already have reference-mapped data thus denovo assembly does not seem to required. If instead you are asking either (a) how to extract reads of a certain region or (b) how to call SNPs/Indels then you should look at the samtools 'mpileup' command. See http://samtools.sourceforge.net/mpileup.shtml for a starting place on mpileup.

    Comment


    • #3
      Have you tried to convert your BAM file into a SAM or Fasta?

      Use in the command line:

      $ samtools view -h -o out.sam in.bam
      The SAM file will provide the mapped reads into a specific scaffold. There you can retrieve your reads. Then you can assemble the reads

      Comment


      • #4
        first of all, thanks for answering!

        sorry for my imprecise question.

        What i have done so far is a reference-mapping of 5 genomes. i used BWA for that. everything worked well.

        Now i want to have the sequence of ~25 genes outof these 5 inidividuals. And i am not really sure how to do it.
        I have no experience how much time it takes to assemble the reads of these 5 individuals to a consesus sequence... therefore i thougth: maybe it is enought to just assemble the parts of the individuals in which the genes are lying...

        @amarth: which program do you suggest for the assembly. And how should i care for heterozygous sites? Shoud I use ambiguity code in the final gene sequence, or "Ns"?

        Thanks!

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Non-Coding RNA Research and Technologies
          by seqadmin




          Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

          Nobel Prize for MicroRNA Discovery
          This week,...
          10-07-2024, 08:07 AM
        • seqadmin
          Recent Developments in Metagenomics
          by seqadmin





          Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
          09-23-2024, 06:35 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 02:44 PM
        0 responses
        7 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 10-11-2024, 06:55 AM
        0 responses
        14 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 10-02-2024, 04:51 AM
        0 responses
        110 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 10-01-2024, 07:10 AM
        0 responses
        117 views
        0 likes
        Last Post seqadmin  
        Working...
        X