Header Leaderboard Ad

Collapse

Find SNP in 454HCDiffs.txt

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Find SNP in 454HCDiffs.txt

    Hey there,

    I mapped my reads against a reference consisting of the isotigs of the de novo assembly of the same reads. I'm wondering now if the follwoing approach is really sufficient to detect SNPs in the 454HCDiff.txt:
    - get the summary line of each diff: grep '>' 454HCDiffs.txt
    - check if the start and end position are identical (SNPs need to be at the same position in the reference)
    - check if neither the ref nucleotide nor the var nucleotide is only a gap

    - check if the var nucleotide length is 1

    Regards,
    Thomas
    Last edited by dschika; 01-18-2011, 03:46 AM.

  • #2
    Yes, that approach will give you a list of putative SNP/SNVs.

    But you will want to do further filtering (e.g. on read depth, quality) to get a more trusted set of SNPs.

    Comment


    • #3
      Thanks for your quick reply!

      I thought it would be sufficient to take the 454HCDiffs.txt file, because of the High Confidence. That means that (please see the manual for full details):
      - there must be at least 3 non-duplicate reads with the difference
      - there must be forward and reverse reads with the difference, unless there are at least 5 reads with quality score over 20

      Do you think that those filtering options are still too smooth? Can you perhaps suggest some other values?

      Btw: I added another step in my first post.

      Comment


      • #4
        It will depend on a number of factors. For example if you have greater coverage then you might want to set the read depth cut-off higher. It will depend also on the quality of your reference genome - that might have errors in it. You need to take a view depending on what you are trying to do.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Improved Targeted Sequencing: A Comprehensive Guide to Amplicon Sequencing
          by seqadmin



          Amplicon sequencing is a targeted approach that allows researchers to investigate specific regions of the genome. This technique is routinely used in applications such as variant identification, clinical research, and infectious disease surveillance. The amplicon sequencing process begins by designing primers that flank the regions of interest. The DNA sequences are then amplified through PCR (typically multiplex PCR) to produce amplicons complementary to the targets. RNA targets...
          03-21-2023, 01:49 PM
        • seqadmin
          Targeted Sequencing: Choosing Between Hybridization Capture and Amplicon Sequencing
          by seqadmin




          Targeted sequencing is an effective way to sequence and analyze specific genomic regions of interest. This method enables researchers to focus their efforts on their desired targets, as opposed to other methods like whole genome sequencing that involve the sequencing of total DNA. Utilizing targeted sequencing is an attractive option for many researchers because it is often faster, more cost-effective, and only generates applicable data. While there are many approaches...
          03-10-2023, 05:31 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 01:40 PM
        0 responses
        7 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-29-2023, 11:44 AM
        0 responses
        12 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-24-2023, 02:45 PM
        0 responses
        20 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 03-22-2023, 12:26 PM
        0 responses
        28 views
        0 likes
        Last Post seqadmin  
        Working...
        X