Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • MattB
    Member
    • Aug 2008
    • 35

    #31
    There is a setting in SOAPdenovo that I thought had some influence on this, used when you run 'SOAPdenovo contig' separately.

    -M mergeLevel(default 1,min 0, max 3): the strength of merging similar sequences during contiging

    However, when I experimented with different values it made no difference on the contig assembly results....not sure if it did anything with the 'consensus' base, probably not.

    If you search for 'bubbles' in the Abyss, Velvet and CLC documentation you will find a lot more detail on how they deal with SNPs.

    Comment

    • Boonie
      Junior Member
      • Mar 2009
      • 6

      #32
      A 454 - SSAHA approach

      Just to throw in on the conversation, I pooled genomic DNA from 18 individuals, cut with a 4 base cutter, and sequenced a 15bp size fraction with two full runs of 454 reads (250bp). I assembled them gsAssembler which produced an average 20 reads per contig. Then I mapped the individual reads back to the contig consensus sequences using SSAHA2 and used the SSAHA_pipeline to call SNPs. It worked pretty well - wound up with about 8000 SNPs I could believe in, and the validation rate was about 95%. The predicted allele frequency was strongly correlated (>0.8) with the real allele frequency in the donors. My goal was just basic SNP discovery in a novel species and it fit the bill.

      Caveats - Beware of minor allele freqs near 0.5 which could arise from alignment of reads from duplicated loci; Screen out short tandem repeats because STR allelic differences in the alignment can cause false positive SNPs; Loci with only 4 mapped reads (minimum 2 reads per allele) may be useful but don't count on them.

      Comment

      • pierre350d
        Junior Member
        • Nov 2008
        • 7

        #33
        A piece of information,

        We developed a tool, called kisSnp that takes two sets of non assembled raw short reads and compare them for finding SNPs between these two sets.
        It outputs the SNPs with small flanking regions.
        It uses light memory and run in short time.

        All info and download can be found on the dedicated website: http://alcovna.genouest.org/

        Enjoy ! (remarks and comments are welcome)

        Comment

        • lletourn
          Member
          • Oct 2009
          • 63

          #34
          I checked your site quickly, it's very interesting.

          I do have a question though, without a reference won't you be missing all the homozygous variations?

          Also you need long enough reads to generate flanks no, anything smaller dans 50 even 75 wouldn't ne long enough.

          Or am I missing something.

          Comment

          • pierre350d
            Junior Member
            • Nov 2008
            • 7

            #35
            With the current version we detect only SNPs between individuals. One compares two set of reads, focusing on small substitutions that may be those SNPs.

            We are currently working on a version intra-individual, that will enable to detect heterozygous SNP of one individual.

            This may be done avoiding the use of a reference genome, if the coverage is sufficient.
            Reads of length 50 to 75 are indeed long enougth.

            Pierre

            Comment

            • ybfu
              Junior Member
              • Apr 2010
              • 2

              #36
              DIAL by Dr. Ratan for SNP without reference genome

              Hi, Everyone:

              I am trying to use DIAL without success for unknown reason, even following exact instructions. So I am wondering if anyone in our community is using the DIAL to get SNP and sharing some experience. I contacted Dr. Ratan at Penn State, but got no response. Any comments on DIAL?

              I have a 454 sequencing run of 8 samples with barcodes each and got individual .sff file. When I perform DIAL by adding each .sff file, it worked sometime, and some time not working. I tested it with the supplied data and it worked for Adding but not working with Update (it returns with $ without error, but I check ps showing no such task).

              Comment

              • natstreet
                Member
                • Nov 2009
                • 83

                #37
                What version of newbler are you using? I tried DIAL and it would very specifically only work with v2.0 and nothing later.

                Comment

                • ybfu
                  Junior Member
                  • Apr 2010
                  • 2

                  #38
                  I did give it a trial at 2.0 version by changing the newbler path in my .profile. What I got when I performed DIAL add is: Errors: unable to open sff file. SRR000375.sff (which is one of the test sff file).
                  Last edited by ybfu; 12-06-2010, 01:53 PM.

                  Comment

                  • arthurmelo
                    Member
                    • Jul 2012
                    • 19

                    #39
                    Hi everybody, I wondering to introduce and share the GBS-SNP-CROP:a reference-optional pipeline for SNP discovery and plant germplasm characterization using variable length, paired-end genotyping-by- sequencing data.
                    Recently published on BMC Bioinformatics, this methodology could be useful for population genomic studies in model and non model organism when or not a reference genome is available.

                    Please see the GBS-SNP-CROP GitHub page for more details and UserManual:
                    GBS SNP Calling Reference Optional Pipeline. Contribute to halelab/GBS-SNP-CROP development by creating an account on GitHub.


                    Best regards,
                    Arthur Melo

                    Comment

                    Latest Articles

                    Collapse

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, 06-05-2026, 10:09 AM
                    0 responses
                    14 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-04-2026, 08:59 AM
                    0 responses
                    29 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-02-2026, 12:03 PM
                    0 responses
                    33 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-02-2026, 11:40 AM
                    0 responses
                    23 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...