Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #31
    There is a setting in SOAPdenovo that I thought had some influence on this, used when you run 'SOAPdenovo contig' separately.

    -M mergeLevel(default 1,min 0, max 3): the strength of merging similar sequences during contiging

    However, when I experimented with different values it made no difference on the contig assembly results....not sure if it did anything with the 'consensus' base, probably not.

    If you search for 'bubbles' in the Abyss, Velvet and CLC documentation you will find a lot more detail on how they deal with SNPs.

    Comment


    • #32
      A 454 - SSAHA approach

      Just to throw in on the conversation, I pooled genomic DNA from 18 individuals, cut with a 4 base cutter, and sequenced a 15bp size fraction with two full runs of 454 reads (250bp). I assembled them gsAssembler which produced an average 20 reads per contig. Then I mapped the individual reads back to the contig consensus sequences using SSAHA2 and used the SSAHA_pipeline to call SNPs. It worked pretty well - wound up with about 8000 SNPs I could believe in, and the validation rate was about 95%. The predicted allele frequency was strongly correlated (>0.8) with the real allele frequency in the donors. My goal was just basic SNP discovery in a novel species and it fit the bill.

      Caveats - Beware of minor allele freqs near 0.5 which could arise from alignment of reads from duplicated loci; Screen out short tandem repeats because STR allelic differences in the alignment can cause false positive SNPs; Loci with only 4 mapped reads (minimum 2 reads per allele) may be useful but don't count on them.

      Comment


      • #33
        A piece of information,

        We developed a tool, called kisSnp that takes two sets of non assembled raw short reads and compare them for finding SNPs between these two sets.
        It outputs the SNPs with small flanking regions.
        It uses light memory and run in short time.

        All info and download can be found on the dedicated website: http://alcovna.genouest.org/

        Enjoy ! (remarks and comments are welcome)

        Comment


        • #34
          I checked your site quickly, it's very interesting.

          I do have a question though, without a reference won't you be missing all the homozygous variations?

          Also you need long enough reads to generate flanks no, anything smaller dans 50 even 75 wouldn't ne long enough.

          Or am I missing something.

          Comment


          • #35
            With the current version we detect only SNPs between individuals. One compares two set of reads, focusing on small substitutions that may be those SNPs.

            We are currently working on a version intra-individual, that will enable to detect heterozygous SNP of one individual.

            This may be done avoiding the use of a reference genome, if the coverage is sufficient.
            Reads of length 50 to 75 are indeed long enougth.

            Pierre

            Comment


            • #36
              DIAL by Dr. Ratan for SNP without reference genome

              Hi, Everyone:

              I am trying to use DIAL without success for unknown reason, even following exact instructions. So I am wondering if anyone in our community is using the DIAL to get SNP and sharing some experience. I contacted Dr. Ratan at Penn State, but got no response. Any comments on DIAL?

              I have a 454 sequencing run of 8 samples with barcodes each and got individual .sff file. When I perform DIAL by adding each .sff file, it worked sometime, and some time not working. I tested it with the supplied data and it worked for Adding but not working with Update (it returns with $ without error, but I check ps showing no such task).

              Comment


              • #37
                What version of newbler are you using? I tried DIAL and it would very specifically only work with v2.0 and nothing later.

                Comment


                • #38
                  I did give it a trial at 2.0 version by changing the newbler path in my .profile. What I got when I performed DIAL add is: Errors: unable to open sff file. SRR000375.sff (which is one of the test sff file).
                  Last edited by ybfu; 12-06-2010, 01:53 PM.

                  Comment


                  • #39
                    Hi everybody, I wondering to introduce and share the GBS-SNP-CROP:a reference-optional pipeline for SNP discovery and plant germplasm characterization using variable length, paired-end genotyping-by- sequencing data.
                    Recently published on BMC Bioinformatics, this methodology could be useful for population genomic studies in model and non model organism when or not a reference genome is available.

                    Please see the GBS-SNP-CROP GitHub page for more details and UserManual:
                    GBS SNP Calling Reference Optional Pipeline. Contribute to halelab/GBS-SNP-CROP development by creating an account on GitHub.


                    Best regards,
                    Arthur Melo

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Understanding Genetic Influence on Infectious Disease
                      by seqadmin




                      During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

                      Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
                      09-09-2024, 10:59 AM
                    • seqadmin
                      Addressing Off-Target Effects in CRISPR Technologies
                      by seqadmin






                      The first FDA-approved CRISPR-based therapy marked the transition of therapeutic gene editing from a dream to reality1. CRISPR technologies have streamlined gene editing, and CRISPR screens have become an important approach for identifying genes involved in disease processes2. This technique introduces targeted mutations across numerous genes, enabling large-scale identification of gene functions, interactions, and pathways3. Identifying the full range...
                      08-27-2024, 04:44 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Today, 06:25 AM
                    0 responses
                    13 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, Yesterday, 01:02 PM
                    0 responses
                    12 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 09-18-2024, 06:39 AM
                    0 responses
                    14 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 09-11-2024, 02:44 PM
                    0 responses
                    14 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X