Unconfigured Ad

Collapse
This topic is closed.
X
This is a sticky topic.
X
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • odysseus
    Junior Member
    • Aug 2009
    • 8

    Thanks for your useful post.

    I’m interested in ABySS and I installed it on my linux server.
    However, I knew it does not use a reference genome, as it is a de novo assembler.
    I think it should be moved from "Align/Assemble to a reference" category to "De novo Align/Assemble" category.

    Cheers! ^^

    Comment

    • sci_guy
      Member
      • Jan 2008
      • 83

      Originally posted by odysseus View Post
      ...I think it should be moved from "Align/Assemble to a reference" category to "De novo Align/Assemble" category.
      Done. Thanks

      Comment

      • Jose Blanca
        Member
        • Aug 2009
        • 70

        Great list!
        I'd like to introduce some code I'm working on. I don't know if it deserves to go into the list, but it might be of some use for someone. Is not finished yet, but if someone is interested in trying it out or in working on it just let me know.
        You can find it at biolib.
        It's a library and a set of script targeted to NGS. There are modules to:
        - clean sequences (sanger, 454, ilumina).
        - parse caf, ace and bowtie map files.
        - clean and filter contigs.
        - look for snps and indels.
        - filter snps.
        - do statistics for: reads, contigs and snps.

        Be aware that it's just a work in progress and its in constant flux. The code is available under the AGPL.

        Comment

        • dan
          wiki wiki
          • Jul 2008
          • 194

          Beta version of the SEQwiki software database up and running...

          Check it out...





          Dan.
          Homepage: Dan Bolser
          MetaBase the database of biological databases.

          Comment

          • apfejes
            Senior Member
            • Feb 2008
            • 236

            Hey, that's awesome - Great Job!
            The more you know, the more you know you don't know. —Aristotle

            Comment

            • Dinny
              Junior Member
              • Aug 2009
              • 3

              Hi all,

              Thanks for the helpful list. Based on reading the post I've tried out Bowtie to align reads from a ChIP-seq experiment run on Illumina GA-II (the facility gave me non-aligned fastq). Worked great!

              I want to put the output into cisGenome or FindPeaks (or both). Can anyone advise me how to get it into the right format? Does FindPeaks convert any of the output options in Bowtie?

              Thanks

              Comment

              • apfejes
                Senior Member
                • Feb 2008
                • 236

                Hi Dinny,

                Findpeaks can use bowtie's reads - but it depends if you want to use PET or SET tags. if you're using PET, you'll need to convert your reads into BED format, which is the only way that you can retain the pairing information. If you're using SET, you should be able to bet Bowtie to produce a .map file (if I recall correctly), which can then be processed natively by FindPeaks.

                If you need help with findPeaks, you can always send an email to the mailing list - I tend to reply more quickly to those inquiries than those on SeqAnswers.

                Cheers,
                Anthony
                The more you know, the more you know you don't know. —Aristotle

                Comment

                • ewilbanks
                  Member
                  • Mar 2009
                  • 83

                  I don't think that CisGenome has support for bowtie output yet. Best bet would be to find scripts to convert to .bed or .aln

                  Comment

                  • Dinny
                    Junior Member
                    • Aug 2009
                    • 3

                    Thanks very much Anthony and ewilbanks.
                    I looked closer at Bowtie conversion tools and I can create a .map file from the alignment (I'm working with single-end reads), but I have to get Maq working to do it. I'll give it a go, and compare Maq alignment times while I'm at it ;-)
                    Cheers,
                    Dinny

                    Comment

                    • apfejes
                      Senior Member
                      • Feb 2008
                      • 236

                      Hi Dinny,

                      I think there is also a way to get Bowtie to produce it's own alignment in .map format, if I recall correctly.

                      Anyhow, if you're working with SET, you can also convert the bowtie reads directly to .bed: https://sourceforge.net/apps/mediawi...e=ConvertToBed
                      The more you know, the more you know you don't know. —Aristotle

                      Comment

                      • jsun529
                        Member
                        • Apr 2009
                        • 52

                        what is the definition of the header line in SOLiD csfasta file, eg
                        1_88_1830_R3 -- What is 88, 1830 stands for ? Thanks

                        Comment

                        • nilshomer
                          Nils Homer
                          • Nov 2008
                          • 1283

                          Originally posted by jsun529 View Post
                          what is the definition of the header line in SOLiD csfasta file, eg
                          1_88_1830_R3 -- What is 88, 1830 stands for ? Thanks
                          See http://seqanswers.com/forums/showthread.php?p=7572.

                          Comment

                          • Dinny
                            Junior Member
                            • Aug 2009
                            • 3

                            Hi Anthony,
                            Thanks again for the advice. Taking the reads directly into .bed would be better. The .map converter in Bowtie needs a library file created in Maq, so it would be easier to limit the number of applications the data goes through...less opportunity to completely jumble it.
                            Couldn't see a way to align straight to .map, but I'll look again.
                            Dinny

                            Comment

                            • Ka123$
                              Member
                              • Jul 2009
                              • 27

                              Solexa findpeaks

                              Using Findpeaks sort reads on bowtie mapped alignment is taking up too much memory......!!!!! So I am trying using the GERALD maps reads directly from solexa to convert to wig files...I believe the solexa GERALD mapped alignments are ELAND format?
                              So the aligner type will be -aligner eland, to perform separateReads.jar?
                              Any suggestions?

                              Comment

                              • apfejes
                                Senior Member
                                • Feb 2008
                                • 236

                                Hi Ka123,

                                There are other ways to do the sort - including several methods you could try from the linux command line. However, I'm really not sure why it's taking so memory. Could you give me a few ideas as to what your work flow is?

                                In the meantime, documentation and an example command for SeparateReads can be found here:

                                Download Vancouver Short Read Analysis Package for free. This package contains code for use with Short Read DNA Sequencing technologies, and includes packages for ChIP-Seq, Whole Transcriptome Shotgun Sequencing, Whole Genome Shotgun Sequencing, SNP Detection, Transcript expression and file conversion.
                                The more you know, the more you know you don't know. —Aristotle

                                Comment

                                Latest Articles

                                Collapse

                                • SEQadmin2
                                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                  by SEQadmin2


                                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                  ...
                                  06-02-2026, 10:05 AM
                                • SEQadmin2
                                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                                  by SEQadmin2


                                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                                  Introduction

                                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                                  05-22-2026, 06:42 AM
                                • SEQadmin2
                                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                                  by SEQadmin2

                                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                                  05-06-2026, 09:04 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by SEQadmin2, 06-02-2026, 12:03 PM
                                0 responses
                                21 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 11:40 AM
                                0 responses
                                14 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-28-2026, 11:40 AM
                                0 responses
                                29 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-26-2026, 10:12 AM
                                0 responses
                                31 views
                                0 reactions
                                Last Post SEQadmin2  
                                Working...