  • ABI Paired End Reads

    Hi,
    Does anyone have information on the new ABI paired end reads, especially the file formats and orientations of the reads?

    We are developing a colour space version of Novoalign that's performing pretty well, but we would like to support paired end as well as mate pair reads for the first release, and having some sample data would help a lot. Let us know if you have some reads to share; we don't need many.

    If anyone would like to try the beta of NovoalignCS, just ask.

    Colin

  • #2
    NovoalignCS sounds great. I'm currently comparing the performance of BWA and BioScope on SOLiD transcriptome data (single end, unfortunately). Having Novoalign as a third option would be nice, especially since other people in my group are happy using it on their non-colorspace reads. I always stumble across all kinds of bugs anyway, so volunteering as a beta tester seems obvious.


    • #3
      hi epigen,

      That would be great if you could help. Just email me at colin at novocraft dot com and I'll send you the necessary info.

      Colin


      • #4
        I have found novoalignCS very sensitive and specific when the alignments are evaluated after variant calling (SAMtools). I would definitely recommend testing this alignment tool, although it may be a little slow (like the regular novoalign) if you have 10+ sequencers (talking to you Broad/Baylor/WashU/etc.). Otherwise the multi-threaded version is the way to go, with MPI if you have it installed.

        One issue is that the number of reads from a SOLiD slide is in the hundreds of millions. If you don't have MPI installed and you still want to align the many reads from a SOLiD run in parallel, I have created a script to convert and split the reads after running the "solid2fastq" utility in BFAST. It's a Perl script, but it is renamed with a .txt extension to fool the HTTP server:

        You can use this in a piped fashion:
        Code:
        solid2fastq [options] <csfastas> <quals> | bfast2novo.pl - <out.prefix> <split num>
        Hopefully this will help those with idle machines fill them up with novoalignCS.
        Last edited by nilshomer; 06-16-2010, 11:51 AM. Reason: Must fool the HTTP server
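For readers who just want to see the splitting idea from the post above, here is a minimal sketch in Python. It is not the bfast2novo.pl script and does no format conversion for Novoalign; it only distributes FASTQ records round-robin over a fixed number of output files. The function names, the `<prefix>.<k>.fastq` naming scheme, and the assumption of strict four-line, single-end records are mine, not from the thread. If mates are interleaved in the input, they would need to be kept in the same chunk, which this sketch does not attempt.

```python
import itertools
import sys


def read_fastq(lines):
    """Yield FASTQ records as 4-line tuples (header, sequence, '+', qualities).

    Assumption: every record is exactly four lines (no wrapped sequences).
    """
    it = iter(lines)
    while True:
        record = tuple(itertools.islice(it, 4))
        if not record:
            return
        if len(record) != 4 or not record[0].startswith("@"):
            raise ValueError("malformed FASTQ record: %r" % (record,))
        yield record


def split_fastq(lines, out_prefix, n_chunks):
    """Write records round-robin into n_chunks files.

    Output names follow a hypothetical '<prefix>.<k>.fastq' scheme.
    Returns a dict mapping each output path to its record count.
    """
    paths = ["%s.%d.fastq" % (out_prefix, k) for k in range(n_chunks)]
    counts = [0] * n_chunks
    handles = [open(p, "w") for p in paths]
    try:
        for i, record in enumerate(read_fastq(lines)):
            k = i % n_chunks
            handles[k].writelines(record)
            counts[k] += 1
    finally:
        for h in handles:
            h.close()
    return dict(zip(paths, counts))


# Command-line use: read FASTQ from stdin, given <out.prefix> <split num>.
if __name__ == "__main__" and len(sys.argv) == 3:
    prefix, n = sys.argv[1], int(sys.argv[2])
    for path, count in split_fastq(sys.stdin, prefix, n).items():
        print(path, count, file=sys.stderr)
```

Saved under a hypothetical name such as split_fastq.py, it could sit at the end of the same kind of pipe shown in the post: `solid2fastq [options] <csfastas> <quals> | python split_fastq.py <out.prefix> <split num>`. Each resulting chunk can then be aligned on a separate machine.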


        • #5
          A data set is available on solidsoftwarecommunity.com.


          • #6
            Hmmm, why not approach ABI for the sample data sets?
            With SOLiD4 being the new standard, you might get different results if you used old datasets.
            http://kevin-gattaca.blogspot.com/


            • #7
              Thanks for the help and suggestions. We managed to get a few files of colour space paired end reads, and the latest versions of NovoalignCS now fully support paired end and mate pair reads.

              Colin
