Seqanswers Leaderboard Ad



No announcement yet.
  • Filter
  • Time
  • Show
Clear All
new posts

  • Target enrichment performance

    Dear All,

    I would like to announce the inception of NGSrich, a software for evaluation of target enrichment performance in Illumina next-generation sequencing. An early release of the code has been uploaded to SourceForge at

    but we're still working on a Java version. Regards,

    Peter Frommolt
    University of Cologne

  • #2
    Java version of NGSrich

    Dear All,

    we have now prepared a fully-functional Java version of NGSrich which allows you to do a quick and detailed performance check for your target-enriched resequencing projects. We are using this as part of an exome analysis pipeline in our medium-sized genome center.

    The reports can be integrated into a webserver in a very efficient and user-friendly way. You should definitely download this and give it a try!

    Peter and Ali


    • #3
      Dear Peter and Ali,
      Your tool sounds great! I just have a question about its usage.
      I get an error when trying it out, but this might be because I probably specified the wrong file for the parameter -a or -g. What kind of file is the "genome annotation" supposed to be and where can I get it? I specified the genome as a fasta file, but I am pretty sure that's not what is needed, right?
      Can't wait to get this working, thanks for your help!


      • #4
        The genome annotation parameter is not supposed to be a file. You just need to specify the UCSC version number, e.g. 'hg19' and the software will download the correct annotation from the internet. Are you using data from a human sample?



        • #5
          Yes, my data are from human samples. But I get the following error:
          >>> STEP 1: reducing files

          Exception in thread "main" java.lang.ArrayIndexOutOfBoundsException: Array index out of range: 3
          at java.util.Vector.get(
          at adapter.Adapter.field(
          at adapter.readAdapter.SamAdapter.pos(
          at adapter.readAdapter.SamAdapter.adapt(
          at Enrichment.reduceFiles(
          at NGSrich.main(


          • #6
            Okay, which syntax are you using for the analysis? Is your read alignment file in SAM format?

            Regards, Peter


            • #7
              Thanks for the quick reply!
              I am using the following syntax:
              java NGSrich -r /path/to/file.sam -a hg18 -T /path/to/temp_folder -t /path/to/sure_select_targets.bed
              The sam file was generated with bwa and I am in the bin folder of NGSrich.


              • #8
                Do you have a header section in your SAM file? Could you please try whether it works after removal of those lines?



                • #9
                  what is the difference between -a and -g options for the annotation?

                  And where does the annotation come from, the UCSC track?

                  I ask because I have UCSC track for hg18 already downloaded.

                  Thanks for sharing your program. I am looking forward to trying it out.


                  • #10
                    The -a and -g flags are equivalent, so you can choose any one of these. The annotation comes from the UCSC track but the download usually finishes within seconds, so you do not need to worry about this.


                    • #11
                      Originally posted by pfrommolt View Post
                      The -a and -g flags are equivalent, so you can choose any one of these. The annotation comes from the UCSC track but the download usually finishes within seconds, so you do not need to worry about this.

                      It is a shame that I am behind a firewall - would it be possible to have the UCSC track file stored locally and accessed by NGSrich instead of downloaded?


                      • #12
                        I tried it again after removing the header of my samfile and the previous error is gone, but now I get the following:

                        >>> STEP 1: reducing files

                        READS FILE:
                        /home/bpetersen/exome_9_A0019.sam was reduced to /home/bpetersen/temp/1305783668487/NGSrich_exome_9_A0019_cl008.27100.txt
                        Reduced file /home/bpetersen/temp/1305783668487/NGSrich_exome_9_A0019_cl008.27100.txt sorted

                        GENOME ANNOTATION FILE:
                        /home/bpetersen/temp/1305783668487/refGene.genome reduced to /home/bpetersen/temp/1305783668487/NGSrich_genome_cl008.27100.txt

                        TARGET REGIONS FILE:
                        /home/bpetersen/temp/1305783668487/ reduced to /home/bpetersen/temp/1305783668487/NGSrich_target_cl008.27100.txt
                        Reduced file /home/bpetersen/temp/1305783668487/NGSrich_target_cl008.27100.txt sorted

                        STEP 1 successfully completed

                        >>> STEP 2: computing target coverage data

                        Starting computing target coverage files.
                        Mean target coverage data computed (/home/bpetersen/NGSricCreating coverage barplot for chr21 ... ready.Creating coverage barplot for chr21_random ...
                        HTML FILE /home/bpetersen/NGSrich_test/exome_9_A0019_enrichment.html not founddy.Creating coverage barplot for chr21 ...
                        Coverage summary computed (/hCreating coverage barplot for chr2 ... ready.Creating coverage barplot for chr20 ...
                        STEP 3 unsuccessfulting coverage barplot for chr19 ... ready.Creating coverage barplot for chr2 ...
                        Creating coverage barplot for chr18 ... ready.Creating coverage barplot for chr19 ... Creating coverage barplot for chr17_random ... ready.Creating coverage barplot for chr18 .=====================4========================= Creating coverage barplot for chr17 ... ready.Creating coverage barplot for chr17_random ...
                        >>> STEP 4: computing targets wiggle data====== Creating coverage barplot for chr16 ... ready.Creating coverage barplot for chr17 ...
                        >>> STEP 3: evaluating enrichment files Creating coverage barplot for chr15 ... ready.Creating coverage barplot for chr16 ...
                        Start computing target-based wiggle data Creating coverage barplot for chr14 ... ready.Creating coverage barplot for chr15 ...
                        Details File Name: /home/bpetersen/temp/1305783668487/coverage_cl008.27100.txtlot for chr13 ... ready.Creating coverage barplot for chr14 ...
                        Output Dir: /home/bpetersen/NGSrich_test/data xml Creating coverage barplot for chr12 ... ready.Creating coverage barplot for chr13 ...
                        End of computing target-based wiggle datang coverage barplot for chr11 ... ready.Creating coverage barplot for chr12 ...
                        XML summary file: /hCreating coverage barplot for chr10 ... ready.Creating coverage barplot for chr11 ...
                        STEP 4 successfully completed.t for chr1 ... ready.Creating coverage barplot for chr10 ...
                        Preparing coverage barplots ... ready.Creating coverage barplot for chr1 ...
                        =====================5=========================overage barplots ...
                        >>> STEP 5: computing overall wiggle dataage pieplot ...
                        Reading XML file ... ready.Reading BED file ...
                        Start computing overall wiggle data
                        Align File Name: /home/bpetersen/temp/1305783668487/NGSrich_exome_9_A0019_cl008.27100.txt
                        Output Dir: /home/bpetersen/NGSrich_test/data
                        /home/bpetersen/NGSrich_test/exome_9_A0019_enrichment.wignot found

                        STEP 5 unsuccessful


                        The plots were successfully generated but not the html file.
                        Any idea why this might be?
                        I think it would be great if you could get NGSrich to work with samfiles with header or even better with bamfiles, is this planned anytime soon?
                        Last edited by bpetersen; 05-18-2011, 11:21 PM.


                        • #13
                          Yes, we have already prepared a version which can handle the header section adequately. This is coming very soon.

                          Did your run create a BED and XML file in the 'data' directory? If so, could you email these to my address given in the README file?

                          Does the SureSelect target file have more than one BED track? You should provide a BED file with only one track.



                          • #14
                            Version 0.4.2 uploaded to SourceForge!

                            Recent changes:
                            -> SAM header sections are skipped
                            -> unassembled contigs with less than 3 genes are ignored
                            -> Bug in the computation of coverage statistics was removed


                            • #15

                              Could you add a feature to allow the user to point to an annotation file instead of download it everytime?


                              Latest Articles


                              • seqadmin
                                Latest Developments in Precision Medicine
                                by seqadmin

                                Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

                                Somatic Genomics
                                “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
                                05-24-2024, 01:16 PM
                              • seqadmin
                                Recent Advances in Sequencing Analysis Tools
                                by seqadmin

                                The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
                                05-06-2024, 07:48 AM





                              Topics Statistics Last Post
                              Started by seqadmin, 05-24-2024, 07:15 AM
                              0 responses
                              Last Post seqadmin  
                              Started by seqadmin, 05-23-2024, 10:28 AM
                              0 responses
                              Last Post seqadmin  
                              Started by seqadmin, 05-23-2024, 07:35 AM
                              0 responses
                              Last Post seqadmin  
                              Started by seqadmin, 05-22-2024, 02:06 PM
                              0 responses
                              Last Post seqadmin