Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • annovar output - the meaning of REJECT?

    Hi,

    I want to know the meaning of 'REJECT' of output file of ANNOVAR like follow:

    line2 nonsynonymous SNV MLH1:NM_001167619:exon11:c.T428A.V143D,MLH1:NM_001167618:exon12:c.T428A.V143D,MLH1:NM_000249:exon12:c.T1151A.V384D,MLH1:NM_001167617:exon12:c.T857A.V286D, 3 37067240 37067240 T A tissue blood 0 DBSNP UNCOVERED 0 0.120436 0 0 0 0 9.276976 0.363636 0.02 -0.096427 7 4 181 105 0 0 AT -20.427446 3 7 84 201 REJECT

    Thank you!

  • #2
    ANNOVAR keeps all of the input columns and puts them at the end of the annotation columns. "REJECT' was in the input file.

    Comment


    • #3
      It appears you have used MuTect to call your SNVs. The current beta version of MuTect is very verbose and the initial output is more than a hundred fold bigger than it should be.

      use "grep -v REJECT variant_file > variant_file_KEEP" to get the real variants and then annotate.
      You may also want to remove "UNCOVERED" variants and "DBSNP" variants.. The example variant you have given is a total failure..

      Comment


      • #4
        thank you

        Thank you!!!!

        I see it now because the alarm mail was in spam box...

        another question!

        is there any option of MuTect to get output with VCF format???

        Comment


        • #5
          No, not currently, as far as I know.. May I ask why you specifically want it in vcf format?

          Comment


          • #6
            because in my pipeline, i use ANNOVAR tool for annotation step and ANNOVAR gets input file with VCF format

            I know I can change some column to make input file(vcf4.0 format) for ANNOVAR,

            but I also want to bring other info as i wrote above.

            however, after changing the form of MuTect result file to vcf4.0 file, the ANNOVAR result file is not uniform according to each line of MuTect result file, so it's hard to see it.

            it's the reason

            Comment


            • #7
              The annovar input file only needs 5 columns; you can check the website to see the proper format (it's chr/base/base/ref/var for SNPs, check for indels and make sure it makes sense to you). Once you figure that out it should be easy to take your variant files and add 5 initial columns with the correct information; the rest will carry through so you won't lose anything.

              Comment


              • #8
                I tried using MuTect output .bam.out to convert into avinput, however, the format was not proper and I got no annotations after annovar. Moreover, when I used sed '/REJCT/d' input.bam.out > output_file ; it worked and there was no problem. But I want to include 'REJCT' also and filter it out later after annovar is done. Is there something wrong with my commands. The same commands worked after removing 'REJCT' but not working with 'REJECT'.
                My commands are:
                $ perl ./annovar/convert2annovar.pl -format vcf4 call_status.bam.out > call_status.bam.avinput

                $ perl ./annovar/annotate_variation.pl --geneanno call_status.bam.avinput ./annovar/humandb --buildver hg19

                $ perl ./annovar/summarize_annovar.pl call_status.bam.avinput ./annovar/humandb/ --buildver hg19 --verdbsnp 137 --ver1000g 1000g2012apr --veresp 6500 --genetype refgene --outfile annovar_snp_annotated.txt --remove
                Can somebody tell something ?
                thank you in advance.

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Essential Discoveries and Tools in Epitranscriptomics
                  by seqadmin




                  The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                  Yesterday, 07:01 AM
                • seqadmin
                  Current Approaches to Protein Sequencing
                  by seqadmin


                  Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                  04-04-2024, 04:25 PM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, 04-11-2024, 12:08 PM
                0 responses
                55 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-10-2024, 10:19 PM
                0 responses
                52 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-10-2024, 09:21 AM
                0 responses
                45 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 04-04-2024, 09:00 AM
                0 responses
                55 views
                0 likes
                Last Post seqadmin  
                Working...
                X