Unconfigured Ad

**RDW** · 06-13-2012, 09:51 AM

With a few lines of Perl (or equivalent) you can convert the MuTect output into something that ANNOVAR can read - I think ANNOVAR needs explicit start and end positions for the variant, even if these are identical (as with a SNP), so you'll need to duplicate MuTect's single variant position column:

404 Not Found

http://www.openbioinformatics.org/annovar/annovar_input.html

ANNOVAR only cares about the first 5 columns of data, but can (optionally) retain the other columns in its output, which can be useful.

**shyam_la** · 06-13-2012, 12:19 PM

Originally posted by RDW View Post

With a few lines of Perl (or equivalent) you can convert the MuTect output into something that ANNOVAR can read - I think ANNOVAR needs explicit start and end positions for the variant, even if these are identical (as with a SNP), so you'll need to duplicate MuTect's single variant position column:

404 Not Found

http://www.openbioinformatics.org/annovar/annovar_input.html

ANNOVAR only cares about the first 5 columns of data, but can (optionally) retain the other columns in its output, which can be useful.

I am a MD, not a Bioinformatician proper and writing a few lines of perl is beyond me. I would have given ANNOVAR a shot, if I had a .vcf file as output from MuTect..

In any case, I used SNPeff - it accepts text input now but support unfortunately is soon to be discontinued in later builds. I retained only the first 4 columns of MuTect output and it did a great job..

Thank you.

Q: Can IGV display annotated tab delimited text files graphically? Any other software recommendations for visualisation?

**krawitz** · 06-14-2012, 09:13 AM

Hi Shyam_la,
GeneTalk is designed for non Bioinformaticians analyzing human sequence variants. So this might be an option for you. Who gave you the data? There is a standard format for reporting variants in NGS resequencing projects. It is called variant call format, vcf. Usually your sequencing facility can provide your data in this format. The rest is automatically done in GeneTalk. you just upload the data and the annotation will be done automatically in the background. A tutorial video tutorial explains how to filter.
best,
peter

**RDW** · 06-14-2012, 09:21 AM

Originally posted by shyam_la View Post

I am a MD, not a Bioinformatician proper and writing a few lines of perl is beyond me. I would have given ANNOVAR a shot, if I had a .vcf file as output from MuTect.

ANNOVAR can also accept text files - see my link. You just need an extra column that duplicates the variant position (since for SNPs, the start position is the same as the end). Any program that can manipulate tab-delimited text files can handle this, even Excel (just check that the first five columns don't get mangled!).

**JackieBadger** · 06-14-2012, 03:22 PM

BLAST2GO program

**shyam_la** · 06-14-2012, 08:52 PM

Originally posted by krawitz View Post

Hi Shyam_la,
GeneTalk is designed for non Bioinformaticians analyzing human sequence variants. So this might be an option for you. Who gave you the data? There is a standard format for reporting variants in NGS resequencing projects. It is called variant call format, vcf. Usually your sequencing facility can provide your data in this format. The rest is automatically done in GeneTalk. you just upload the data and the annotation will be done automatically in the background. A tutorial video tutorial explains how to filter.
best,
peter

Hi Peter,

Our sequencing facility provides only the raw reads. Im doing all the downstream analyses. I have perfected my pipeline upto annotation and my mutation caller MuTect provides only text output at the moment, as it is in beta stage (but provides excellent mutation calls in my opinion). vcf is not an option now..
Thank you anyway.

**shyam_la** · 06-14-2012, 08:58 PM

Originally posted by RDW View Post

ANNOVAR can also accept text files - see my link. You just need an extra column that duplicates the variant position (since for SNPs, the start position is the same as the end). Any program that can manipulate tab-delimited text files can handle this, even Excel (just check that the first five columns don't get mangled!).

Oh, thank you! I had assumed for another source that annovar was not capable of that.. Will give it a try too now!!

**shyam_la** · 06-14-2012, 09:01 PM

Originally posted by JackieBadger View Post

BLAST2GO program

Just checked out their home page.. Sounds good! Will give it a shot right away..

**JackieBadger** · 06-15-2012, 03:06 AM

Originally posted by shyam_la View Post

Just checked out their home page.. Sounds good! Will give it a shot right away..

If you want to know if these SNPs are non-synonymous (and you do not know the reading frame) you should use the tBLASTx against the nr database. Then once you have these results you can get GOannotations. You can also predict open reading frames using the OrfFinder program.

Topics	Statistics	Last Post
UC San Diego Bioengineers Map Gene Function in Human Stem Cells by SEQadmin2 Started by SEQadmin2, 07-13-2026, 10:26 AM	0 responses 25 views 0 reactions	Last Post by SEQadmin2 07-13-2026, 10:26 AM
New Analysis Splits Leukemia Into 16 Epigenomic Subgroups by SEQadmin2 Started by SEQadmin2, 07-09-2026, 10:04 AM	0 responses 35 views 0 reactions	Last Post by SEQadmin2 07-09-2026, 10:04 AM
Genome-Wide CRISPR Screen Uncovers Unlikely Psoriasis Target by SEQadmin2 Started by SEQadmin2, 07-08-2026, 10:08 AM	0 responses 22 views 0 reactions	Last Post by SEQadmin2 07-08-2026, 10:08 AM
Engineered Protein Motor Takes Its First Steps Along DNA Track by SEQadmin2 Started by SEQadmin2, 07-07-2026, 11:05 AM	0 responses 34 views 0 reactions	Last Post by SEQadmin2 07-07-2026, 11:05 AM

Unconfigured Ad

Need help annotating..

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News